Creating And Digitizing Language Corpora

Creating and Digitizing Language Corpora

Author : Joan C. Beal, Karen P. Corrigan, Hermann L. Moisl
Publisher : Unknown
Page : 128 pages
File Size : 43,7 Mb
Release : 2007
Category : Electronic
ISBN : OCLC:909789856

Creating and Digitizing Language Corpora

Author : Karen P. Corrigan, Adam Mearns
Publisher : Springer
Page : 359 pages
File Size : 49,8 Mb
Release : 2016-09-19
Category : Language Arts & Disciplines
ISBN : 9781137386458

This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to developing standards for enriching and preserving other types of corpus data, such as data that capture the nuances of regional dialects. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.

Creating and Digitizing Language Corpora

Author : J. Beal, K. Corrigan, H. Moisl
Publisher : Springer
Page : 250 pages
File Size : 49,5 Mb
Release : 2007-07-12
Category : Language Arts & Disciplines
ISBN : 9780230223202

A range of electronic corpora has become accessible via the WWW and CD-ROM. This coincides with improvements in standards governing the collecting, encoding and archiving of such data. This book develops similar standards for enriching and preserving 'unconventional' data: the fragmentary texts and voices left to us as accidents of history.

Creating and Digitizing Language Corpora

Author : Karen P. Corrigan, Adam Mearns
Publisher : Palgrave Macmillan
Page : 359 pages
File Size : 40,9 Mb
Release : 2016-09-27
Category : Language Arts & Disciplines
ISBN : 1137386444

This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to developing standards for enriching and preserving other types of corpus data, such as data that capture the nuances of regional dialects. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.

Creating and Digitizing Language Corpora

Author : J. Beal, K. Corrigan, H. Moisl
Publisher : Palgrave Macmillan
Page : 250 pages
File Size : 49,8 Mb
Release : 2007-07-12
Category : Language Arts & Disciplines
ISBN : 1403943672

A range of electronic corpora has become accessible via the WWW and CD-ROM. This coincides with improvements in standards governing the collecting, encoding and archiving of such data. This book develops similar standards for enriching and preserving 'unconventional' data: the fragmentary texts and voices left to us as accidents of history.

Creating and Digitizing Language Corpora

Author : J. Beal, K. Corrigan, H. Moisl
Publisher : Springer
Page : 245 pages
File Size : 50,8 Mb
Release : 2007-06-27
Category : Language Arts & Disciplines
ISBN : 9780230223936

A range of electronic corpora is increasingly accessible via the WWW and CD-ROM. This development coincided with improved standards governing the collecting, encoding and archiving of such data. This book looks at developing similar standards for enriching and preserving unconventional data: dialects, child language and bilingual databases.

Building a National Corpus

Author : Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees
Publisher : Springer Nature
Page : 192 pages
File Size : 41,5 Mb
Release : 2021-10-08
Category : Language Arts & Disciplines
ISBN : 9783030818586

This book aims to provide a micro-level, working model of a methodological approach and practical guidelines for building a corpus, informed by the work on the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes - the National Corpus of Contemporary Welsh). It focuses specifically on the development of detailed design frames for corpora across communicative modes (spoken, written and e-language), and the practical processes involved in the planning, collection, transcription, collation and (re)presentation of language data. The book is designed to be of significant value and relevance to those interested in critically engaging with corpus methodology. Although Welsh is the language under discussion, the processes and approaches discussed in the building of CorCenCC can be applied, to a greater or lesser extent, to other language contexts. This book provides a working model and an account of how to build a corpus dataset, from which step-by-step guidelines for creating other linguistic corpora in any language can be easily extrapolated. It will be of value to students and scholars of minority languages and corpus linguistics.

The Handbook of Language Variation and Change

Author : J. K. Chambers, Natalie Schilling
Publisher : John Wiley & Sons
Page : 628 pages
File Size : 49,7 Mb
Release : 2018-05-01
Category : Language Arts & Disciplines
ISBN : 9781119457084

Reflecting a multitude of developments in the study of language change and variation over the last ten years, this extensively updated second edition features a number of new chapters and remains the authoritative reference volume on a core research area in linguistics. It is a fully revised and expanded edition of this acclaimed reference work, which has established its reputation based on its unrivalled scope and depth of analysis in this interdisciplinary field. It includes seven new chapters, while the remainder have undergone thorough revision and updating to incorporate the latest research and reflect numerous developments in the field. It is accessibly structured by theme, covering topics including data collection and evaluation, linguistic structure, language and time, language contact, language domains, and social differentiation. It brings together an experienced, international editorial and contributor team to provide an unrivalled learning, teaching and reference tool for researchers and students in sociolinguistics.

Creation and Use of Historical English Corpora in Spain

Author : Nila Vázquez
Publisher : Cambridge Scholars Publishing
Page : 495 pages
File Size : 54,5 Mb
Release : 2014-10-21
Category : Language Arts & Disciplines
ISBN : 9781443870191

Even before the Helsinki Corpus was published, Spain had a good number of researchers in Historical English, such as the group directed by Teresa Fanego in Santiago de Compostela. In the last couple of decades, the number of scholars working in the field of Historical Corpus Linguistics has increased, and there are now some interesting projects in Spain that will result in the publication of valuable material for scholars throughout the world. The aim of this volume is twofold. On the on...

The Routledge Handbook of Corpus Linguistics

Author : Anne O'Keeffe, Michael J. McCarthy
Publisher : Routledge
Page : 684 pages
File Size : 53,8 Mb
Release : 2022-02-08
Category : Language Arts & Disciplines
ISBN : 9780429632648

The Routledge Handbook of Corpus Linguistics 2e provides an updated overview of a dynamic and rapidly growing area with a widely applied methodology. Over a decade on from the first edition of the Handbook, this collection of 47 chapters from experts in key areas offers a comprehensive introduction to both the development and use of corpora as well as their ever-evolving applications to other areas, such as digital humanities, sociolinguistics, stylistics, translation studies, materials design, language teaching and teacher development, media discourse, discourse analysis, forensic linguistics, second language acquisition and testing. The new edition updates all core chapters and includes new chapters on corpus linguistics and statistics, digital humanities, translation, phonetics and phonology, second language acquisition, social media and theoretical perspectives. Chapters provide annotated further reading lists and step-by-step guides as well as detailed overviews across a wide range of themes. The Handbook also includes a wealth of case studies that draw on some of the many new corpora and corpus tools that have emerged in the last decade. Organised across four themes, moving from the basic start-up topics such as corpus building and design to analysis, application and reflection, this second edition remains a crucial point of reference for advanced undergraduates, postgraduates and scholars in applied linguistics.

Data Collection in Sociolinguistics

Author : Christine Mallinson, Becky Childs, Gerard Van Herk
Publisher : Routledge
Page : 352 pages
File Size : 45,7 Mb
Release : 2013-05-29
Category : Language Arts & Disciplines
ISBN : 9781136486005

This edited volume provides up-to-date, succinct, relevant, and informative discussion about methods of data collection in sociolinguistic research. It covers the main areas of research design, conducting research, and sharing data findings with longer chapters and shorter vignettes written by a range of top sociolinguists, both veteran and emerging scholars. Here is the one-stop, go-to guide for the numerous quantitative, qualitative, and mixed methods that are used in sociolinguistic research, ensuring that Data Collection in Sociolinguistics will be useful not only in the classroom but also as a reference tool for active researchers. For more information, visit sociolinguisticdatacollection.com.

Quantitative Corpus Linguistics with R

Author : Stefan Th. Gries
Publisher : Taylor & Francis
Page : 274 pages
File Size : 48,8 Mb
Release : 2016-10-14
Category : Education
ISBN : 9781317597667

As in its first edition, the new edition of Quantitative Corpus Linguistics with R demonstrates how to process corpus-linguistic data with the open-source programming language and environment R. Geared in general towards linguists working with observational data, and particularly corpus linguists, it introduces R programming with emphasis on: data processing and manipulation in general; text processing, with and without regular expressions, of large bodies of textual and/or literary data; and basic aspects of statistical analysis and visualization. This book is extremely hands-on and leads the reader through dozens of small applications as well as larger case studies. Along with an array of exercise boxes and separate answer keys, the text features a didactic sequential approach in case studies by way of subsections that zoom in to every programming problem. The companion website to the book contains all relevant R code (amounting to approximately 7,000 lines of heavily commented code), most of the data sets as well as pointers to others, and a dedicated Google newsgroup. This new edition is ideal for both researchers in corpus linguistics and instructors who want to promote hands-on approaches to data in corpus linguistics courses.

Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig

Author : Dawn Knight, Steve Morris, Tess Fitzpatrick
Publisher : Springer Nature
Page : 178 pages
File Size : 40,8 Mb
Release : 2021-07-05
Category : Language Arts & Disciplines
ISBN : 9783030724849

This bilingual book provides a detailed overview of the project to construct a National Corpus of Contemporary Welsh (CorCenCC), addressing the conceptual and methodological challenges faced when developing language corpora for minoritised languages. A conceptual framework is presented for the user-driven design that underpinned the CorCenCC project, along with a detailed blueprint that can function as a scaffold for other researchers embarking on projects of this nature. This book will be of value to those working in language teaching, learning and assessment, language policy and planning, translation, corpus linguistics and language technology, and to anyone with an interest in Welsh and other minoritised languages.

Corpus Linguistics and the Analysis of Sociolinguistic Change

Author : Joan O'Sullivan
Publisher : Routledge
Page : 241 pages
File Size : 54,6 Mb
Release : 2019-10-28
Category : Language Arts & Disciplines
ISBN : 9781000710779

Corpus Linguistics and the Analysis of Sociolinguistic Change demonstrates how particular styles and varieties of language are chosen and represented in the media, to reveal changing language ideologies and sociolinguistic change. Drawing on a corpus of ads broadcast on an Irish radio station between 1977 and 2017, this book shows how corpus linguistic tools can be creatively employed, in conjunction with frameworks and concepts such as audience and referee design and indexicality, and examines how accents and dialects (vernacular and prestige) are exploited in the ads across the decades. In addition, this book: illustrates the key principles of corpus design for sociolinguistic studies and offers a framework for future diachronic corpus studies of advertising on social media; provides a model for analysing corpus data at both inter-varietal and intra-varietal levels in terms of both accent and dialectal features and explores the efficacy of using particular corpus linguistic tools; identifies key factors which can be used by researchers as evidence for sociolinguistic change and links these factors to relevant theories and frameworks; and demonstrates how corpus tools can be used to compare advertising discourse with naturally occurring discourse, with particular reference to markers of (pseudo) intimate discourse. Building on the growing body of research relating to variation and change in Irish English, this book is key reading for researchers and advanced students undertaking research within the areas of sociolinguistics and corpus linguistics.