Designing And Evaluating Language Corpora

Designing and Evaluating Language Corpora

Author : Jesse Egbert, Douglas Biber, Bethany Gray
Publisher : Cambridge University Press
Page : 299 pages
File Size : 41,5 Mb
Release : 2022-04-14
Category : Computers
ISBN : 9781107151383

This volume introduces a new framework for conceptualizing and achieving corpus representativeness in a rigorous, yet practical way.

Analysing Representation

Author : Frazer Heritage, Charlotte Taylor
Publisher : Taylor & Francis
Page : 316 pages
File Size : 45,9 Mb
Release : 2024-05-31
Category : Language Arts & Disciplines
ISBN : 9781040018989

Analysing Representation: A Corpus and Discourse Textbook guides readers through the process of researching how people and phenomena are represented in discourse and introduces them to key tools they can use from corpus linguistics and (critical) discourse analysis. This book takes a step-by-step approach to introducing each concept and includes exercises and further reading to help readers check their progress and prepare for independent research. It is unique in introducing readers to a range of experts representing the full range of work in this area. This book is aimed at final-year undergraduate, taught postgraduate and doctoral level students. It will also be useful to scholars who are new to combining corpus and discourse methods in investigations of representation.

Multi-Dimensional Analysis

Author : Tony Berber Sardinha, Marcia Veirano Pinto
Publisher : Bloomsbury Publishing
Page : 304 pages
File Size : 49,9 Mb
Release : 2019-03-21
Category : Computers
ISBN : 9781350023840

Multi-Dimensional Analysis: Research Methods and Current Issues provides a comprehensive guide both to the statistical methods in Multi-Dimensional Analysis (MDA) and its key elements, such as corpus building, tagging, and tools. The major goal is to explain the steps involved in the method so that readers may better understand this complex research framework and conduct MD research on their own. Multi-Dimensional Analysis is a method that allows the researcher to describe different registers (textual varieties defined by their social use) such as academic settings, regional discourse, social media, movies, and pop songs. Through multivariate statistical techniques, MDA identifies complementary correlation groupings of dozens of variables, including variables which belong both to the grammatical and semantic domains. Such groupings are then associated with situational variables of texts like information density, orality, and narrativity to determine linguistic constructs known as dimensions of variation, which provide a scale for the comparison of a large number of texts and registers. This book is a comprehensive research guide to MDA.

Corpus Linguistics for Health Communication

Author : Gavin Brookes, Luke C. Collins
Publisher : Taylor & Francis
Page : 261 pages
File Size : 40,5 Mb
Release : 2023-12-22
Category : Language Arts & Disciplines
ISBN : 9781003819790

Corpus Linguistics for Health Communication provides an accessible and practical introduction to the use of corpus linguistics methods to analyse health-related language use across various contexts and genres. Offering a critical review of the field, discussion of extended case studies, and practical exercises based on spoken, written, and digital language data, this book: introduces the fields of health communication and corpus linguistics and critically reviews cutting-edge studies in the burgeoning area of corpus-based health communication; describes the processes involved in planning a corpus linguistics study of health communication, including designing and building a corpus, selecting tools, and implementing techniques of analysis; demonstrates how corpus linguistics methods can be, and have been, applied to the study of spoken, written, and digital health communication, offering critical reflections and suggesting areas for future development. Corpus Linguistics for Health Communication is essential reading for those working at the interface of corpus linguistics and health communication. Readers with a little or a lot of experience in either field will find value in its pages.

Developing Linguistic Corpora

Author : Martin Wynne
Publisher : Oxbow Books Limited
Page : 100 pages
File Size : 41,9 Mb
Release : 2005
Category : Language Arts & Disciplines
ISBN : UVA:X004991162

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.

Teaching and Language Corpora

Author : Anne Wichmann, Steven Fligelstone
Publisher : Routledge
Page : 312 pages
File Size : 43,6 Mb
Release : 2014-06-11
Category : Foreign Language Study
ISBN : 9781317889571

Corpora are well-established as a resource for language research; they are now also increasingly being used for teaching purposes. This book is the first of its kind to deal explicitly and in a wide-ranging way with the use of corpora in teaching. It contains an extensive collection of articles by corpus linguists and practising teachers, covering not only the use of data to inform and create teaching materials but also the direct exploitation of corpora by students, both in the study of linguistics in general and in the acquisition of proficiency in individual languages, including English, Welsh, German, French and Italian. In addition, the book offers practical information on the sources of corpora and concordances, including those suitable for work on non-roman scripts such as Greek and Cyrillic.

Evaluation and Translation

Author : Carol Maier
Publisher : Routledge
Page : 237 pages
File Size : 54,6 Mb
Release : 2014-04-23
Category : Language Arts & Disciplines
ISBN : 9781317640844

The definition of value or quality with respect to work in translation has historically been a particularly vexed issue. Today, however, the growing demand for translations in such fields as technology and business and the increased scrutiny of translators' work by scholars in many disciplines is giving rise to a need for more nuanced, more specialized, and more explicit methods of determining value. Some refer to this determination as evaluation, others use the term assessment. Either way, the question is one of measurement and judgement, which are always unavoidably subjective and frequently rest on criteria that are not overtly expressed. This means that devising more complex evaluative practices involves not only quantitative techniques but also an exploration of the attitudes, preferences, or individual values on which criteria are established. Intended as an interrogation and a critique that can serve to prompt a more thorough and open consideration of evaluative criteria, this special issue of The Translator offers examinations of diverse evaluative practices and contains both empirical and hermeneutic work. Topics addressed include the evaluation of student translations using more up-to-date and positive methods such as those employed in corpus studies; the translation of non-standard language; translation into the second language; terminology; the application of theoretical criteria to practice; a social-textual perspective; and the reviewing of literary translations in the press. In addition, reviews by a number of literary translators discuss specific translations both into and out of English.

History, Features, and Typology of Language Corpora

Author : Niladri Sekhar Dash, S. Arulmozi
Publisher : Springer
Page : 293 pages
File Size : 46,5 Mb
Release : 2018-02-01
Category : Language Arts & Disciplines
ISBN : 9789811074585

This book discusses key issues of corpus linguistics like the definition of the corpus, primary features of a corpus, and utilization and limitations of corpora. It presents a unique classification scheme of language corpora to show how they can be studied from the perspective of genre, nature, text type, purpose, and application. A reference to parallel translation corpus is mandatory in the discussion of corpus generation, which the authors thoroughly address here, with a focus on Indian language corpora and English. Web-text corpus, a new development in corpus linguistics, is also discussed with elaborate reference to Indian web text corpora. The book also presents a short history of corpus generation and provides scenarios before and after the advent of computer-generated digital corpora. This book has several important features: it discusses many technical issues of the field in a lucid manner; contains extensive new diagrams and charts for easy comprehension; and presents discussions in simplified English to cater to the needs of non-native English readers. This is an important resource authored by academics who have many years of experience teaching and researching corpus linguistics. Its focus on Indian languages and on English corpora makes it applicable to students of graduate and postgraduate courses in applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.

Web Corpus Construction

Author : Roland Schäfer, Felix Bildhauer
Publisher : Morgan & Claypool Publishers
Page : 147 pages
File Size : 54,6 Mb
Release : 2013-07-01
Category : Computers
ISBN : 9781608459841

The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several advantages to this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitude the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and removal of duplicated content. Linguistic processing and problems with linguistic processing coming from the different kinds of noise in web corpora are also covered. Finally, the authors show how web corpora can be evaluated and compared to other corpora (such as traditionally compiled corpora). For additional material please visit the companion website: sites.morganclaypool.com/wcc

Table of Contents: Preface / Acknowledgments / Web Corpora / Data Collection / Post-Processing / Linguistic Processing / Corpus Evaluation and Comparison / Bibliography / Authors' Biographies

Corpus Linguistics

Author : Douglas Biber, Susan Conrad, Randi Reppen
Publisher : Cambridge University Press
Page : 128 pages
File Size : 46,8 Mb
Release : 1998-04-23
Category : Language Arts & Disciplines
ISBN : 9781316582565

This book is about investigating the way people use language in speech and writing. It introduces the corpus-based approach to linguistics, based on analysis of large databases of real language examples stored on computer. Each chapter focuses on a different area of linguistics, including lexicography, grammar, discourse, register variation, language acquisition, and historical linguistics. Example analyses are presented in each chapter to provide concrete descriptions of the research methods and advantages of corpus-based techniques. Ten methodology boxes provide clear and concise explanations of the issues in doing corpus-based research and reading corpus-based studies and there is a useful appendix of resources for corpus-based investigation. This lucid and comprehensive introduction to the subject will be welcomed by a broad range of readers, from undergraduate students to professional researchers.

Multiple Affordances of Language Corpora for Data-driven Learning

Author : Agnieszka Leńko-Szymańska, Alex Boulton
Publisher : John Benjamins Publishing Company
Page : 312 pages
File Size : 53,6 Mb
Release : 2015-05-15
Category : Foreign Language Study
ISBN : 9789027268716

In recent years, corpora have found their way into language instruction, albeit often indirectly, through their role in syllabus and course design and in the production of teaching materials and other resources. An alternative and more innovative use is for teachers and students alike to explore corpus data directly as part of the learning process. This volume addresses this latter application of corpora by providing research insights firmly based in the classroom context and reporting on several state-of-the-art projects around the world where learners have direct access to corpus resources and tools and utilize them to improve their control of the language systems and skills or their professional expertise as translators. Its aim is to present recent advances in data-driven learning, addressing issues involving different types of corpora, for different learner profiles, in different ways for different purposes, and using a variety of different research methodologies and perspectives.

Corpora in Language Teaching and Learning

Author : Yvonne Alexandra Breyer
Publisher : Peter Lang Gmbh, Internationaler Verlag Der Wissenschaften
File Size : 44,7 Mb
Release : 2011
Category : Computational linguistics
ISBN : 3631630417

This book highlights the potential and the challenges of corpora in language education with a particular focus on the teacher's perspective. For this purpose, the study explores the relevance of the corpus approach to central paradigms underlying contemporary language education. Furthermore, a critical analysis investigates the persisting gap between research findings and their implementation in teaching practices. As a result, key factors in advancing the popularisation of corpora in language education are identified. A survey and a case study verify this gap and, importantly, underline the pivotal role of adequate teacher education if corpus-based language teaching is to make any significant impact on current teaching practices.

Variation across Speech and Writing

Author : Douglas Biber
Publisher : Cambridge University Press
Page : 128 pages
File Size : 42,6 Mb
Release : 1991-12-19
Category : Language Arts & Disciplines
ISBN : 9781316582329

Similarities and differences between speech and writing have been the subject of innumerable studies, but until now there has been no attempt to provide a unified linguistic analysis of the whole range of spoken and written registers in English. In this widely acclaimed empirical study, Douglas Biber uses computational techniques to analyse the linguistic characteristics of twenty-three spoken and written genres, enabling identification of the basic, underlying dimensions of variation in English. In Variation Across Speech and Writing, six dimensions of variation are identified through a factor analysis, on the basis of linguistic co-occurrence patterns. The resulting model of variation provides for the description of the distinctive linguistic characteristics of any spoken or written text and demonstrates the ways in which the polarization of speech and writing has been misleading, and thus enables reconciliation of the contradictory conclusions reached in previous research.

Register Variation on the Web

Author : Douglas Biber, Jesse Egbert
Publisher : Cambridge University Press
Page : 265 pages
File Size : 40,5 Mb
Release : 2018-08-23
Category : Computers
ISBN : 9781107122161

Explores and provides situational and linguistic descriptions of the full range of registers found on the searchable web.

Dimensions of Register Variation

Author : Douglas Biber
Publisher : Cambridge University Press
Page : 448 pages
File Size : 41,9 Mb
Release : 1995-08-31
Category : Language Arts & Disciplines
ISBN : 9780521473316

Douglas Biber's new book extends and refines the research and methodology reported in his groundbreaking Variation Across Speech and Writing (CUP 1988). In Dimensions of Register Variation he gives a linguistic analysis of register in four widely differing languages: English, Nukulaelae Tuvaluan, Korean, and Somali. Using the multi-dimensional analytical framework employed in his earlier work, Biber carries out a principled comparison of both synchronic and diachronic patterns of variation across the four languages. Striking similarities as well as differences emerge, allowing Biber to predict for the first time cross-linguistic universals of register variation. This major new work will provide the foundation for the further investigation of cross-linguistic universals governing the pattern of discourse variation across registers, and will be of wide interest to any scholar interested in style, register and literacy.