Empirical Methods For Exploiting Parallel Texts


Empirical Methods for Exploiting Parallel Texts

Author : I. Dan Melamed
Publisher : MIT Press
Page : 224 pages
File Size : 50,7 Mb
Release : 2001
Category : Computers
ISBN : 0262133806


This book lays out the theory and the practical techniques for discovering and applying translational equivalence at the lexical level. Parallel texts (bitexts) are a goldmine of linguistic knowledge, because the translation of a text into another language can be viewed as a detailed annotation of what that text means. Knowledge about translational equivalence, which can be gleaned from bitexts, is of central importance for applications such as manual and machine translation, cross-language information retrieval, and corpus linguistics. The availability of bitexts has increased dramatically since the advent of the Web, making their study an exciting new area of research in natural language processing. The book is a start-to-finish guide to designing and evaluating many translingual applications.

Parallel Text Processing

Author : Jean Véronis
Publisher : Springer Science & Business Media
Page : 417 pages
File Size : 40,8 Mb
Release : 2013-03-14
Category : Language Arts & Disciplines
ISBN : 9789401725354


This book evolved from the ARCADE evaluation exercise that started in 1995. The project's goal is to evaluate alignment systems for parallel texts, i.e., texts accompanied by their translation. Thirteen teams from various places around the world have participated so far and for the first time, some ten to fifteen years after the first alignment techniques were designed, the community has been able to get a clear picture of the behaviour of alignment systems. Several chapters in this book describe the details of competing systems, and the last chapter is devoted to the description of the evaluation protocol and results. The remaining chapters were especially commissioned from researchers who have been major figures in the field in recent years, in an attempt to address a wide range of topics that describe the state of the art in parallel text processing and use. As I recalled in the introduction, the Rosetta stone won eternal fame as the prototype of parallel texts, but such texts are probably almost as old as the invention of writing. Nowadays, parallel texts are electronic, and they are becoming an increasingly important resource for building the natural language processing tools needed in the "multilingual information society" that is currently emerging at an incredible speed. Applications are numerous, and they are expanding every day: multilingual lexicography and terminology, machine and human translation, cross-language information retrieval, language learning, etc.

Computational Linguistics and Intelligent Text Processing

Author : Alexander Gelbukh
Publisher : Springer Science & Business Media
Page : 664 pages
File Size : 52,9 Mb
Release : 2003-01-31
Category : Language Arts & Disciplines
ISBN : 9783540005322


CICLing 2003 (www.CICLing.org) was the 4th annual Conference on Intelligent Text Processing and Computational Linguistics. It was intended to provide a balanced view of the cutting-edge developments in both the theoretical foundations of computational linguistics and the practice of natural language text processing with its numerous applications. A feature of CICLing conferences is their wide scope that covers nearly all areas of computational linguistics and all aspects of natural language processing applications. The conference is a forum for dialogue between the specialists working in these two areas. This year we were honored by the presence of our keynote speakers Eric Brill (Microsoft Research, USA), Aravind Joshi (U. Pennsylvania, USA), Adam Kilgarriff (Brighton U., UK), and Ted Pedersen (U. Minnesota, USA), who delivered excellent extended lectures and organized vivid discussions. Of 92 submissions received, after careful reviewing 67 were selected for presentation; 43 as full papers and 24 as short papers, by 150 authors from 23 countries: Spain (23 authors), China (20), USA (16), Mexico (13), Japan (12), UK (11), Czech Republic (8), Korea and Sweden (7 each), Canada and Ireland (5 each), Hungary (4), Brazil (3), Belgium, Germany, Italy, Romania, Russia and Tunisia (2 each), Cuba, Denmark, Finland and France (1 each).

Bibliography of Translation Studies: 2001

Author : Lynne Bowker
Publisher : Routledge
Page : 93 pages
File Size : 48,8 Mb
Release : 2017-07-05
Category : Language Arts & Disciplines
ISBN : 9781351573856


A volume of selected, annotated references arranged under specific headings to provide a non-partisan guide to teachers involved in designing courses in translation and/or interpreting.

Bitext Alignment

Author : Jörg Tiedemann
Publisher : Morgan & Claypool Publishers
Page : 168 pages
File Size : 52,6 Mb
Release : 2011
Category : Computers
ISBN : 9781608455102


This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques. Table of Contents: Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks

Routledge Encyclopedia of Translation Technology

Author : Sin-Wai Chan
Publisher : Routledge
Page : 718 pages
File Size : 44,8 Mb
Release : 2014-11-13
Category : Foreign Language Study
ISBN : 9781317608158


The Routledge Encyclopedia of Translation Technology provides a state-of-the-art survey of the field of computer-assisted translation. It is the first definitive reference to provide a comprehensive overview of the general, regional and topical aspects of this increasingly significant area of study. The Encyclopedia is divided into three parts: Part One presents general issues in translation technology, such as its history and development, translator training and various aspects of machine translation, including a valuable case study of its teaching at a major university; Part Two discusses national and regional developments in translation technology, offering contributions covering the crucial territories of China, Canada, France, Hong Kong, Japan, South Africa, Taiwan, the Netherlands and Belgium, the United Kingdom and the United States; Part Three evaluates specific matters in translation technology, with entries focused on subjects such as alignment, bitext, computational lexicography, corpus, editing, online translation, subtitling and technology and translation management systems. The Routledge Encyclopedia of Translation Technology draws on the expertise of over fifty contributors from around the world and an international panel of consultant editors to provide a selection of articles on the most pertinent topics in the discipline. All the articles are self-contained, extensively cross-referenced, and include useful and up-to-date references and information for further reading. It will be an invaluable reference work for anyone with a professional or academic interest in the subject.

Envisioning Machine Translation in the Information Future

Author : John S. White
Publisher : Springer
Page : 260 pages
File Size : 43,8 Mb
Release : 2003-07-31
Category : Computers
ISBN : 9783540399650


When the organizing committee of AMTA-2000 began planning, it was in that brief moment in history when we were absorbed in contemplation of the passing of the century and the millennium. Nearly everyone was comparing lists of the most important accomplishments and people of the last 10, 100, or 1000 years, imagining the radical changes likely over just the next few years, and at least mildly anxious about the potential Y2K apocalypse. The millennial theme for the conference, “Envisioning MT in the Information Future,” arose from this period. The year 2000 has now come, and nothing terrible has happened (yet) to our electronic infrastructure. Our musings about great people and events probably did not ennoble us much, and whatever sense of jubilee we held has since dissipated. So it may seem a bit obsolete or anachronistic to cast this AMTA conference into visionary themes.

Machine Learning: ECML 2003

Author : Nada Lavrač,Dragan Gamberger,Ljupco Todorovski,Hendrik Blockeel
Publisher : Springer
Page : 512 pages
File Size : 48,5 Mb
Release : 2003-11-18
Category : Computers
ISBN : 9783540398578


The proceedings of ECML/PKDD 2003 are published in two volumes: the Proceedings of the 14th European Conference on Machine Learning (LNAI 2837) and the Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (LNAI 2838). The two conferences were held on September 22–26, 2003 in Cavtat, a small tourist town in the vicinity of Dubrovnik, Croatia. As machine learning and knowledge discovery are two highly related fields, the co-location of both conferences is beneficial for both research communities. In Cavtat, ECML and PKDD were co-located for the third time in a row, following the successful co-location of the two European conferences in Freiburg (2001) and Helsinki (2002). The co-location of ECML 2003 and PKDD 2003 resulted in a joint program for the two conferences, including paper presentations, invited talks, tutorials, and workshops. Out of 332 submitted papers, 40 were accepted for publication in the ECML 2003 proceedings, and 40 were accepted for publication in the PKDD 2003 proceedings. All the submitted papers were reviewed by three referees. In addition to submitted papers, the conference program consisted of four invited talks, four tutorials, seven workshops, two tutorials combined with a workshop, and a discovery challenge.

Advances in Information Retrieval

Author : Mohand Boughanem,Catherine Berrut,Josiane Mothe,Chantal Soule-Dupuy
Publisher : Springer Science & Business Media
Page : 841 pages
File Size : 50,7 Mb
Release : 2009-03-27
Category : Computers
ISBN : 9783642009570


This book constitutes the refereed proceedings of the 30th annual European Conference on Information Retrieval Research, ECIR 2009, held in Toulouse, France in April 2009. The 42 revised full papers and 18 revised short papers presented together with the abstracts of 3 invited lectures and 25 poster papers were carefully reviewed and selected from 188 submissions. The papers are organized in topical sections on retrieval model, collaborative IR / filtering, learning, multimedia - metadata, expert search - advertising, evaluation, opinion detection, web IR, representation, clustering / categorization as well as distributed IR.

The Oxford Handbook of Computational Linguistics

Author : Ruslan Mitkov
Publisher : Oxford University Press
Page : 808 pages
File Size : 40,6 Mb
Release : 2004
Category : Computers
ISBN : 9780199276349


This handbook of computational linguistics, written for academics, graduate students and researchers, provides a state-of-the-art reference to one of the most active and productive fields in linguistics.

A Chronology of Translation in China and the West

Author : Sin-wai Chan
Publisher : Chinese University Press
Page : 596 pages
File Size : 43,9 Mb
Release : 2009
Category : Education
ISBN : 9629963558


This book is a study of the major events and publications in the world of translation in China and the West from its beginning in the legendary period to 2004, with special references to works published in Chinese and English. It covers a total of 72 countries/places and 1,000 works. All the events and activities in the field have been grouped into 22 areas or categories for easy referencing. This book is a valuable reference tool for all scholars working in the field of translation.

Creation, Use, and Deployment of Digital Information

Author : Herre van Oostendorp,Leen Breure,Andrew Dillon
Publisher : Routledge
Page : 380 pages
File Size : 52,9 Mb
Release : 2005-05-06
Category : Education
ISBN : 9781135618186


The aim of this book is to present results of scientific research on how digital information should be designed and how artifacts or systems containing digital content should maximize usability, and to explain how context can influence the nature and effectiveness of digital communication. Using a philosophical, cognitive, and technical standpoint, the book covers the issue of what digital information actually is. The text also presents research outcomes from the perspective of research in information science--broadly construed--a term now used to cover a range of theoretical and practical approaches. Creation, Use, and Deployment of Digital Information is broken down into three parts: *Part I presents information on how electronic documents can be realized--the complexities, alternatives, functions, and restrictions are treated here. *Part II discusses how human beings process information and how technical solutions can satisfy human restrictions. *Part III treats the context in which digital information processing and deployment takes place. The book has much to offer to academics in many disciplines, including science, the arts, psychology, education, and the information and computing sciences.

A Dictionary of Translation Technology

Author : Sin-wai Chan
Publisher : Chinese University Press
Page : 660 pages
File Size : 48,5 Mb
Release : 2004
Category : Computers
ISBN : 9629961482


This dictionary is intended for anyone who is interested in translation and translation technology, especially translation as an academic discipline, a language activity, a specialized profession, or a business undertaking. The book covers the theory and practice of translation and interpretation in a number of areas, addressing and explaining important concepts in computer translation, computer-aided translation, and translation tools. The most popular commercially available translation software packages are included, along with their website addresses for handy reference. This dictionary has 1,377 entries. The entries are alphabetized and defined in a simple and concise manner.

Computer Vision - ECCV 2002

Author : Anders Heyden,Gunnar Sparr,Mads Nielsen,Peter Johansen
Publisher : Springer
Page : 860 pages
File Size : 48,9 Mb
Release : 2003-08-02
Category : Computers
ISBN : 9783540479796


Premiering in 1990 in Antibes, France, the European Conference on Computer Vision, ECCV, has been held biennially at venues all around Europe. These conferences have been very successful, making ECCV a major event to the computer vision community. ECCV 2002 was the seventh in the series. The privilege of organizing it was shared by three universities: The IT University of Copenhagen, the University of Copenhagen, and Lund University, with the conference venue in Copenhagen. These universities lie geographically close in the vivid Öresund region, which lies partly in Denmark and partly in Sweden, with the newly built bridge (opened summer 2000) crossing the sound that formerly divided the countries. We are very happy to report that this year’s conference attracted more papers than ever before, with around 600 submissions. Still, together with the conference board, we decided to keep the tradition of holding ECCV as a single track conference. Each paper was anonymously refereed by three different reviewers. For the final selection, for the first time for ECCV, a system with area chairs was used. These met with the program chairs in Lund for two days in February 2002 to select what became 45 oral presentations and 181 posters. Also at this meeting the selection was made without knowledge of the authors’ identity.

Toward Category-Level Object Recognition

Author : Jean Ponce,Martial Hebert,Cordelia Schmid,Andrew Zisserman
Publisher : Springer
Page : 622 pages
File Size : 43,9 Mb
Release : 2007-01-25
Category : Computers
ISBN : 9783540687955


This volume is a post-event proceedings volume and contains selected papers based on presentations given, and vivid discussions held, during two workshops held in Taormina in 2003 and 2004. The 30 thoroughly revised papers presented are organized in the following topical sections: recognition of specific objects, recognition of object categories, recognition of object categories with geometric relations, and joint recognition and segmentation.