Empirical Methods for Exploiting Parallel Texts

Author: I. Dan Melamed
Publisher: MIT Press
ISBN: 9780262133807
Category : Computers
Languages : en
Pages : 224

Book Description
This book lays out the theory and the practical techniques for discovering and applying translational equivalence at the lexical level. Parallel texts (bitexts) are a goldmine of linguistic knowledge, because the translation of a text into another language can be viewed as a detailed annotation of what that text means. Knowledge about translational equivalence, which can be gleaned from bitexts, is of central importance for applications such as manual and machine translation, cross-language information retrieval, and corpus linguistics. The availability of bitexts has increased dramatically since the advent of the Web, making their study an exciting new area of research in natural language processing. The book is a start-to-finish guide to designing and evaluating many translingual applications.

Empirical Methods for Exploiting Parallel Texts [microform]

Author: Melamed, Ilya Dan
Publisher: Ann Arbor, Mich. : University Microfilms International
ISBN:
Category : Linguistic models
Languages : en
Pages : 204

Book Description


Empirical Methods for Exploiting Parallel Texts

Author: I. Dan Melamed
Publisher:
ISBN:
Category :
Languages : en
Pages : 195

Book Description


Bibliography of Translation Studies: 2001

Author: Lynne Bowker
Publisher: Routledge
ISBN: 1351573853
Category : Language Arts & Disciplines
Languages : en
Pages : 93

Book Description
A volume of selected, annotated references arranged under specific headings to provide a non-partisan guide to teachers involved in designing courses in translation and/or interpreting.

Routledge Encyclopedia of Translation Technology

Author: Chan Sin-wai
Publisher: Routledge
ISBN: 1317608143
Category : Foreign Language Study
Languages : en
Pages : 958

Book Description
The Routledge Encyclopedia of Translation Technology provides a state-of-the-art survey of the field of computer-assisted translation. It is the first definitive reference to provide a comprehensive overview of the general, regional and topical aspects of this increasingly significant area of study. The Encyclopedia is divided into three parts: Part One presents general issues in translation technology, such as its history and development, translator training and various aspects of machine translation, including a valuable case study of its teaching at a major university; Part Two discusses national and regional developments in translation technology, offering contributions covering the crucial territories of China, Canada, France, Hong Kong, Japan, South Africa, Taiwan, the Netherlands and Belgium, the United Kingdom and the United States; Part Three evaluates specific matters in translation technology, with entries focused on subjects such as alignment, bitext, computational lexicography, corpus, editing, online translation, subtitling and technology and translation management systems. The Routledge Encyclopedia of Translation Technology draws on the expertise of over fifty contributors from around the world and an international panel of consultant editors to provide a selection of articles on the most pertinent topics in the discipline. All the articles are self-contained, extensively cross-referenced, and include useful and up-to-date references and information for further reading. It will be an invaluable reference work for anyone with a professional or academic interest in the subject.

Bitext Alignment

Author: Jörg Tiedemann
Publisher: Morgan & Claypool Publishers
ISBN: 1608455106
Category : Computers
Languages : en
Pages : 168

Book Description
This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The predominant application is machine translation, in particular statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies, to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora, starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques. Table of Contents: Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks

Machine Learning: ECML 2003

Author: Nada Lavrač
Publisher: Springer
ISBN: 3540398570
Category : Computers
Languages : en
Pages : 521

Book Description
The proceedings of ECML/PKDD2003 are published in two volumes: the Proceedings of the 14th European Conference on Machine Learning (LNAI 2837) and the Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (LNAI 2838). The two conferences were held on September 22–26, 2003 in Cavtat, a small tourist town in the vicinity of Dubrovnik, Croatia. As machine learning and knowledge discovery are two highly related fields, the co-location of both conferences is beneficial for both research communities. In Cavtat, ECML and PKDD were co-located for the third time in a row, following the successful co-location of the two European conferences in Freiburg (2001) and Helsinki (2002). The co-location of ECML2003 and PKDD2003 resulted in a joint program for the two conferences, including paper presentations, invited talks, tutorials, and workshops. Out of 332 submitted papers, 40 were accepted for publication in the ECML2003 proceedings, and 40 were accepted for publication in the PKDD2003 proceedings. All the submitted papers were reviewed by three referees. In addition to submitted papers, the conference program consisted of four invited talks, four tutorials, seven workshops, two tutorials combined with a workshop, and a discovery challenge.

Computational Linguistics and Intelligent Text Processing

Author: Alexander Gelbukh
Publisher: Springer
ISBN: 3540246304
Category : Language Arts & Disciplines
Languages : en
Pages : 669

Book Description
CICLing 2004 was the 5th Annual Conference on Intelligent Text Processing and Computational Linguistics; see www.CICLing.org. CICLing conferences are intended to provide a balanced view of the cutting-edge developments in both theoretical foundations of computational linguistics and the practice of natural language text processing with its numerous applications. A feature of CICLing conferences is their wide scope that covers nearly all areas of computational linguistics and all aspects of natural language processing applications. These conferences are a forum for dialogue between the specialists working in the two areas. This year we were honored by the presence of our invited speakers Martin Kay of Stanford University, Philip Resnik of the University of Maryland, Ricardo Baeza-Yates of the University of Chile, and Nick Campbell of the ATR Spoken Language Translation Research Laboratories. They delivered excellent extended lectures and organized vivid discussions. Of 129 submissions received (74 full papers and 44 short papers), after careful international reviewing 74 papers were selected for presentation (40 full papers and 35 short papers), written by 176 authors from 21 countries: Korea (37), Spain (34), Japan (22), Mexico (15), China (11), Germany (10), Ireland (10), UK (10), Singapore (6), Canada (3), Czech Rep. (3), France (3), Brazil (2), Sweden (2), Taiwan (2), Turkey (2), USA (2), Chile (1), Romania (1), Thailand (1), and The Netherlands (1); the figures in parentheses stand for the number of authors from the corresponding country.

Semantic Domains in Computational Linguistics

Author: Alfio Gliozzo
Publisher: Springer Science & Business Media
ISBN: 3540681582
Category : Language Arts & Disciplines
Languages : en
Pages : 138

Book Description
Semantic fields are lexically coherent – the words they contain co-occur in texts. In this book the authors introduce and define semantic domains, a computational model for lexical semantics inspired by the theory of semantic fields. Semantic domains allow us to exploit domain features for texts, terms and concepts, and they can significantly boost the performance of natural-language processing systems. Semantic domains can be derived from existing lexical resources or can be acquired from corpora in an unsupervised manner. They also have the property of interlinguality, and they can be used to relate terms in different languages in multilingual application scenarios. The authors give a comprehensive explanation of the computational model, with detailed chapters on semantic domains, domain models, and applications of the technique in text categorization, word sense disambiguation, and cross-language text categorization. This book is suitable for researchers and graduate students in computational linguistics.

Advances in Information Retrieval

Author: Mohand Boughanem
Publisher: Springer Science & Business Media
ISBN: 3642009573
Category : Computers
Languages : en
Pages : 841

Book Description
This book constitutes the refereed proceedings of the 30th annual European Conference on Information Retrieval Research, ECIR 2009, held in Toulouse, France in April 2009. The 42 revised full papers and 18 revised short papers presented together with the abstracts of 3 invited lectures and 25 poster papers were carefully reviewed and selected from 188 submissions. The papers are organized in topical sections on retrieval model, collaborative IR / filtering, learning, multimedia - metadata, expert search - advertising, evaluation, opinion detection, web IR, representation, clustering / categorization as well as distributed IR.