Empirical Methods for Exploiting Parallel Texts

Author: I. Dan Melamed
Publisher: MIT Press
ISBN: 9780262133807
Category : Computers
Languages : en
Pages : 224

Book Description
This book lays out the theory and the practical techniques for discovering and applying translational equivalence at the lexical level. Parallel texts (bitexts) are a goldmine of linguistic knowledge, because the translation of a text into another language can be viewed as a detailed annotation of what that text means. Knowledge about translational equivalence, which can be gleaned from bitexts, is of central importance for applications such as manual and machine translation, cross-language information retrieval, and corpus linguistics. The availability of bitexts has increased dramatically since the advent of the Web, making their study an exciting new area of research in natural language processing. It is a start-to-finish guide to designing and evaluating many translingual applications.

Parallel Text Processing

Author: Jean Véronis
Publisher: Springer Science & Business Media
ISBN: 9401725357
Category : Language Arts & Disciplines
Languages : en
Pages : 417

Book Description
This book evolved from the ARCADE evaluation exercise that started in 1995. The project's goal is to evaluate alignment systems for parallel texts, i.e., texts accompanied by their translation. Thirteen teams from various places around the world have participated so far and for the first time, some ten to fifteen years after the first alignment techniques were designed, the community has been able to get a clear picture of the behaviour of alignment systems. Several chapters in this book describe the details of competing systems, and the last chapter is devoted to the description of the evaluation protocol and results. The remaining chapters were especially commissioned from researchers who have been major figures in the field in recent years, in an attempt to address a wide range of topics that describe the state of the art in parallel text processing and use. As I recalled in the introduction, the Rosetta stone won eternal fame as the prototype of parallel texts, but such texts are probably almost as old as the invention of writing. Nowadays, parallel texts are electronic, and they are becoming an increasingly important resource for building the natural language processing tools needed in the "multilingual information society" that is currently emerging at an incredible speed. Applications are numerous, and they are expanding every day: multilingual lexicography and terminology, machine and human translation, cross-language information retrieval, language learning, etc.

Machine Learning: ECML 2003

Author: Nada Lavrač
Publisher: Springer Science & Business Media
ISBN: 3540201211
Category : Computers
Languages : en
Pages : 521

Book Description
This book constitutes the refereed proceedings of the 14th European Conference on Machine Learning, ECML 2003, held in Cavtat-Dubrovnik, Croatia in September 2003 in conjunction with PKDD 2003. The 40 revised full papers presented together with 4 invited contributions were carefully reviewed and, together with another 40 for PKDD 2003, selected from a total of 332 submissions. The papers address all current issues in machine learning including support vector machines, inductive inference, feature selection algorithms, reinforcement learning, preference learning, probabilistic grammatical inference, decision tree learning, clustering, classification, agent learning, Markov networks, boosting, statistical parsing, Bayesian learning, supervised learning, and multi-instance learning.

Bitext Alignment

Author: Jörg Tiedemann
Publisher: Morgan & Claypool Publishers
ISBN: 1608455114
Category : Computers
Languages : en
Pages : 167

Book Description
This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies, to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora, starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques. Table of Contents: Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks

Toward Category-Level Object Recognition

Author: Jean Ponce
Publisher: Springer
ISBN: 3540687955
Category : Computers
Languages : en
Pages : 622

Book Description
This volume is a post-event proceedings volume and contains selected papers based on presentations given, and vivid discussions held, during two workshops held in Taormina in 2003 and 2004. The 30 thoroughly revised papers presented are organized in the following topical sections: recognition of specific objects, recognition of object categories, recognition of object categories with geometric relations, and joint recognition and segmentation.

Computational Linguistics and Intelligent Text Processing

Author: Alexander Gelbukh
Publisher: Springer Science & Business Media
ISBN: 3540005323
Category : Language Arts & Disciplines
Languages : en
Pages : 664

Book Description
CICLing 2003 (www.CICLing.org) was the 4th annual Conference on Intelligent Text Processing and Computational Linguistics. It was intended to provide a balanced view of the cutting-edge developments in both the theoretical foundations of computational linguistics and the practice of natural language text processing with its numerous applications. A feature of CICLing conferences is their wide scope that covers nearly all areas of computational linguistics and all aspects of natural language processing applications. The conference is a forum for dialogue between the specialists working in these two areas. This year we were honored by the presence of our keynote speakers Eric Brill (Microsoft Research, USA), Aravind Joshi (U. Pennsylvania, USA), Adam Kilgarriff (Brighton U., UK), and Ted Pedersen (U. Minnesota, USA), who delivered excellent extended lectures and organized vivid discussions. Of 92 submissions received, after careful reviewing 67 were selected for presentation; 43 as full papers and 24 as short papers, by 150 authors from 23 countries: Spain (23 authors), China (20), USA (16), Mexico (13), Japan (12), UK (11), Czech Republic (8), Korea and Sweden (7 each), Canada and Ireland (5 each), Hungary (4), Brazil (3), Belgium, Germany, Italy, Romania, Russia and Tunisia (2 each), Cuba, Denmark, Finland and France (1 each).

Computer and Information Sciences -- ISCIS 2003

Author: Adnan Yazici
Publisher: Springer Science & Business Media
ISBN: 3540204091
Category : Computers
Languages : en
Pages : 1128

Book Description
This book constitutes the refereed proceedings of the 18th International Symposium on Computer and Information Sciences, ISCIS 2003, held in Antalya, Turkey in November 2003. The 135 revised papers presented together with 2 invited papers were carefully reviewed and selected from over 360 submissions. The papers are organized in topical sections on architectures and systems, theoretical computer science, databases and information retrieval, e-commerce, graphics and computer vision, intelligent systems and robotics, multimedia, networks and security, parallel and distributed computing, soft computing, and software engineering.

The Oxford Handbook of Computational Linguistics

Author: Ruslan Mitkov
Publisher: Oxford University Press
ISBN: 019927634X
Category : Computers
Languages : en
Pages : 808

Book Description
This handbook of computational linguistics, written for academics, graduate students and researchers, provides a state-of-the-art reference to one of the most active and productive fields in linguistics.

Computational Linguistics

Author:
Publisher:
ISBN:
Category : Computational linguistics
Languages : en
Pages : 652

Book Description


Routledge Encyclopedia of Translation Technology

Author: Chan Sin-wai
Publisher: Routledge
ISBN: 1317608143
Category : Foreign Language Study
Languages : en
Pages : 958

Book Description
The Routledge Encyclopedia of Translation Technology provides a state-of-the-art survey of the field of computer-assisted translation. It is the first definitive reference to provide a comprehensive overview of the general, regional and topical aspects of this increasingly significant area of study. The Encyclopedia is divided into three parts: Part One presents general issues in translation technology, such as its history and development, translator training and various aspects of machine translation, including a valuable case study of its teaching at a major university; Part Two discusses national and regional developments in translation technology, offering contributions covering the crucial territories of China, Canada, France, Hong Kong, Japan, South Africa, Taiwan, the Netherlands and Belgium, the United Kingdom and the United States; Part Three evaluates specific matters in translation technology, with entries focused on subjects such as alignment, bitext, computational lexicography, corpus, editing, online translation, subtitling and technology and translation management systems. The Routledge Encyclopedia of Translation Technology draws on the expertise of over fifty contributors from around the world and an international panel of consultant editors to provide a selection of articles on the most pertinent topics in the discipline. All the articles are self-contained, extensively cross-referenced, and include useful and up-to-date references and information for further reading. It will be an invaluable reference work for anyone with a professional or academic interest in the subject.