Machine Learning in Translation Corpora Processing

Author: Krzysztof Wolk
Publisher: CRC Press
ISBN: 0429588836
Category : Computers
Languages : en
Pages : 205

Book Description
This book reviews ways to improve statistical machine speech translation between Polish and English. Research has been conducted mostly on dictionary-based, rule-based, and syntax-based machine translation techniques. Most popular methodologies and tools are not well suited to the Polish language and therefore require adaptation, and both parallel and monolingual language resources are scarce. The main objective of this volume is to develop an automatic and robust Polish-to-English translation system that meets specific translation requirements and to build bilingual textual resources by mining comparable corpora.

Machine Learning in Translation Corpora Processing

Author: Krzysztof Wolk
Publisher: CRC Press
ISBN: 0429590776
Category : Computers
Languages : en
Pages : 281

Book Description
This book reviews ways to improve statistical machine speech translation between Polish and English. Research has been conducted mostly on dictionary-based, rule-based, and syntax-based machine translation techniques. Most popular methodologies and tools are not well suited to the Polish language and therefore require adaptation, and both parallel and monolingual language resources are scarce. The main objective of this volume is to develop an automatic and robust Polish-to-English translation system that meets specific translation requirements and to build bilingual textual resources by mining comparable corpora.

Learning Machine Translation

Author: Cyril Goutte
Publisher: MIT Press
ISBN: 0262072971
Category : Computers
Languages : en
Pages : 329

Book Description
How machine learning can improve machine translation: enabling technologies and new statistical techniques.

Neural Machine Translation

Author: Philipp Koehn
Publisher: Cambridge University Press
ISBN: 1108497322
Category : Computers
Languages : en
Pages : 409

Book Description
Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.

Parallel Corpora for Contrastive and Translation Studies

Author: Irene Doval
Publisher: John Benjamins Publishing Company
ISBN: 9027262845
Category : Language Arts & Disciplines
Languages : en
Pages : 313

Book Description
This volume assesses the state of the art of parallel corpus research as a whole, reporting on advances both in the development of parallel corpora, with some particular reference to comparable corpora as well, and in ways of exploiting them for a variety of purposes. The first part of the book is devoted to new roles that parallel corpora can and should assume in translation studies and in contrastive linguistics, to the usefulness and usability of parallel corpora, and to advances in parallel corpus alignment, annotation and retrieval. There follows an up-to-date presentation of a number of parallel corpus projects currently being carried out in Europe, some of them multimodal, with certain chapters illustrating case studies developed on the basis of the corpora at hand. In most of these chapters, attention is paid to specific technical issues of corpus building. The third part of the book reflects on specific applications and on the creation of bilingual resources from parallel corpora. This volume will be welcomed by scholars, postgraduate and PhD students in the fields of contrastive linguistics, translation studies, lexicography, language teaching and learning, machine translation, and natural language processing.

Progress in Machine Translation

Author: Sergei Nirenburg
Publisher: IOS Press
ISBN: 9789051990744
Category : Computers
Languages : en
Pages : 338

Computational Linguistics and Intelligent Text Processing

Author: Alexander Gelbukh
Publisher: Springer Science & Business Media
ISBN: 3642121152
Category : Computers
Languages : en
Pages : 778

Book Description
This book constitutes the proceedings of the 11th International Conference on Computational Linguistics and Intelligent Text Processing, held in Iaşi, Romania, in March 2010. The 60 papers included in the volume were carefully reviewed and selected from numerous submissions. The book also includes 3 invited papers. The topics covered are: lexical resources, syntax and parsing, word sense disambiguation and named entity recognition, semantics and dialog, humor and emotions, machine translation and multilingualism, information extraction, information retrieval, text categorization and classification, plagiarism detection, text summarization, and speech generation.

Language Corpora Annotation and Processing

Author: Niladri Sekhar Dash
Publisher: Springer Nature
ISBN: 9811629609
Category : Computational linguistics
Languages : en
Pages :

Book Description
This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.

Parallel Text Processing

Author: Jean Véronis
Publisher: Springer Science & Business Media
ISBN: 9780792365464
Category : Computers
Languages : en
Pages : 442

Book Description
With the rising importance of multilingualism in language industries, brought about by global markets and world-wide information exchange, parallel corpora, i.e. corpora of texts accompanied by their translation, have become key resources in the development of natural language processing tools. The applications based upon parallel corpora are numerous and growing in number: multilingual lexicography and terminology, machine and human translation, cross-language information retrieval, language learning, etc. The book's chapters have been commissioned from major figures in the field of parallel corpus building and exploitation, with the aim of showing the state of the art in parallel text alignment and use ten to fifteen years after the first parallel-text alignment techniques were developed. Within the book, the following broad themes are addressed: (i) techniques for the alignment of parallel texts at various levels such as sentence, clause, and word; (ii) the use of parallel texts in fields as diverse as translation, lexicography, and information retrieval; (iii) available corpus resources and the evaluation of alignment methods. The book will be of interest to researchers and advanced students of computational linguistics, terminology, lexicography and translation, both in academia and industry.

Machine Translation

Author: Thierry Poibeau
Publisher: MIT Press
ISBN: 0262534215
Category : Computers
Languages : en
Pages : 298

Book Description
A concise, nontechnical overview of the development of machine translation, including the different approaches, evaluation issues, and major players in the industry. The dream of a universal translation device goes back many decades, long before Douglas Adams's fictional Babel fish provided this service in The Hitchhiker's Guide to the Galaxy. Since the advent of computers, research has focused on the design of digital machine translation tools—computer programs capable of automatically translating a text from a source language to a target language. This has become one of the most fundamental tasks of artificial intelligence. This volume in the MIT Press Essential Knowledge series offers a concise, nontechnical overview of the development of machine translation, including the different approaches, evaluation issues, and market potential. The main approaches are presented from a largely historical perspective and in an intuitive manner, allowing the reader to understand the main principles without knowing the mathematical details. The book begins by discussing problems that must be solved during the development of a machine translation system and offering a brief overview of the evolution of the field. It then takes up the history of machine translation in more detail, describing its pre-digital beginnings, rule-based approaches, the 1966 ALPAC (Automatic Language Processing Advisory Committee) report and its consequences, the advent of parallel corpora, the example-based paradigm, the statistical paradigm, the segment-based approach, the introduction of more linguistic knowledge into the systems, and the latest approaches based on deep learning. Finally, it considers evaluation challenges and the commercial status of the field, including activities by such major players as Google and Systran.