Statistical Machine Translation
Author: Philipp Koehn
Publisher: Cambridge University Press
ISBN: 0521874157
Category : Computers
Languages : en
Pages : 447
Book Description
The dream of automatic language translation is now closer thanks to recent advances in the techniques that underpin statistical machine translation. This class-tested textbook, written by an active researcher in the field, provides a clear and careful introduction to the latest methods and explains how to build machine translation systems for any two languages. It introduces the subject's building blocks from linguistics and probability, then covers the major models for machine translation: word-based, phrase-based, and tree-based, as well as machine translation evaluation, language modeling, discriminative training, and advanced methods to integrate linguistic annotation. The book also reports the latest research, presents the major outstanding challenges, and enables novices as well as experienced researchers to make novel contributions to this exciting area. Ideal for students at undergraduate and graduate level, or for anyone interested in the latest developments in machine translation.
Syntax-based Statistical Machine Translation
Author: Philip Williams
Publisher: Springer Nature
ISBN: 3031021649
Category : Computers
Languages : en
Pages : 190
Book Description
This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous context-free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.
Neural Machine Translation
Author: Philipp Koehn
Publisher: Cambridge University Press
ISBN: 1108497322
Category : Computers
Languages : en
Pages : 409
Book Description
Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.
Verbmobil: Foundations of Speech-to-Speech Translation
Author: Wolfgang Wahlster
Publisher: Springer Science & Business Media
ISBN: 3662042304
Category : Computers
Languages : en
Pages : 676
Book Description
In 1992 it seemed very difficult to answer the question whether it would be possible to develop a portable system for the automatic recognition and translation of spontaneous speech. Previous research work on speech processing had focused on read speech only and international projects aimed at automated text translation had just been terminated without achieving their objectives. Within this context, the German Federal Ministry of Education and Research (BMBF) made a careful analysis of all national and international research projects conducted in the field of speech and language technology before deciding to launch an eight-year basic-research lead project in which research groups were to cooperate in an interdisciplinary and international effort covering the disciplines of computer science, computational linguistics, translation science, signal processing, communication science and artificial intelligence. At some point, the project comprised up to 135 work packages with up to 33 research groups working on these packages. The project was controlled by means of a network plan. Every two years the project situation was assessed and the project goals were updated. An international scientific advisory board provided advice for BMBF. A new scientific approach was chosen for this project: coping with the complexity of spontaneous speech with all its pertinent phenomena such as ambiguities, self-corrections, hesitations and disfluencies took precedence over the intended lexicon size. Another important aspect was that prosodic information was exploited at all processing stages.
Learning Machine Translation
Author: Cyril Goutte
Publisher: MIT Press
ISBN: 0262072971
Category : Computers
Languages : en
Pages : 329
Book Description
How Machine Learning can improve machine translation: enabling technologies and new statistical techniques.
Readings in Machine Translation
Author: Sergei Nirenburg
Publisher: MIT Press
ISBN: 9780262140744
Category : Computers
Languages : en
Pages : 444
Book Description
The field of machine translation (MT) - the automation of translation between human languages - has existed for more than 50 years. MT helped to usher in the field of computational linguistics and has influenced methods and applications in knowledge representation, information theory, and mathematical statistics.
KI 2002: Advances in Artificial Intelligence
Author: Matthias Jarke
Publisher: Springer
ISBN: 3540457518
Category : Computers
Languages : en
Pages : 319
Book Description
This book constitutes the refereed proceedings of the 25th Annual German conference on Artificial Intelligence, KI 2002, held in Aachen, Germany in September 2002. The 20 revised full papers presented were carefully reviewed and selected from 58 submissions. The book offers topical sections on natural language processing; machine learning; knowledge representation, semantic web, and AI; neural networks; logic programming, theorem proving, and model checking; and vision and spatial reasoning.
Human Language Technology. Challenges of the Information Society
Author: Zygmunt Vetulani
Publisher: Springer
ISBN: 364204235X
Category : Computers
Languages : en
Pages : 486
Book Description
Half a century ago not many people had realized that a new epoch in the history of homo sapiens had just started. The term “Information Society Age” seems an appropriate name for this epoch. Communication was without a doubt a lever of the conquest of the human race over the rest of the animate world. There is little doubt that the human race began when our predecessors started to communicate with each other using language. This highly abstract means of communication was probably one of the major factors contributing to the evolutionary success of the human race within the animal world. Physically weak and imperfect, humans started to dominate the rest of the world through the creation of communication-based societies where individuals communicated initially to satisfy immediate needs, and then to create, accumulate and process knowledge for future use. The crucial step in the history of humanity was the invention of writing. It is worth noting that writing is a human invention, not a phenomenon resulting from natural evolution. Humans invented writing as a technique for recording speech as well as for storing and facilitating the dissemination of knowledge across the world. Humans continue to be born illiterate, and therefore teaching and conscious supervised learning is necessary to maintain this basic social skill.
Quality Estimation for Machine Translation
Author: Lucia Specia
Publisher: Springer Nature
ISBN: 3031021681
Category : Computers
Languages : en
Pages : 148
Book Description
Many applications within natural language processing involve performing text-to-text transformations, i.e., given a text in natural language as input, systems are required to produce a version of this text (e.g., a translation), also in natural language, as output. Automatically evaluating the output of such systems is an important component in developing text-to-text applications. Two approaches have been proposed for this problem: (i) to compare the system outputs against one or more reference outputs using string matching-based evaluation metrics and (ii) to build models based on human feedback to predict the quality of system outputs without reference texts. Despite their popularity, reference-based evaluation metrics are faced with the challenge that multiple good (and bad) quality outputs can be produced by text-to-text approaches for the same input. This variation is very hard to capture, even with multiple reference texts. In addition, reference-based metrics cannot be used in production (e.g., online machine translation systems), when systems are expected to produce outputs for any unseen input. In this book, we focus on the second set of metrics, so-called Quality Estimation (QE) metrics, where the goal is to provide an estimate on how good or reliable the texts produced by an application are without access to gold-standard outputs. QE enables different types of evaluation that can target different types of users and applications. Machine learning techniques are used to build QE models with various types of quality labels and explicit features or learnt representations, which can then predict the quality of unseen system outputs. This book describes the topic of QE for text-to-text applications, covering quality labels, features, algorithms, evaluation, uses, and state-of-the-art approaches. It focuses on machine translation as application, since this represents most of the QE work done to date. 
It also briefly describes QE for several other applications, including text simplification, text summarization, grammatical error correction, and natural language generation.
Advances in Empirical Translation Studies
Author: Meng Ji
Publisher: Cambridge University Press
ISBN: 1108423272
Category : Computers
Languages : en
Pages : 285
Book Description
Introduces the integration of theoretical and applied translation studies for socially-oriented and data-driven empirical translation research.