Author: Lucia Specia
Publisher: Springer Nature
ISBN: 3031021681
Category : Computers
Languages : en
Pages : 148
Book Description
Many applications within natural language processing involve performing text-to-text transformations, i.e., given a text in natural language as input, systems are required to produce a version of this text (e.g., a translation), also in natural language, as output. Automatically evaluating the output of such systems is an important component in developing text-to-text applications. Two approaches have been proposed for this problem: (i) to compare the system outputs against one or more reference outputs using string matching-based evaluation metrics and (ii) to build models based on human feedback to predict the quality of system outputs without reference texts. Despite their popularity, reference-based evaluation metrics are faced with the challenge that multiple good (and bad) quality outputs can be produced by text-to-text approaches for the same input. This variation is very hard to capture, even with multiple reference texts. In addition, reference-based metrics cannot be used in production (e.g., online machine translation systems), when systems are expected to produce outputs for any unseen input. In this book, we focus on the second set of metrics, so-called Quality Estimation (QE) metrics, where the goal is to provide an estimate on how good or reliable the texts produced by an application are without access to gold-standard outputs. QE enables different types of evaluation that can target different types of users and applications. Machine learning techniques are used to build QE models with various types of quality labels and explicit features or learnt representations, which can then predict the quality of unseen system outputs. This book describes the topic of QE for text-to-text applications, covering quality labels, features, algorithms, evaluation, uses, and state-of-the-art approaches. It focuses on machine translation as application, since this represents most of the QE work done to date. It also briefly describes QE for several other applications, including text simplification, text summarization, grammatical error correction, and natural language generation.
Quality Estimation for Machine Translation
Author: Lucia Specia
Publisher: Springer Nature
ISBN: 3031021681
Category : Computers
Languages : en
Pages : 148
Book Description
Many applications within natural language processing involve performing text-to-text transformations, i.e., given a text in natural language as input, systems are required to produce a version of this text (e.g., a translation), also in natural language, as output. Automatically evaluating the output of such systems is an important component in developing text-to-text applications. Two approaches have been proposed for this problem: (i) to compare the system outputs against one or more reference outputs using string matching-based evaluation metrics and (ii) to build models based on human feedback to predict the quality of system outputs without reference texts. Despite their popularity, reference-based evaluation metrics are faced with the challenge that multiple good (and bad) quality outputs can be produced by text-to-text approaches for the same input. This variation is very hard to capture, even with multiple reference texts. In addition, reference-based metrics cannot be used in production (e.g., online machine translation systems), when systems are expected to produce outputs for any unseen input. In this book, we focus on the second set of metrics, so-called Quality Estimation (QE) metrics, where the goal is to provide an estimate on how good or reliable the texts produced by an application are without access to gold-standard outputs. QE enables different types of evaluation that can target different types of users and applications. Machine learning techniques are used to build QE models with various types of quality labels and explicit features or learnt representations, which can then predict the quality of unseen system outputs. This book describes the topic of QE for text-to-text applications, covering quality labels, features, algorithms, evaluation, uses, and state-of-the-art approaches. It focuses on machine translation as application, since this represents most of the QE work done to date. It also briefly describes QE for several other applications, including text simplification, text summarization, grammatical error correction, and natural language generation.
Publisher: Springer Nature
ISBN: 3031021681
Category : Computers
Languages : en
Pages : 148
Book Description
Many applications within natural language processing involve performing text-to-text transformations, i.e., given a text in natural language as input, systems are required to produce a version of this text (e.g., a translation), also in natural language, as output. Automatically evaluating the output of such systems is an important component in developing text-to-text applications. Two approaches have been proposed for this problem: (i) to compare the system outputs against one or more reference outputs using string matching-based evaluation metrics and (ii) to build models based on human feedback to predict the quality of system outputs without reference texts. Despite their popularity, reference-based evaluation metrics are faced with the challenge that multiple good (and bad) quality outputs can be produced by text-to-text approaches for the same input. This variation is very hard to capture, even with multiple reference texts. In addition, reference-based metrics cannot be used in production (e.g., online machine translation systems), when systems are expected to produce outputs for any unseen input. In this book, we focus on the second set of metrics, so-called Quality Estimation (QE) metrics, where the goal is to provide an estimate on how good or reliable the texts produced by an application are without access to gold-standard outputs. QE enables different types of evaluation that can target different types of users and applications. Machine learning techniques are used to build QE models with various types of quality labels and explicit features or learnt representations, which can then predict the quality of unseen system outputs. This book describes the topic of QE for text-to-text applications, covering quality labels, features, algorithms, evaluation, uses, and state-of-the-art approaches. It focuses on machine translation as application, since this represents most of the QE work done to date. It also briefly describes QE for several other applications, including text simplification, text summarization, grammatical error correction, and natural language generation.
Translation Quality Assessment
Author: Joss Moorkens
Publisher: Springer
ISBN: 3319912410
Category : Computers
Languages : en
Pages : 292
Book Description
This is the first volume that brings together research and practice from academic and industry settings and a combination of human and machine translation evaluation. Its comprehensive collection of papers by leading experts in human and machine translation quality and evaluation who situate current developments and chart future trends fills a clear gap in the literature. This is critical to the successful integration of translation technologies in the industry today, where the lines between human and machine are becoming increasingly blurred by technology: this affects the whole translation landscape, from students and trainers to project managers and professionals, including in-house and freelance translators, as well as, of course, translation scholars and researchers. The editors have broad experience in translation quality evaluation research, including investigations into professional practice with qualitative and quantitative studies, and the contributors are leading experts in their respective fields, providing a unique set of complementary perspectives on human and machine translation quality and evaluation, combining theoretical and applied approaches.
Publisher: Springer
ISBN: 3319912410
Category : Computers
Languages : en
Pages : 292
Book Description
This is the first volume that brings together research and practice from academic and industry settings and a combination of human and machine translation evaluation. Its comprehensive collection of papers by leading experts in human and machine translation quality and evaluation who situate current developments and chart future trends fills a clear gap in the literature. This is critical to the successful integration of translation technologies in the industry today, where the lines between human and machine are becoming increasingly blurred by technology: this affects the whole translation landscape, from students and trainers to project managers and professionals, including in-house and freelance translators, as well as, of course, translation scholars and researchers. The editors have broad experience in translation quality evaluation research, including investigations into professional practice with qualitative and quantitative studies, and the contributors are leading experts in their respective fields, providing a unique set of complementary perspectives on human and machine translation quality and evaluation, combining theoretical and applied approaches.
Machine Translation
Author: Junhui Li
Publisher:
ISBN: 9789813361638
Category : Computational linguistics
Languages : en
Pages : 143
Book Description
This book constitutes the refereed proceedings of the 16th China Conference on Machine Translation, CCMT 2020, held in Hohhot, China*, in October 2020. The 13 papers presented in this volume were carefully reviewed and selected from 78 submissions and focus on all aspects of machine translation, including preprocessing, neural machine translation models, hybrid model, evaluation method, and post-editing. The conference was moved online due to COVID-19.
Publisher:
ISBN: 9789813361638
Category : Computational linguistics
Languages : en
Pages : 143
Book Description
This book constitutes the refereed proceedings of the 16th China Conference on Machine Translation, CCMT 2020, held in Hohhot, China*, in October 2020. The 13 papers presented in this volume were carefully reviewed and selected from 78 submissions and focus on all aspects of machine translation, including preprocessing, neural machine translation models, hybrid model, evaluation method, and post-editing. The conference was moved online due to COVID-19.
The Human Factor in Machine Translation
Author: Sin-wai Chan
Publisher: Routledge
ISBN: 1351376241
Category : Language Arts & Disciplines
Languages : en
Pages : 269
Book Description
Machine translation has become increasingly popular, especially with the introduction of neural machine translation in major online translation systems. However, despite the rapid advances in machine translation, the role of a human translator remains crucial. As illustrated by the chapters in this book, man-machine interaction is essential in machine translation, localisation, terminology management, and crowdsourcing translation. In fact, the importance of a human translator before, during, and after machine processing, cannot be overemphasised as human intervention is the best way to ensure the translation quality of machine translation. This volume explores the role of a human translator in machine translation from various perspectives, affording a comprehensive look at this topical research area. This book is essential reading for anyone involved in translation studies, machine translation or interested in translation technology.
Publisher: Routledge
ISBN: 1351376241
Category : Language Arts & Disciplines
Languages : en
Pages : 269
Book Description
Machine translation has become increasingly popular, especially with the introduction of neural machine translation in major online translation systems. However, despite the rapid advances in machine translation, the role of a human translator remains crucial. As illustrated by the chapters in this book, man-machine interaction is essential in machine translation, localisation, terminology management, and crowdsourcing translation. In fact, the importance of a human translator before, during, and after machine processing, cannot be overemphasised as human intervention is the best way to ensure the translation quality of machine translation. This volume explores the role of a human translator in machine translation from various perspectives, affording a comprehensive look at this topical research area. This book is essential reading for anyone involved in translation studies, machine translation or interested in translation technology.
Machine Translation
Author: Shujian Huang
Publisher: Springer Nature
ISBN: 9811517215
Category : Computers
Languages : en
Pages : 141
Book Description
This book constitutes the refereed proceedings of the 15th China Conference on Machine Translation, CCMT 2019, held in Nanchang, China, in September 2019. The 10 full papers presented in this volume were carefully reviewed and selected from 21 submissions and focus on all aspects of machine translation, including preprocessing, neural machine translation models, hybrid model, evaluation method, and post-editing.
Publisher: Springer Nature
ISBN: 9811517215
Category : Computers
Languages : en
Pages : 141
Book Description
This book constitutes the refereed proceedings of the 15th China Conference on Machine Translation, CCMT 2019, held in Nanchang, China, in September 2019. The 10 full papers presented in this volume were carefully reviewed and selected from 21 submissions and focus on all aspects of machine translation, including preprocessing, neural machine translation models, hybrid model, evaluation method, and post-editing.
Neural Machine Translation
Author: Philipp Koehn
Publisher: Cambridge University Press
ISBN: 1108497322
Category : Computers
Languages : en
Pages : 409
Book Description
Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.
Publisher: Cambridge University Press
ISBN: 1108497322
Category : Computers
Languages : en
Pages : 409
Book Description
Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.
Dual Learning
Author: Tao Qin
Publisher: Springer Nature
ISBN: 9811588848
Category : Computers
Languages : en
Pages : 191
Book Description
Many AI (and machine learning) tasks present in dual forms, e.g., English-to-Chinese translation vs. Chinese-to-English translation, speech recognition vs. speech synthesis,question answering vs. question generation, and image classification vs. image generation. Dual learning is a new learning framework that leverages the primal-dual structure of AI tasks to obtain effective feedback or regularization signals in order to enhance the learning/inference process. Since it was first introduced four years ago, the concept has attracted considerable attention in multiple fields, and been proven effective in numerous applications, such as machine translation, image-to-image translation, speech synthesis and recognition, (visual) question answering and generation, image captioning and generation, and code summarization and generation. Offering a systematic and comprehensive overview of dual learning, this book enables interested researchers (both established and newcomers) and practitioners to gain a better understanding of the state of the art in the field. It also provides suggestions for further reading and tools to help readers advance the area. The book is divided into five parts. The first part gives a brief introduction to machine learning and deep learning. The second part introduces the algorithms based on the dual reconstruction principle using machine translation, image translation, speech processing and other NLP/CV tasks as the demo applications. It covers algorithms, such as dual semi-supervised learning, dual unsupervised learning and multi-agent dual learning. In the context of image translation, it introduces algorithms including CycleGAN, DualGAN, DiscoGAN cdGAN and more recent techniques/applications. The third part presents various work based on the probability principle, including dual supervised learning and dual inference based on the joint-probability principle and dual semi-supervised learning based on the marginal-probability principle. The fourth part reviews various theoretical studies on dual learning and discusses its connections to other learning paradigms. The fifth part provides a summary and suggests future research directions.
Publisher: Springer Nature
ISBN: 9811588848
Category : Computers
Languages : en
Pages : 191
Book Description
Many AI (and machine learning) tasks present in dual forms, e.g., English-to-Chinese translation vs. Chinese-to-English translation, speech recognition vs. speech synthesis,question answering vs. question generation, and image classification vs. image generation. Dual learning is a new learning framework that leverages the primal-dual structure of AI tasks to obtain effective feedback or regularization signals in order to enhance the learning/inference process. Since it was first introduced four years ago, the concept has attracted considerable attention in multiple fields, and been proven effective in numerous applications, such as machine translation, image-to-image translation, speech synthesis and recognition, (visual) question answering and generation, image captioning and generation, and code summarization and generation. Offering a systematic and comprehensive overview of dual learning, this book enables interested researchers (both established and newcomers) and practitioners to gain a better understanding of the state of the art in the field. It also provides suggestions for further reading and tools to help readers advance the area. The book is divided into five parts. The first part gives a brief introduction to machine learning and deep learning. The second part introduces the algorithms based on the dual reconstruction principle using machine translation, image translation, speech processing and other NLP/CV tasks as the demo applications. It covers algorithms, such as dual semi-supervised learning, dual unsupervised learning and multi-agent dual learning. In the context of image translation, it introduces algorithms including CycleGAN, DualGAN, DiscoGAN cdGAN and more recent techniques/applications. The third part presents various work based on the probability principle, including dual supervised learning and dual inference based on the joint-probability principle and dual semi-supervised learning based on the marginal-probability principle. The fourth part reviews various theoretical studies on dual learning and discusses its connections to other learning paradigms. The fifth part provides a summary and suggests future research directions.
Statistical Machine Translation
Author: Philipp Koehn
Publisher: Cambridge University Press
ISBN: 0521874157
Category : Computers
Languages : en
Pages : 447
Book Description
The dream of automatic language translation is now closer thanks to recent advances in the techniques that underpin statistical machine translation. This class-tested textbook from an active researcher in the field, provides a clear and careful introduction to the latest methods and explains how to build machine translation systems for any two languages. It introduces the subject's building blocks from linguistics and probability, then covers the major models for machine translation: word-based, phrase-based, and tree-based, as well as machine translation evaluation, language modeling, discriminative training and advanced methods to integrate linguistic annotation. The book also reports the latest research, presents the major outstanding challenges, and enables novices as well as experienced researchers to make novel contributions to this exciting area. Ideal for students at undergraduate and graduate level, or for anyone interested in the latest developments in machine translation.
Publisher: Cambridge University Press
ISBN: 0521874157
Category : Computers
Languages : en
Pages : 447
Book Description
The dream of automatic language translation is now closer thanks to recent advances in the techniques that underpin statistical machine translation. This class-tested textbook from an active researcher in the field, provides a clear and careful introduction to the latest methods and explains how to build machine translation systems for any two languages. It introduces the subject's building blocks from linguistics and probability, then covers the major models for machine translation: word-based, phrase-based, and tree-based, as well as machine translation evaluation, language modeling, discriminative training and advanced methods to integrate linguistic annotation. The book also reports the latest research, presents the major outstanding challenges, and enables novices as well as experienced researchers to make novel contributions to this exciting area. Ideal for students at undergraduate and graduate level, or for anyone interested in the latest developments in machine translation.
Trends in E-Tools and Resources for Translators and Interpreters
Author:
Publisher: BRILL
ISBN: 9004351795
Category : Language Arts & Disciplines
Languages : en
Pages : 265
Book Description
Trends in E-Tools and Resources for Translators and Interpreters offers a collection of contributions from key players in the field of translation and interpreting that accurately outline some of the most cutting-edge technologies in this field that are available or under development at the moment in both professional and academic contexts. Particularly, this volume provides a wide picture of the state of the art, looking not only at the world of technology for translators but also at the hitherto overlooked world of technology for interpreters. This volume is accessible and comprehensive enough to be of benefit to different categories of readers: scholars, professionals and trainees. Contributors are: Pierrette Bouillon, Gloria Corpas Pastor, Hernani Costa, Isabel Durán-Muñoz, Claudio Fantinuoli, Johanna Gerlach, Joanna Gough, Asheesh Gulati, Veronique Hoste, Amélie Josselin, David Lewis, Lieve Macken, John Moran, Aurelie Picton, Emmanuel Planas, Éric Poirier, Victoria Porro, Celia Rico Pérez, Christian Saam, Pilar Sánchez-Gijón, Míriam Seghiri Domínguez, Violeta Seretan, Arda Tezcan, Olga Torres, and Anna Zaretskaya.
Publisher: BRILL
ISBN: 9004351795
Category : Language Arts & Disciplines
Languages : en
Pages : 265
Book Description
Trends in E-Tools and Resources for Translators and Interpreters offers a collection of contributions from key players in the field of translation and interpreting that accurately outline some of the most cutting-edge technologies in this field that are available or under development at the moment in both professional and academic contexts. Particularly, this volume provides a wide picture of the state of the art, looking not only at the world of technology for translators but also at the hitherto overlooked world of technology for interpreters. This volume is accessible and comprehensive enough to be of benefit to different categories of readers: scholars, professionals and trainees. Contributors are: Pierrette Bouillon, Gloria Corpas Pastor, Hernani Costa, Isabel Durán-Muñoz, Claudio Fantinuoli, Johanna Gerlach, Joanna Gough, Asheesh Gulati, Veronique Hoste, Amélie Josselin, David Lewis, Lieve Macken, John Moran, Aurelie Picton, Emmanuel Planas, Éric Poirier, Victoria Porro, Celia Rico Pérez, Christian Saam, Pilar Sánchez-Gijón, Míriam Seghiri Domínguez, Violeta Seretan, Arda Tezcan, Olga Torres, and Anna Zaretskaya.
Natural Language Processing and Chinese Computing
Author: Lu Wang
Publisher: Springer Nature
ISBN: 3030884805
Category : Computers
Languages : en
Pages : 861
Book Description
This two-volume set of LNAI 13028 and LNAI 13029 constitutes the refereed proceedings of the 10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021, held in Qingdao, China, in October 2021. The 66 full papers, 23 poster papers, and 27 workshop papers presented were carefully reviewed and selected from 446 submissions. They are organized in the following areas: Fundamentals of NLP; Machine Translation and Multilinguality; Machine Learning for NLP; Information Extraction and Knowledge Graph; Summarization and Generation; Question Answering; Dialogue Systems; Social Media and Sentiment Analysis; NLP Applications and Text Mining; and Multimodality and Explainability.
Publisher: Springer Nature
ISBN: 3030884805
Category : Computers
Languages : en
Pages : 861
Book Description
This two-volume set of LNAI 13028 and LNAI 13029 constitutes the refereed proceedings of the 10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021, held in Qingdao, China, in October 2021. The 66 full papers, 23 poster papers, and 27 workshop papers presented were carefully reviewed and selected from 446 submissions. They are organized in the following areas: Fundamentals of NLP; Machine Translation and Multilinguality; Machine Learning for NLP; Information Extraction and Knowledge Graph; Summarization and Generation; Question Answering; Dialogue Systems; Social Media and Sentiment Analysis; NLP Applications and Text Mining; and Multimodality and Explainability.