Author: Deyi Xiong
Publisher: Springer
ISBN: 9812873562
Category : Language Arts & Disciplines
Languages : en
Pages : 159
Book Description
This book provides a wide variety of algorithms and models to integrate linguistic knowledge into Statistical Machine Translation (SMT). It helps advance conventional SMT to linguistically motivated SMT by enhancing the following three essential components: translation, reordering and bracketing models. It also serves the purpose of promoting the in-depth study of the impacts of linguistic knowledge on machine translation. Finally it provides a systematic introduction of Bracketing Transduction Grammar (BTG) based SMT, one of the state-of-the-art SMT formalisms, as well as a case study of linguistically motivated SMT on a BTG-based platform.
Linguistically Motivated Statistical Machine Translation
Author: Deyi Xiong
Publisher: Springer
ISBN: 9812873562
Category : Language Arts & Disciplines
Languages : en
Pages : 159
Book Description
This book provides a wide variety of algorithms and models to integrate linguistic knowledge into Statistical Machine Translation (SMT). It helps advance conventional SMT to linguistically motivated SMT by enhancing the following three essential components: translation, reordering and bracketing models. It also serves the purpose of promoting the in-depth study of the impacts of linguistic knowledge on machine translation. Finally it provides a systematic introduction of Bracketing Transduction Grammar (BTG) based SMT, one of the state-of-the-art SMT formalisms, as well as a case study of linguistically motivated SMT on a BTG-based platform.
Publisher: Springer
ISBN: 9812873562
Category : Language Arts & Disciplines
Languages : en
Pages : 159
Book Description
This book provides a wide variety of algorithms and models to integrate linguistic knowledge into Statistical Machine Translation (SMT). It helps advance conventional SMT to linguistically motivated SMT by enhancing the following three essential components: translation, reordering and bracketing models. It also serves the purpose of promoting the in-depth study of the impacts of linguistic knowledge on machine translation. Finally it provides a systematic introduction of Bracketing Transduction Grammar (BTG) based SMT, one of the state-of-the-art SMT formalisms, as well as a case study of linguistically motivated SMT on a BTG-based platform.
Neural Machine Translation
Author: Philipp Koehn
Publisher: Cambridge University Press
ISBN: 1108497322
Category : Computers
Languages : en
Pages : 409
Book Description
Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.
Publisher: Cambridge University Press
ISBN: 1108497322
Category : Computers
Languages : en
Pages : 409
Book Description
Learn how to build machine translation systems with deep learning from the ground up, from basic concepts to cutting-edge research.
A Machine Translation Approach to Cross Language Text Retrieval
Author: María Gabriela Fernandez-Diaz
Publisher: Universal-Publishers
ISBN: 1581122675
Category : Language Arts & Disciplines
Languages : en
Pages : 137
Book Description
Cross Language Text Retrieval (CLTR) has been defined as the retrieval of documents in a language different from that of the original query. To make this possible some kind of mechanism has to be applied in order to translate the information contained in the source sentence. Many different approaches have been carried out with the purpose of transferring the information from the source language query to the target language one. Though all these methods deal with a way of translating as much information as possible from the source query, little research has been conducted in relation to the field of Machine Translation (MT). The purpose of this research work is to determine the feasibility of using MT techniques for CLTR. Specifically, I will describe how a MT system has been adapted without much effort to translate Spanish queries of a specific domain, i.e. Finance and Economics, into English in order to retrieve documents related to that field. The results of this process will then be compared with the results obtained from the retrieval of the original English queries. Thus, I will discuss the advantages and disadvantages of using MT for CLTR.
Publisher: Universal-Publishers
ISBN: 1581122675
Category : Language Arts & Disciplines
Languages : en
Pages : 137
Book Description
Cross Language Text Retrieval (CLTR) has been defined as the retrieval of documents in a language different from that of the original query. To make this possible some kind of mechanism has to be applied in order to translate the information contained in the source sentence. Many different approaches have been carried out with the purpose of transferring the information from the source language query to the target language one. Though all these methods deal with a way of translating as much information as possible from the source query, little research has been conducted in relation to the field of Machine Translation (MT). The purpose of this research work is to determine the feasibility of using MT techniques for CLTR. Specifically, I will describe how a MT system has been adapted without much effort to translate Spanish queries of a specific domain, i.e. Finance and Economics, into English in order to retrieve documents related to that field. The results of this process will then be compared with the results obtained from the retrieval of the original English queries. Thus, I will discuss the advantages and disadvantages of using MT for CLTR.
Syntax-based Statistical Machine Translation
Author: Philip Williams
Publisher: Morgan & Claypool Publishers
ISBN: 1627055029
Category : Computers
Languages : en
Pages : 211
Book Description
This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.
Publisher: Morgan & Claypool Publishers
ISBN: 1627055029
Category : Computers
Languages : en
Pages : 211
Book Description
This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.
Human Language Technologies – The Baltic Perspective
Author: I. Skadiņa
Publisher: IOS Press
ISBN: 1614997012
Category : Computers
Languages : en
Pages : 188
Book Description
Throughout the last decade, the Baltic states have played an active role in regional and international language technology activities, supporting less-resourced languages in the digital age. This book presents the proceedings of the 7th International Conference: Human Language Technologies – The Baltic Perspective (Baltic HLT 2016), held in Riga, Latvia, in October 2016. Baltic HLT 2016 provided a forum for sharing ideas and recent advances in human language processing with a special focus on less-resourced languages. Papers selected for the conference cover a wide range of topics, including a general overview of language technology progress in the Baltic states, actual research topics in written and spoken language processing, the creation of language resources and their applications, and proposals for a European language platform. The book is divided into five sections: overview; speech technologies and corpora; machine translation; written language resources; and methods and tools for language processing. The book will be a useful resource, not only for Baltic language researchers, but also for those working with other less-resourced languages in Europe and beyond.
Publisher: IOS Press
ISBN: 1614997012
Category : Computers
Languages : en
Pages : 188
Book Description
Throughout the last decade, the Baltic states have played an active role in regional and international language technology activities, supporting less-resourced languages in the digital age. This book presents the proceedings of the 7th International Conference: Human Language Technologies – The Baltic Perspective (Baltic HLT 2016), held in Riga, Latvia, in October 2016. Baltic HLT 2016 provided a forum for sharing ideas and recent advances in human language processing with a special focus on less-resourced languages. Papers selected for the conference cover a wide range of topics, including a general overview of language technology progress in the Baltic states, actual research topics in written and spoken language processing, the creation of language resources and their applications, and proposals for a European language platform. The book is divided into five sections: overview; speech technologies and corpora; machine translation; written language resources; and methods and tools for language processing. The book will be a useful resource, not only for Baltic language researchers, but also for those working with other less-resourced languages in Europe and beyond.
Statistical Machine Translation
Author: Philipp Koehn
Publisher: Cambridge University Press
ISBN: 0521874157
Category : Computers
Languages : en
Pages : 447
Book Description
The dream of automatic language translation is now closer thanks to recent advances in the techniques that underpin statistical machine translation. This class-tested textbook from an active researcher in the field, provides a clear and careful introduction to the latest methods and explains how to build machine translation systems for any two languages. It introduces the subject's building blocks from linguistics and probability, then covers the major models for machine translation: word-based, phrase-based, and tree-based, as well as machine translation evaluation, language modeling, discriminative training and advanced methods to integrate linguistic annotation. The book also reports the latest research, presents the major outstanding challenges, and enables novices as well as experienced researchers to make novel contributions to this exciting area. Ideal for students at undergraduate and graduate level, or for anyone interested in the latest developments in machine translation.
Publisher: Cambridge University Press
ISBN: 0521874157
Category : Computers
Languages : en
Pages : 447
Book Description
The dream of automatic language translation is now closer thanks to recent advances in the techniques that underpin statistical machine translation. This class-tested textbook from an active researcher in the field, provides a clear and careful introduction to the latest methods and explains how to build machine translation systems for any two languages. It introduces the subject's building blocks from linguistics and probability, then covers the major models for machine translation: word-based, phrase-based, and tree-based, as well as machine translation evaluation, language modeling, discriminative training and advanced methods to integrate linguistic annotation. The book also reports the latest research, presents the major outstanding challenges, and enables novices as well as experienced researchers to make novel contributions to this exciting area. Ideal for students at undergraduate and graduate level, or for anyone interested in the latest developments in machine translation.
Machine Translation: From Real Users to Research
Author: Robert E. Frederking
Publisher: Springer
ISBN: 3540301941
Category : Language Arts & Disciplines
Languages : en
Pages : 291
Book Description
The previous conference in this series (AMTA 2002) took up the theme “From Research to Real Users”, and sought to explore why recent research on data-driven machine translation didn’t seem to be moving to the marketplace. As it turned out, the ?rst commercial products of the data-driven research movement were just over the horizon, andintheinterveningtwoyearstheyhavebeguntoappearinthemarketplace. Atthesame time,rule-basedmachinetranslationsystemsareintroducingdata-driventechniquesinto the mix in their products. Machine translation as a software application has a 50-year history. There are an increasing number of exciting deployments of MT, many of which will be exhibited and discussed at the conference. But the scale of commercial use has never approached the estimates of the latent demand. In light of this, we reversed the question from AMTA 2002, to look at the next step in the path to commercial success for MT. We took user needs as our theme, and explored how or whether market requirements are feeding into research programs. The transition of research discoveries to practical use involves te- nicalquestionsthatarenotassexyasthosethathavedriventheresearchcommunityand research funding. Important product issues such as system customizability, computing resource requirements, and usability and ?tness for particular tasks need to engage the creativeenergiesofallpartsofourcommunity,especiallyresearch,aswemovemachine translation from a niche application to a more pervasive language conversion process. Thesetopicswereaddressedattheconferencethroughthepaperscontainedinthesep- ceedings, and even more speci?cally through several invited presentations and panels.
Publisher: Springer
ISBN: 3540301941
Category : Language Arts & Disciplines
Languages : en
Pages : 291
Book Description
The previous conference in this series (AMTA 2002) took up the theme “From Research to Real Users”, and sought to explore why recent research on data-driven machine translation didn’t seem to be moving to the marketplace. As it turned out, the ?rst commercial products of the data-driven research movement were just over the horizon, andintheinterveningtwoyearstheyhavebeguntoappearinthemarketplace. Atthesame time,rule-basedmachinetranslationsystemsareintroducingdata-driventechniquesinto the mix in their products. Machine translation as a software application has a 50-year history. There are an increasing number of exciting deployments of MT, many of which will be exhibited and discussed at the conference. But the scale of commercial use has never approached the estimates of the latent demand. In light of this, we reversed the question from AMTA 2002, to look at the next step in the path to commercial success for MT. We took user needs as our theme, and explored how or whether market requirements are feeding into research programs. The transition of research discoveries to practical use involves te- nicalquestionsthatarenotassexyasthosethathavedriventheresearchcommunityand research funding. Important product issues such as system customizability, computing resource requirements, and usability and ?tness for particular tasks need to engage the creativeenergiesofallpartsofourcommunity,especiallyresearch,aswemovemachine translation from a niche application to a more pervasive language conversion process. Thesetopicswereaddressedattheconferencethroughthepaperscontainedinthesep- ceedings, and even more speci?cally through several invited presentations and panels.
Computational Linguistics and Intelligent Text Processing
Author: Alexander Gelbukh
Publisher: Springer
ISBN: 3642003826
Category : Computers
Languages : en
Pages : 619
Book Description
th CICLing 2009 markedthe 10 anniversary of the Annual Conference on Intel- gent Text Processing and Computational Linguistics. The CICLing conferences provide a wide-scope forum for the discussion of the art and craft of natural language processing research as well as the best practices in its applications. This volume contains ?ve invited papers and the regular papers accepted for oral presentation at the conference. The papers accepted for poster presentation were published in a special issue of another journal (see the website for more information). Since 2001, the proceedings of CICLing conferences have been published in Springer’s Lecture Notes in Computer Science series, as volumes 2004, 2276, 2588, 2945, 3406, 3878, 4394, and 4919. This volume has been structured into 12 sections: – Trends and Opportunities – Linguistic Knowledge Representation Formalisms – Corpus Analysis and Lexical Resources – Extraction of Lexical Knowledge – Morphology and Parsing – Semantics – Word Sense Disambiguation – Machine Translation and Multilinguism – Information Extraction and Text Mining – Information Retrieval and Text Comparison – Text Summarization – Applications to the Humanities A total of 167 papers by 392 authors from 40 countries were submitted for evaluation by the International Program Committee, see Tables 1 and 2. This volume contains revised versions of 44 papers, by 120 authors, selected for oral presentation; the acceptance rate was 26. 3%.
Publisher: Springer
ISBN: 3642003826
Category : Computers
Languages : en
Pages : 619
Book Description
th CICLing 2009 markedthe 10 anniversary of the Annual Conference on Intel- gent Text Processing and Computational Linguistics. The CICLing conferences provide a wide-scope forum for the discussion of the art and craft of natural language processing research as well as the best practices in its applications. This volume contains ?ve invited papers and the regular papers accepted for oral presentation at the conference. The papers accepted for poster presentation were published in a special issue of another journal (see the website for more information). Since 2001, the proceedings of CICLing conferences have been published in Springer’s Lecture Notes in Computer Science series, as volumes 2004, 2276, 2588, 2945, 3406, 3878, 4394, and 4919. This volume has been structured into 12 sections: – Trends and Opportunities – Linguistic Knowledge Representation Formalisms – Corpus Analysis and Lexical Resources – Extraction of Lexical Knowledge – Morphology and Parsing – Semantics – Word Sense Disambiguation – Machine Translation and Multilinguism – Information Extraction and Text Mining – Information Retrieval and Text Comparison – Text Summarization – Applications to the Humanities A total of 167 papers by 392 authors from 40 countries were submitted for evaluation by the International Program Committee, see Tables 1 and 2. This volume contains revised versions of 44 papers, by 120 authors, selected for oral presentation; the acceptance rate was 26. 3%.
Human Language Technologies - The Baltic Perspective
Author: A. Utka
Publisher: IOS Press
ISBN: 1614994420
Category : Computers
Languages : en
Pages : 276
Book Description
In the modern information society, there is an ever-growing need for improved natural language processing and human language technologies.This book presents the proceedings of the Sixth International Conference 'Human Language Technologies – The Baltic Perspective' (Baltic HLT 2014) held in Kaunas, Lithuania in September 2014. The Baltic HLT conferences provide an important forum for gathering and consolidating ideas, and are an opportunity for the Baltic countries to present important research results to an international audience. The book contains 39 long and short papers presented at the conference. These cover a wide range of topics: syntactic analysis, sentiment analysis, co-reference resolution, authorship attribution, information extraction, document clustering, machine translation, corpus and parallel corpus compiling, speech recognition, synthesis and others. The book is divided into three main sections: speech technology, methods in computational linguistics, and preparation of language resources. This book will be of interest to anyone whose work involves the use and application of computational linguistics and related disciplines.
Publisher: IOS Press
ISBN: 1614994420
Category : Computers
Languages : en
Pages : 276
Book Description
In the modern information society, there is an ever-growing need for improved natural language processing and human language technologies.This book presents the proceedings of the Sixth International Conference 'Human Language Technologies – The Baltic Perspective' (Baltic HLT 2014) held in Kaunas, Lithuania in September 2014. The Baltic HLT conferences provide an important forum for gathering and consolidating ideas, and are an opportunity for the Baltic countries to present important research results to an international audience. The book contains 39 long and short papers presented at the conference. These cover a wide range of topics: syntactic analysis, sentiment analysis, co-reference resolution, authorship attribution, information extraction, document clustering, machine translation, corpus and parallel corpus compiling, speech recognition, synthesis and others. The book is divided into three main sections: speech technology, methods in computational linguistics, and preparation of language resources. This book will be of interest to anyone whose work involves the use and application of computational linguistics and related disciplines.
Mobile Speech and Advanced Natural Language Solutions
Author: Amy Neustein
Publisher: Springer Science & Business Media
ISBN: 1461460182
Category : Technology & Engineering
Languages : en
Pages : 373
Book Description
"Mobile Speech and Advanced Natural Language Solutions" presents the discussion of the most recent advances in intelligent human-computer interaction, including fascinating new study findings on talk-in-interaction, which is the province of conversation analysis, a subfield in sociology/sociolinguistics, a new and emerging area in natural language understanding. Editors Amy Neustein and Judith A. Markowitz have recruited a talented group of contributors to introduce the next generation natural language technologies for practical speech processing applications that serve the consumer’s need for well-functioning natural language-driven personal assistants and other mobile devices, while also addressing business’ need for better functioning IVR-driven call centers that yield a more satisfying experience for the caller. This anthology is aimed at two distinct audiences: one consisting of speech engineers and system developers; the other comprised of linguists and cognitive scientists. The text builds on the experience and knowledge of each of these audiences by exposing them to the work of the other.
Publisher: Springer Science & Business Media
ISBN: 1461460182
Category : Technology & Engineering
Languages : en
Pages : 373
Book Description
"Mobile Speech and Advanced Natural Language Solutions" presents the discussion of the most recent advances in intelligent human-computer interaction, including fascinating new study findings on talk-in-interaction, which is the province of conversation analysis, a subfield in sociology/sociolinguistics, a new and emerging area in natural language understanding. Editors Amy Neustein and Judith A. Markowitz have recruited a talented group of contributors to introduce the next generation natural language technologies for practical speech processing applications that serve the consumer’s need for well-functioning natural language-driven personal assistants and other mobile devices, while also addressing business’ need for better functioning IVR-driven call centers that yield a more satisfying experience for the caller. This anthology is aimed at two distinct audiences: one consisting of speech engineers and system developers; the other comprised of linguists and cognitive scientists. The text builds on the experience and knowledge of each of these audiences by exposing them to the work of the other.