Author: Nancy Ide
Publisher: Springer
ISBN: 9402408819
Category : Language Arts & Disciplines
Languages : en
Pages : 1440
Book Description
This handbook offers a thorough treatment of the science of linguistic annotation. Leaders in the field guide the reader through the process of modeling, creating an annotation language, building a corpus and evaluating it for correctness. Essential reading for both computer scientists and linguistic researchers.Linguistic annotation is an increasingly important activity in the field of computational linguistics because of its critical role in the development of language models for natural language processing applications. Part one of this book covers all phases of the linguistic annotation process, from annotation scheme design and choice of representation format through both the manual and automatic annotation process, evaluation, and iterative improvement of annotation accuracy. The second part of the book includes case studies of annotation projects across the spectrum of linguistic annotation types, including morpho-syntactic tagging, syntactic analyses, a range of semantic analyses (semantic roles, named entities, sentiment and opinion), time and event and spatial analyses, and discourse level analyses including discourse structure, co-reference, etc. Each case study addresses the various phases and processes discussed in the chapters of part one.
Handbook of Linguistic Annotation
Language Corpora Annotation and Processing
Author: Niladri Sekhar Dash
Publisher: Springer Nature
ISBN: 9811629609
Category : Language Arts & Disciplines
Languages : en
Pages : 292
Book Description
This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.
Publisher: Springer Nature
ISBN: 9811629609
Category : Language Arts & Disciplines
Languages : en
Pages : 292
Book Description
This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.
A Practical Handbook of Corpus Linguistics
Author: Magali Paquot
Publisher: Springer Nature
ISBN: 3030462161
Category : Philosophy
Languages : en
Pages : 671
Book Description
This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.
Publisher: Springer Nature
ISBN: 3030462161
Category : Philosophy
Languages : en
Pages : 671
Book Description
This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.
The Handbook of Computational Linguistics and Natural Language Processing
Author: Alexander Clark
Publisher: John Wiley & Sons
ISBN: 1118448677
Category : Language Arts & Disciplines
Languages : en
Pages : 802
Book Description
This comprehensive reference work provides an overview of the concepts, methodologies, and applications in computational linguistics and natural language processing (NLP). Features contributions by the top researchers in the field, reflecting the work that is driving the discipline forward Includes an introduction to the major theoretical issues in these fields, as well as the central engineering applications that the work has produced Presents the major developments in an accessible way, explaining the close connection between scientific understanding of the computational properties of natural language and the creation of effective language technologies Serves as an invaluable state-of-the-art reference source for computational linguists and software engineers developing NLP applications in industrial research and development labs of software companies
Publisher: John Wiley & Sons
ISBN: 1118448677
Category : Language Arts & Disciplines
Languages : en
Pages : 802
Book Description
This comprehensive reference work provides an overview of the concepts, methodologies, and applications in computational linguistics and natural language processing (NLP). Features contributions by the top researchers in the field, reflecting the work that is driving the discipline forward Includes an introduction to the major theoretical issues in these fields, as well as the central engineering applications that the work has produced Presents the major developments in an accessible way, explaining the close connection between scientific understanding of the computational properties of natural language and the creation of effective language technologies Serves as an invaluable state-of-the-art reference source for computational linguists and software engineers developing NLP applications in industrial research and development labs of software companies
The Cambridge Handbook of Learner Corpus Research
Author: Sylviane Granger
Publisher: Cambridge University Press
ISBN: 1316432149
Category : Language Arts & Disciplines
Languages : en
Pages : 1199
Book Description
The origins of learner corpus research go back to the late 1980s when large electronic collections of written or spoken data started to be collected from foreign/second language learners, with a view to advancing our understanding of the mechanisms of second language acquisition and developing tailor-made pedagogical tools. Engaging with the interdisciplinary nature of this fast-growing field, The Cambridge Handbook of Learner Corpus Research explores the diverse and extensive applications of learner corpora, with 27 chapters written by internationally renowned experts. This comprehensive work is a vital resource for students, teachers and researchers, offering fresh perspectives and a unique overview of the field. With representative studies in each chapter which provide an essential guide on how to conduct learner corpus research in a wide range of areas, this work is a cutting-edge account of learner corpus collection, annotation, methodology, theory, analysis and applications.
Publisher: Cambridge University Press
ISBN: 1316432149
Category : Language Arts & Disciplines
Languages : en
Pages : 1199
Book Description
The origins of learner corpus research go back to the late 1980s when large electronic collections of written or spoken data started to be collected from foreign/second language learners, with a view to advancing our understanding of the mechanisms of second language acquisition and developing tailor-made pedagogical tools. Engaging with the interdisciplinary nature of this fast-growing field, The Cambridge Handbook of Learner Corpus Research explores the diverse and extensive applications of learner corpora, with 27 chapters written by internationally renowned experts. This comprehensive work is a vital resource for students, teachers and researchers, offering fresh perspectives and a unique overview of the field. With representative studies in each chapter which provide an essential guide on how to conduct learner corpus research in a wide range of areas, this work is a cutting-edge account of learner corpus collection, annotation, methodology, theory, analysis and applications.
Developing Linguistic Corpora
Author: Martin Wynne
Publisher: Oxbow Books Limited
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 100
Book Description
A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.
Publisher: Oxbow Books Limited
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 100
Book Description
A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.
Corpus Pragmatics
Author: Karin Aijmer
Publisher: Cambridge University Press
ISBN: 1107015049
Category : Language Arts & Disciplines
Languages : en
Pages : 481
Book Description
The first handbook to survey and expand the burgeoning field of corpus pragmatics, the intersection of pragmatics and corpus linguistics.
Publisher: Cambridge University Press
ISBN: 1107015049
Category : Language Arts & Disciplines
Languages : en
Pages : 481
Book Description
The first handbook to survey and expand the burgeoning field of corpus pragmatics, the intersection of pragmatics and corpus linguistics.
Natural Language Annotation for Machine Learning
Author: James Pustejovsky
Publisher: "O'Reilly Media, Inc."
ISBN: 1449306667
Category : Computers
Languages : en
Pages : 344
Book Description
Includes bibliographical references (p. 305-315) and index.
Publisher: "O'Reilly Media, Inc."
ISBN: 1449306667
Category : Computers
Languages : en
Pages : 344
Book Description
Includes bibliographical references (p. 305-315) and index.
Corpus Linguistics and Linguistically Annotated Corpora
Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
ISBN: 1441119809
Category : Language Arts & Disciplines
Languages : en
Pages : 321
Book Description
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.
Publisher: Bloomsbury Publishing
ISBN: 1441119809
Category : Language Arts & Disciplines
Languages : en
Pages : 321
Book Description
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.
Linguistic Fieldwork
Author: C. Bowern
Publisher: Springer
ISBN: 1137340800
Category : Language Arts & Disciplines
Languages : en
Pages : 297
Book Description
Linguistic Fieldwork offers practical guidance on areas such as applying for funding, the first session on a new language, writing up the data and returning materials to communities. This expanded second edition provides new content on the results of research, on prosody elicitation, on field experiment design, and on working in complex syntax.
Publisher: Springer
ISBN: 1137340800
Category : Language Arts & Disciplines
Languages : en
Pages : 297
Book Description
Linguistic Fieldwork offers practical guidance on areas such as applying for funding, the first session on a new language, writing up the data and returning materials to communities. This expanded second edition provides new content on the results of research, on prosody elicitation, on field experiment design, and on working in complex syntax.