Corpus Linguistics and Linguistically Annotated Corpora

Corpus Linguistics and Linguistically Annotated Corpora PDF Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
ISBN: 1441119914
Category : Language Arts & Disciplines
Languages : en
Pages : 321

Get Book

Book Description
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.

Corpus Linguistics and Linguistically Annotated Corpora

Corpus Linguistics and Linguistically Annotated Corpora PDF Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
ISBN: 1441119914
Category : Language Arts & Disciplines
Languages : en
Pages : 321

Get Book

Book Description
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.

Corpus Linguistics and Linguistically Annotated Corpora

Corpus Linguistics and Linguistically Annotated Corpora PDF Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
ISBN: 1441119809
Category : Language Arts & Disciplines
Languages : en
Pages : 321

Get Book

Book Description
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.

Corpus Linguistics and Linguistically Annotated Corpora

Corpus Linguistics and Linguistically Annotated Corpora PDF Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
ISBN: 1441116753
Category : Language Arts & Disciplines
Languages : en
Pages : 321

Get Book

Book Description
Introduces corpus linguistics with a focus on linguistically annotated corpora, enabling analysis of a wide range of linguistic phenomena.

Corpus Linguistics and Linguistically Annotated Corpora

Corpus Linguistics and Linguistically Annotated Corpora PDF Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
ISBN: 1441164472
Category : Language Arts & Disciplines
Languages : en
Pages : 321

Get Book

Book Description
Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.

Spoken Corpora and Linguistic Studies

Spoken Corpora and Linguistic Studies PDF Author: Tommaso Raso
Publisher: John Benjamins Publishing Company
ISBN: 9027270031
Category : Language Arts & Disciplines
Languages : en
Pages : 508

Get Book

Book Description
The authors of this book share a common interest in the following topics: the importance of corpora compilation for the empirical study of human language; the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory; and a passionate belief in the central role of prosody for the analysis of speech. Four distinct sections (spoken corpora compilation; spoken corpora annotation; prosody; and syntax and information structure) give the book the structure in which the authors present innovative methodologies that focus on the compilation of third generation spoken corpora; multilevel spoken corpora annotation and its functions; and additionally a debate is initiated about the reference unit in the study of spoken language via information structure. The book is accompanied by a web site with a rich array of audio/video files. The web site can be found at the following address: DOI: 10.1075/scl.61.media

Corpus Annotation

Corpus Annotation PDF Author: R. G. Garside
Publisher: Routledge
ISBN: 9781138148581
Category : Computational linguistics
Languages : en
Pages : 0

Get Book

Book Description
Corpus Annotation gives an up-to-date picture of this fascinating new area of research, and will provide essential reading for newcomers to the field as well as those already involved in corpus annotation. Early chapters introduce the different levels and techniques of corpus annotation. Later chapters deal with software developments, applications, and the development of standards for the evaluation of corpus annotation. While the book takes detailed account of research world-wide, its focus is particularly on the work of the UCREL (University Centre for Computer Corpus Research on Language) team at Lancaster University, which has been at the forefront of developments in the field of corpus annotation since its beginnings in the 1970s.

Corpus Linguistics and Linguistic Theory

Corpus Linguistics and Linguistic Theory PDF Author: Christian Mair
Publisher: Rodopi
ISBN: 9789042014930
Category : Computers
Languages : en
Pages : 408

Get Book

Book Description
From being the occupation of a marginal (and frequently marginalised) group of researchers, the linguistic analysis of machine-readable language corpora has moved to the mainstream of research on the English language. In this process an impressive body of results has accumulated which, over and above the intrinsic descriptive interest it holds for students of the English language, forces a major and systematic re-thinking of foundational issues in linguistic theory. Corpus linguistics and linguistic theory was accordingly chosen as the motto for the twentieth annual gathering of ICAME, the International Computer Archive of Modern/ Medieval English, which was hosted by the University of Freiburg (Germany) in 1999. The present volume, which presents selected papers from this conference, thus builds on previous successful work in the computer-aided description of English and at the same time represents an attempt at stock-taking and methodological reflection in a linguistic subdiscipline that has clearly come of age.Contributions cover all levels of linguistic description - from phonology/ prosody, through grammar and semantics to discourse-analytical issues such as genre or gender-specific linguistic usage. They are united by a desire to further the dialogue between the corpus-linguistic community and researchers working in other traditions. Thereby, the atmosphere ranges from undisguised skepticism (as expressed by Noam Chomsky in an interview which is part of the opening contribution by Bas Aarts) to empirically substantiated optimism (as, for example, in Bernadette Vine's significantly titled contribution Getting things done).

Developing Linguistic Corpora

Developing Linguistic Corpora PDF Author: Martin Wynne
Publisher: Oxbow Books Limited
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 100

Get Book

Book Description
A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.

Corpus Linguistics 25 Years on

Corpus Linguistics 25 Years on PDF Author:
Publisher: BRILL
ISBN: 9401204349
Category : Language Arts & Disciplines
Languages : en
Pages : 391

Get Book

Book Description
This volume offers a state-of-the-art picture of work undertaken in the field of computer-aided corpus linguistics. While the focus is on English, central insights can be generalised to other languages, as well. As a work intended to mark the Silver Jubilee of ICAME, the International Computer Archive of Modern and Medieval English, the book combines surveys of the discipline by some of its major pioneers, including founders of ICAME itself, with cutting-edge work by younger scholars. It is divided into three sections: “Overviewing years of corpus linguistic studies”, “Descriptive studies in English syntax and semantics”, and “Second Language Acquisition, parallel corpora and specialist corpora”. The book bears witness to the impressive advances that have characterised the development of corpus linguistics over the past few decades – from terminological issues to practical applications, from theoretical and descriptive research to applied approaches, from monolingual to multilingual and specialist corpora, from corpus design to corpus exploitation tools.

Corpus Analysis

Corpus Analysis PDF Author:
Publisher: BRILL
ISBN: 9004334416
Category : Language Arts & Disciplines
Languages : en
Pages : 294

Get Book

Book Description
The papers published in this volume were originally presented at the Third North American Symposium on Corpus Linguistics and Language Teaching held on 23-25 March 2001 at the Park Plaza Hotel in Boston, Massachusetts. Each paper analyses some aspect of language use or structure in one or more of the many linguistic corpora now available. The number of different corpora investigated in the book is a real testament to the progress that has been made in recent years in developing new corpora, particularly spoken corpora, as over half of the papers deal either wholly or partially with the analysis of spoken data. This book will be of particular interest to undergraduate and graduate students and scholars interested in corpus, socio and applied linguistics, discourse analysis, pragmatics, and language teaching.