Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora PDF Author: J. Beal
Publisher: Springer
ISBN: 0230223931
Category : Language Arts & Disciplines
Languages : en
Pages : 266

Get Book Here

Book Description
A range of electronic corpora is increasingly accessible via the WWW and CD-ROM. This development coincided with improved standards governing the collecting, encoding and archiving of such data. This book looks at developing similar standards for enriching and preserving unconventional data: dialects, child language and bilingual databases.

Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora PDF Author: J. Beal
Publisher: Springer
ISBN: 0230223931
Category : Language Arts & Disciplines
Languages : en
Pages : 266

Get Book Here

Book Description
A range of electronic corpora is increasingly accessible via the WWW and CD-ROM. This development coincided with improved standards governing the collecting, encoding and archiving of such data. This book looks at developing similar standards for enriching and preserving unconventional data: dialects, child language and bilingual databases.

Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora PDF Author: Karen P. Corrigan
Publisher: Palgrave Macmillan
ISBN: 9781137386441
Category : Language Arts & Disciplines
Languages : en
Pages : 359

Get Book Here

Book Description
This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to the importance of developing standards for enriching and preserving other types of corpus data, such as that which captures the nuances of regional dialects, for example. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.

Creating and Digitizing Language Corpora: Diachronic databases

Creating and Digitizing Language Corpora: Diachronic databases PDF Author: Joan C. Beal
Publisher:
ISBN:
Category : Computational linguistics
Languages : en
Pages : 0

Get Book Here

Book Description


Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig

Corpus Design and Construction in Minoritised Language Contexts - Cynllunio a Chreu Corpws mewn Cyd-destunau Ieithoedd Lleiafrifoledig PDF Author: Dawn Knight
Publisher: Springer Nature
ISBN: 3030724840
Category : Language Arts & Disciplines
Languages : en
Pages : 178

Get Book Here

Book Description
This bilingual book provides a detailed overview of the project to construct a National Corpus of Contemporary Welsh (CorCenCC), addressing the conceptual and methodological challenges faced when developing language corpora for minoritised languages. A conceptual framework is presented for the user-driven design that underpinned the CorCenCC project, along with a detailed blueprint that can function as a scaffold for other researchers embarking on projects of this nature. This book will be of value to those working in language teaching, learning and assessment, language policy and planning, translation, corpus linguistics and language technology, and to anyone with an interest in Welsh and other minoritised languages. Mae'r llyfr dwyieithog hwn yn rhoi trosolwg manwl o'r prosiect i greu Corpws Cenedlaethol Cymraeg Cyfoes (CorCenCC), ac yn mynd i'r afael â'r heriau cysyniadol a methodolegol a wynebir wrth ddatblygu corpora iaith ar gyfer ieithoedd lleiafrifoledig. Cyflwynir fframwaith cysyniadol ar gyfer y cynllun wedi'i yrru gan ddefnyddwyr sy'n greiddiol i brosiect CorCenCC, ynghyd â glasbrint manwl a all weithredu fel sgaffald i ymchwilwyr eraill sy'n dechrau ar brosiectau o'r fath. Bydd y llyfr hwn o werth i'r rhai sy'n gweithio ym meysydd addysgu, dysgu ac asesu ieithoedd, polisi iaith a chynllunio ieithyddol, cyfieithu, ieithyddiaeth gorpws a thechnoleg iaith, ac unrhyw un â diddordeb yn y Gymraeg ac ieithoedd lleiafrifoledig eraill.

Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora PDF Author: Karen P. Corrigan
Publisher: Palgrave Macmillan
ISBN: 9781137386441
Category : Language Arts & Disciplines
Languages : en
Pages : 0

Get Book Here

Book Description
This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to the importance of developing standards for enriching and preserving other types of corpus data, such as that which captures the nuances of regional dialects, for example. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.

Building a National Corpus

Building a National Corpus PDF Author: Dawn Knight
Publisher: Springer Nature
ISBN: 3030818586
Category : Language Arts & Disciplines
Languages : en
Pages : 192

Get Book Here

Book Description
This book aims to provide a micro-level, working model of a methodological approach and practical guidelines for building a corpus, informed by the work on the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes - the National Corpus of Contemporary Welsh). It focuses specifically on the development of detailed design frames for corpora across communicative modes (spoken, written and e-language), and the practical processes involved in the planning, collection, transcription, collation and (re)presentation of language data. The book is designed to be of significant value and relevance to those interested in critically engaging with corpus methodology. Although Welsh is the language under discussion, the processes and approaches discussed in the building of CorCenCC can be applied to a lesser or greater extent to other language contexts. This book provides a working model, and an account of how to build a corpus dataset from which step by step guidelines for creating other linguistic corpora in any language can be easily extrapolated. It will be of value to students and scholars of minority languages and corpus linguistics.

The Handbook of Language Variation and Change

The Handbook of Language Variation and Change PDF Author: J. K. Chambers
Publisher: John Wiley & Sons
ISBN: 1119457084
Category : Language Arts & Disciplines
Languages : en
Pages : 628

Get Book Here

Book Description
Reflecting a multitude of developments in the study of language change and variation over the last ten years, this extensively updated second edition features a number of new chapters and remains the authoritative reference volume on a core research area in linguistics. A fully revised and expanded edition of this acclaimed reference work, which has established its reputation based on its unrivalled scope and depth of analysis in this interdisciplinary field Includes seven new chapters, while the remainder have undergone thorough revision and updating to incorporate the latest research and reflect numerous developments in the field Accessibly structured by theme, covering topics including data collection and evaluation, linguistic structure, language and time, language contact, language domains, and social differentiation Brings together an experienced, international editorial and contributor team to provides an unrivalled learning, teaching and reference tool for researchers and students in sociolinguistics

A Taste for Corpora

A Taste for Corpora PDF Author: Fanny Meunier
Publisher: John Benjamins Publishing
ISBN: 9027203504
Category : Language Arts & Disciplines
Languages : en
Pages : 313

Get Book Here

Book Description
The eleven contributions to this volume, written by expert corpus linguists, tackle corpora from a wide range of perspectives and aim to shed light on the numerous linguistic and pedagogical uses to which corpora can be put. They present cutting-edge research in the authors respective domain of expertise and suggest directions for future research. The main focus of the book is on learner corpora, but it also includes reflections on the role of other types of corpora, such as native corpora, expert users corpora, parallel corpora or corpora of New Englishes. For readers who are already familiar with corpora, this volume offers an informed account of the key role that corpus data play in applied linguistics today. As for readers who are new to corpus linguistics, the overview of approaches, methods and domains of applications presented will undoubtedly help them develop their own taste for corpora. This volume has been edited in honour of Sylviane Granger, who has been one of the pioneers of learner corpus research."

Translation-Driven Corpora

Translation-Driven Corpora PDF Author: Federico Zanettin
Publisher: Routledge
ISBN: 1317639847
Category : Language Arts & Disciplines
Languages : en
Pages : 209

Get Book Here

Book Description
Electronic texts and text analysis tools have opened up a wealth of opportunities to higher education and language service providers, but learning to use these resources continues to pose challenges to scholars and professionals alike. Translation-Driven Corpora aims to introduce readers to corpus tools and methods which may be used in translation research and practice. Each chapter focuses on specific aspects of corpus creation and use. An introduction to corpora and overview of applications of corpus linguistics methodologies to translation studies is followed by a discussion of corpus design and acquisition. Different stages and tools involved in corpus compilation and use are outlined, from corpus encoding and annotation to indexing and data retrieval, and the various methods and techniques that allow end users to make sense of corpus data are described. The volume also offers detailed guidelines for the construction and analysis of multilingual corpora. Corpus creation and use are illustrated through practical examples and case studies, with each chapter outlining a set of tasks aimed at guiding researchers, students and translators to practice some of the methods and use some of the resources discussed. These tasks are meant as hands-on activities to be carried out using the materials and links available in an accompanying DVD. Suggested further readings at the end of each chapter are complemented by an extensive bibliography at the end of the volume. Translation-Driven Corpora is designed for use by teachers and students in the classroom or by researchers and professionals for self-learning. It is an invaluable resource for anyone interested in this fast growing area of scholarly and professional activity.

A Practical Handbook of Corpus Linguistics

A Practical Handbook of Corpus Linguistics PDF Author: Magali Paquot
Publisher: Springer Nature
ISBN: 3030462161
Category : Philosophy
Languages : en
Pages : 686

Get Book Here

Book Description
This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.