Corpus Linguistics Beyond the Word PDF Download

Are you looking for read ebook online? Search for your book and save it on your Kindle device, PC, phones or tablets. Download Corpus Linguistics Beyond the Word PDF full book. Access full book title Corpus Linguistics Beyond the Word by . Download full books in PDF and EPUB format.

Corpus Linguistics Beyond the Word

Author:
Publisher: BRILL
ISBN: 9401203849
Category : Language Arts & Disciplines
Languages : en
Pages : 287

Get Book Here

Book Description
This volume will be of particular interest to readers interested in expanding the applications of corpus linguistics techniques through new tools and approaches. The text includes selected papers from the Fifth North American Symposium, hosted by the Linguistics Department at Montclair State University in Montclair New Jersey in May 2004. The symposium papers represented several areas of corpus studies including language development, syntactic analysis, pragmatics and discourse, language change, register variation, corpus creation and annotation, and practical applications of corpus work, primarily in language teaching, but also in medical training and machine translation. A common thread through most of the papers was the use of corpora to study domains longer than the word. Not surprisingly, fully half of the papers deal with the computational tools and linguistic strategies needed to search for and analyze these longer spans of language while most of the remaining papers examine particular syntactic and rhetorical properties of one or more corpora.

Corpus Linguistics Beyond the Word

Author:
Publisher: BRILL
ISBN: 9401203849
Category : Language Arts & Disciplines
Languages : en
Pages : 287

Get Book Here

The Cambridge Handbook of English Corpus Linguistics

Author: Douglas Biber
Publisher: Cambridge University Press
ISBN: 1316298701
Category : Language Arts & Disciplines
Languages : en
Pages : 757

Get Book Here

Book Description
The Cambridge Handbook of English Corpus Linguistics (CHECL) surveys the breadth of corpus-based linguistic research on English, including chapters on collocations, phraseology, grammatical variation, historical change, and the description of registers and dialects. The most innovative aspects of the CHECL are its emphasis on critical discussion, its explicit evaluation of the state of the art in each sub-discipline, and the inclusion of empirical case studies. While each chapter includes a broad survey of previous research, the primary focus is on a detailed description of the most important corpus-based studies in this area, with discussion of what those studies found, and why they are important. Each chapter also includes a critical discussion of the corpus-based methods employed for research in this area, as well as an explicit summary of new findings and discoveries.

Developing Linguistic Corpora

Author: Martin Wynne
Publisher: Oxbow Books Limited
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 100

Get Book Here

Book Description
A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.

Natural Language Processing for Corpus Linguistics

Author: Jonathan Dunn
Publisher: Cambridge University Press
ISBN: 1009083740
Category : Language Arts & Disciplines
Languages : en
Pages : 149

Get Book Here

Book Description
Corpus analysis can be expanded and scaled up by incorporating computational methods from natural language processing. This Element shows how text classification and text similarity models can extend our ability to undertake corpus linguistics across very large corpora. These computational methods are becoming increasingly important as corpora grow too large for more traditional types of linguistic analysis. We draw on five case studies to show how and why to use computational methods, ranging from usage-based grammar to authorship analysis to using social media for corpus-based sociolinguistics. Each section is accompanied by an interactive code notebook that shows how to implement the analysis in Python. A stand-alone Python package is also available to help readers use these methods with their own data. Because large-scale analysis introduces new ethical problems, this Element pairs each new methodology with a discussion of potential ethical implications.

Doing Corpus Linguistics

Author: Eniko Csomay
Publisher: Taylor & Francis
ISBN: 1003836488
Category : Language Arts & Disciplines
Languages : en
Pages : 191

Get Book Here

Book Description
Doing Corpus Linguistics offers a practical step-by-step introduction to corpus linguistics, making use of widely available corpora and of a register analysis-based theoretical framework to provide students in applied linguistics and TESOL with the understanding and skills necessary to meaningfully analyze corpora and carry out successful corpus-based research. This second edition has been thoroughly revised and updated with fresh exercises, examples, and references, as well as an extensive list of English corpora around the world. It also provides more clarity around the approach used in the book, contains new sections on how to identify patterns in texts, and now covers Cohen’s statistical method. This practical and applied text emphasizes hands-on experience with performing language analysis research and interpreting findings in a meaningful and engaging way. Readers are given multiple opportunities to analyze language data by completing smaller tasks and corpus projects using publicly available corpora. The book also takes readers through the process of building a specialized corpus designed to answer a specific research question and offers detailed information on completing a final research project that includes both a written paper and an oral presentation of the reader’s specific research projects. Doing Corpus Linguistics provides students in applied linguistics and TESOL with the opportunity to gain proficiency in the technical and interpretive aspects of corpus research and to encourage them to participate in the growing field of corpus linguistics.

Sociolinguistics and Corpus Linguistics

Author: Paul Baker
Publisher: Edinburgh University Press
ISBN: 0748631461
Category : Language Arts & Disciplines
Languages : en
Pages : 201

Get Book Here

Book Description
This textbook introduces students to the ways in which techniques from corpus linguistics can be used to aid sociolinguistic research. Corpus linguistics shares with variationist sociolinguistics a quantitative approach to the study of variation or differences between populations. It may also complement qualitative traditions of enquiry such as interactional sociolinguistics.This text covers a range of different topics within sociolinguistics:*Analysing demographic variation*Comparing language use across different cultures*Examining language change over time*Studying transcripts of spoken interactions*Identifying attitudes or discourses.Written for undergraduate and postgraduate students of sociolinguistics, or corpus linguists who wish to use corpora to study social phenomena, this textbook examines how corpora can be drawn on to investigate synchronic variation, diachronic change and the construction of discourses. It refers to several classic corpus-based studies as well as the author's own research. Original analyses of a number of corpora including the British National Corpus, the Survey of English Dialects and the Brown family of corpora are complemented by a new corpus of written British English collected around 2006 for the purposes of writing the book.Techniques of analysis like concordancing, keywords and collocations are discussed, along with corpus annotation and statistical procedures such as chi-squared tests and clustering. Paul Baker takes a critical approach to using corpora in sociolinguistics, outlining the limitations of the approach as well as its advantages.

Beyond Concordance Lines

Author: Pascual Pérez-Paredes
Publisher: John Benjamins Publishing Company
ISBN: 902725849X
Category : Language Arts & Disciplines
Languages : en
Pages : 267

Get Book Here

Book Description
In over 30 years of data-driven learning (DDL) research, there has been a growing sophistication in the ways we collect, analyse, and put corpus data to use. This volume takes a three-fold perspective on DDL. It first looks at DDL and its role in informing language learning theory and how it might shed light on the language development process; secondly it addresses how DDL can help us characterise learner language and inform teaching accordingly, and thirdly it showcases practical applications for the use of DDL in classrooms. The contributors to this volume examine a variety of instructional settings and languages across the world. They reflect on theoretical, methodological and classroom implications using both novel and established language learning theories, natural language processing (NLP), longitudinal research designs, and a variety of language learning targets. The present volume is an invitation from some of the leading researchers in DDL to reflect on the research avenues that will define the field in the coming years.

Corpus Linguistics and Statistics with R

Author: Guillaume Desagulier
Publisher: Springer
ISBN: 3319645722
Category : Computers
Languages : en
Pages : 359

Get Book Here

Book Description
This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequencing data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.

Recent Advances in Corpus Linguistics

Author: Lieven Vandelanotte
Publisher: Rodopi
ISBN: 9401211132
Category : Foreign Language Study
Languages : en
Pages : 353

Get Book Here

Book Description
This book is a selection of studies presented at the 33rd International Conference of the International Computer Archive of Modern and Medieval English (ICAME), hosted by the University of Leuven (30 May - 3 June 2012). The strictly refereed and extensively revised contributions collected here represent recent advances in corpus linguistics, both in the development of specialist corpora and in ways of exploiting them for specific purposes. The first part focuses on “Corpus development and corpus interrogation” and features papers on the compilation of new, highly specialized corpora which aim to fill gaps in historical databases, and on new ways of extracting relevant patterns automatically from computerized datasets. The second part, devoted to “Specialist corpora”, presents detailed descriptive studies on grammatical patterns in World Englishes, on neology, and – using a contrastive approach – on prepositions and cohesive conjunctions. The third and final part on “Second language acquisition” groups together studies situated at the intersection of corpus linguistics and educational linguistics and dealing with markers of relevance and lesser relevance in lectures, deceptive cognates, the automatic annotation of native and non-native uses of demonstrative this and that, and measuring learners’ progress in speech and in writing. Each contribution in its own way reports on novel ways of getting mileage out of specialist corpora, and collectively the contributions attest to the rude health of computerized corpus linguistic studies.

Legal Linguistics Beyond Borders: Language and Law in a World of Media, Globalisation and Social Conflicts

Author: Friedemann Vogel
Publisher: Duncker & Humblot
ISBN: 342855423X
Category : Law
Languages : en
Pages : 385

Get Book Here

Book Description
The world of law has changed in the last decades: it has become more globalized, multilingual and digital. The sections and contributions of this volume continue the interdisciplinary discussion about the challenges of this change for theory and practice of law and for the International Language and Law Association (ILLA) relaunched in 2017. First, the book gives a broad overview to the research field of legal linguistics, its history, research directions and open questions in different parts of the world (United States, Africa, Italy, Spain, Germany, Nordic countries and Russia). The second section consists of contributions about the relation of language, law and justice in a globalized world with a focus on multilingual and supranational law in the EU. The third section focuses on digitalization and mediatization of the law, the last section reports about the discussion at the ILLA relaunch conference in 2017.