Author: Stefanowitsch, Anatol
Publisher: Language Science Press
ISBN: 3961102244
Category : Language Arts & Disciplines
Languages : en
Pages : 510
Book Description
Corpora are used widely in linguistics, but not always wisely. This book attempts to frame corpus linguistics systematically as a variant of the observational method. The first part introduces the reader to the general methodological discussions surrounding corpus data as well as the practice of doing corpus linguistics, including issues such as the scientific research cycle, research design, extraction of corpus data and statistical evaluation. The second part consists of a number of case studies from the main areas of corpus linguistics (lexical associations, morphology, grammar, text and metaphor), surveying the range of issues studied in corpus linguistics while at the same time showing how they fit into the methodology outlined in the first part.
Corpus linguistics
Author: Stefanowitsch, Anatol
Publisher: Language Science Press
ISBN: 3961102244
Category : Language Arts & Disciplines
Languages : en
Pages : 510
Book Description
Corpora are used widely in linguistics, but not always wisely. This book attempts to frame corpus linguistics systematically as a variant of the observational method. The first part introduces the reader to the general methodological discussions surrounding corpus data as well as the practice of doing corpus linguistics, including issues such as the scientific research cycle, research design, extraction of corpus data and statistical evaluation. The second part consists of a number of case studies from the main areas of corpus linguistics (lexical associations, morphology, grammar, text and metaphor), surveying the range of issues studied in corpus linguistics while at the same time showing how they fit into the methodology outlined in the first part.
Publisher: Language Science Press
ISBN: 3961102244
Category : Language Arts & Disciplines
Languages : en
Pages : 510
Book Description
Corpora are used widely in linguistics, but not always wisely. This book attempts to frame corpus linguistics systematically as a variant of the observational method. The first part introduces the reader to the general methodological discussions surrounding corpus data as well as the practice of doing corpus linguistics, including issues such as the scientific research cycle, research design, extraction of corpus data and statistical evaluation. The second part consists of a number of case studies from the main areas of corpus linguistics (lexical associations, morphology, grammar, text and metaphor), surveying the range of issues studied in corpus linguistics while at the same time showing how they fit into the methodology outlined in the first part.
Corpus Linguistics
Author: Tony McEnery
Publisher: Cambridge University Press
ISBN: 1139502441
Category : Language Arts & Disciplines
Languages : en
Pages : 311
Book Description
Corpus linguistics is the study of language data on a large scale - the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read and an extensive glossary provides easy access to definitions of technical terms used in the text.
Publisher: Cambridge University Press
ISBN: 1139502441
Category : Language Arts & Disciplines
Languages : en
Pages : 311
Book Description
Corpus linguistics is the study of language data on a large scale - the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read and an extensive glossary provides easy access to definitions of technical terms used in the text.
Corpus Linguistics and Statistics with R
Author: Guillaume Desagulier
Publisher: Springer
ISBN: 3319645722
Category : Computers
Languages : en
Pages : 359
Book Description
This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequencing data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.
Publisher: Springer
ISBN: 3319645722
Category : Computers
Languages : en
Pages : 359
Book Description
This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequencing data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.
Statistics in Corpus Linguistics
Author: Vaclav Brezina
Publisher: Cambridge University Press
ISBN: 1107125707
Category : Foreign Language Study
Languages : en
Pages : 317
Book Description
A comprehensive and accessible introduction to statistics in corpus linguistics, covering multiple techniques of quantitative language analysis and data visualisation.
Publisher: Cambridge University Press
ISBN: 1107125707
Category : Foreign Language Study
Languages : en
Pages : 317
Book Description
A comprehensive and accessible introduction to statistics in corpus linguistics, covering multiple techniques of quantitative language analysis and data visualisation.
Corpus Linguistics
Author: Douglas Biber
Publisher: Cambridge University Press
ISBN: 9780521499576
Category : Computers
Languages : en
Pages : 324
Book Description
An investigation into the way people use language in speech and writing, this volume introduces the corpus-based approach, which is based on analysis of large databases of real language examples stored on computer.
Publisher: Cambridge University Press
ISBN: 9780521499576
Category : Computers
Languages : en
Pages : 324
Book Description
An investigation into the way people use language in speech and writing, this volume introduces the corpus-based approach, which is based on analysis of large databases of real language examples stored on computer.
Quantitative Corpus Linguistics with R
Author: Stefan Th. Gries
Publisher: Routledge
ISBN: 1135895600
Category : Education
Languages : en
Pages : 257
Book Description
The first textbook of its kind, Quantitative Corpus Linguistics with R demonstrates how to use the open source programming language R for corpus linguistic analyses. Computational and corpus linguists doing corpus work will find that R provides an enormous range of functions that currently require several programs to achieve – searching and processing corpora, arranging and outputting the results of corpus searches, statistical evaluation, and graphing.
Publisher: Routledge
ISBN: 1135895600
Category : Education
Languages : en
Pages : 257
Book Description
The first textbook of its kind, Quantitative Corpus Linguistics with R demonstrates how to use the open source programming language R for corpus linguistic analyses. Computational and corpus linguists doing corpus work will find that R provides an enormous range of functions that currently require several programs to achieve – searching and processing corpora, arranging and outputting the results of corpus searches, statistical evaluation, and graphing.
Applications of Pattern-driven Methods in Corpus Linguistics
Author: Joanna Kopaczyk
Publisher: John Benjamins Publishing Company
ISBN: 9027264562
Category : Language Arts & Disciplines
Languages : en
Pages : 323
Book Description
The use of corpora has conventionally been envisioned as being either corpus-based or corpus-driven. While the formal definition of the latter term has been widely accepted since it was established by Tognini-Bonelli (2001), it is often applied to studies that do not, in fact, fullfil the fundamental requirement of a theory-neutral starting point. This volume proposes the term pattern-driven as a more precise alternative. The chapters illustrate a variety of methods that fall under this broad methodology, such as the extraction of lexical bundles, POS-grams and semantic frames, and demonstrate how these approaches can uncover new understandings of both synchronic and diachronic linguistic phenomena.
Publisher: John Benjamins Publishing Company
ISBN: 9027264562
Category : Language Arts & Disciplines
Languages : en
Pages : 323
Book Description
The use of corpora has conventionally been envisioned as being either corpus-based or corpus-driven. While the formal definition of the latter term has been widely accepted since it was established by Tognini-Bonelli (2001), it is often applied to studies that do not, in fact, fullfil the fundamental requirement of a theory-neutral starting point. This volume proposes the term pattern-driven as a more precise alternative. The chapters illustrate a variety of methods that fall under this broad methodology, such as the extraction of lexical bundles, POS-grams and semantic frames, and demonstrate how these approaches can uncover new understandings of both synchronic and diachronic linguistic phenomena.
A Practical Handbook of Corpus Linguistics
Author: Magali Paquot
Publisher: Springer Nature
ISBN: 3030462161
Category : Philosophy
Languages : en
Pages : 686
Book Description
This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.
Publisher: Springer Nature
ISBN: 3030462161
Category : Philosophy
Languages : en
Pages : 686
Book Description
This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.
Corpus Triangulation
Author: Sofia Malamatidou
Publisher: Routledge
ISBN: 1317364570
Category : Language Arts & Disciplines
Languages : en
Pages : 195
Book Description
Despite the recognition that corpus-based translation research would benefit from the triangulation of corpora, little has been done in the direction of actually employing combined corpus data and methods in the field. This book aims to address this gap by providing a much needed detailed account of corpus triangulation, where different corpora (e.g. parallel, comparable, synchronic, diachronic) and/or different methods of analysis (e.g. qualitative, quantitative) can be used to increase our understanding of the phenomena where translation plays a key role. The book also demonstrates clearly how the proposed methodology can be fruitfully employed to investigate different linguistic features, through its systematic application to empirical data. The first part of the book introduces the innovative framework for corpus triangulation, which is based on a new and comprehensive corpus typology, while the second part applies the methodological framework to two case studies examining the language of translation and the relationship between translation and language change. The book advances current translation studies in terms of methodology innovation and offers a model on which future studies investigating the network of relationships surrounding translated texts can be based.
Publisher: Routledge
ISBN: 1317364570
Category : Language Arts & Disciplines
Languages : en
Pages : 195
Book Description
Despite the recognition that corpus-based translation research would benefit from the triangulation of corpora, little has been done in the direction of actually employing combined corpus data and methods in the field. This book aims to address this gap by providing a much needed detailed account of corpus triangulation, where different corpora (e.g. parallel, comparable, synchronic, diachronic) and/or different methods of analysis (e.g. qualitative, quantitative) can be used to increase our understanding of the phenomena where translation plays a key role. The book also demonstrates clearly how the proposed methodology can be fruitfully employed to investigate different linguistic features, through its systematic application to empirical data. The first part of the book introduces the innovative framework for corpus triangulation, which is based on a new and comprehensive corpus typology, while the second part applies the methodological framework to two case studies examining the language of translation and the relationship between translation and language change. The book advances current translation studies in terms of methodology innovation and offers a model on which future studies investigating the network of relationships surrounding translated texts can be based.
Statistics in Corpus Linguistics Research
Author: Sean Wallis
Publisher: Routledge
ISBN: 0429958668
Category : Computers
Languages : en
Pages : 463
Book Description
Traditional approaches focused on significance tests have often been difficult for linguistics researchers to visualise. Statistics in Corpus Linguistics Research: A New Approach breaks these significance tests down for researchers in corpus linguistics and linguistic analysis, promoting a visual approach to understanding the performance of tests with real data, and demonstrating how to derive new intervals and tests. Accessibly written, this book discusses the ‘why’ behind the statistical model, allowing readers a greater facility for choosing their own methodologies. Accessibly written for those with little to no mathematical or statistical background, it explains the mathematical fundamentals of simple significance tests by relating them to confidence intervals. With sample datasets and easy-to-read visuals, this book focuses on practical issues, such as how to: • pose research questions in terms of choice and constraint; • employ confidence intervals correctly (including in graph plots); • select optimal significance tests (and what results mean); • measure the size of the effect of one variable on another; • estimate the similarity of distribution patterns; and • evaluate whether the results of two experiments significantly differ. Appropriate for anyone from the student just beginning their career to the seasoned researcher, this book is both a practical overview and valuable resource.
Publisher: Routledge
ISBN: 0429958668
Category : Computers
Languages : en
Pages : 463
Book Description
Traditional approaches focused on significance tests have often been difficult for linguistics researchers to visualise. Statistics in Corpus Linguistics Research: A New Approach breaks these significance tests down for researchers in corpus linguistics and linguistic analysis, promoting a visual approach to understanding the performance of tests with real data, and demonstrating how to derive new intervals and tests. Accessibly written, this book discusses the ‘why’ behind the statistical model, allowing readers a greater facility for choosing their own methodologies. Accessibly written for those with little to no mathematical or statistical background, it explains the mathematical fundamentals of simple significance tests by relating them to confidence intervals. With sample datasets and easy-to-read visuals, this book focuses on practical issues, such as how to: • pose research questions in terms of choice and constraint; • employ confidence intervals correctly (including in graph plots); • select optimal significance tests (and what results mean); • measure the size of the effect of one variable on another; • estimate the similarity of distribution patterns; and • evaluate whether the results of two experiments significantly differ. Appropriate for anyone from the student just beginning their career to the seasoned researcher, this book is both a practical overview and valuable resource.