Author: Niladri Sekhar Dash
Publisher: Springer
ISBN: 9811318018
Category : Language Arts & Disciplines
Languages : en
Pages : 308
Book Description
This book discusses some of the basic issues relating to corpus generation and the methods normally used to generate a corpus. Since corpus-related research goes beyond corpus generation, the book also addresses other major topics connected with the use and application of language corpora, namely, corpus readiness in the context of corpus sanitation and pre-editing of corpus texts; the application of statistical methods; and various text processing techniques. Importantly, it explores how corpora can be used as a primary or secondary resource in English language teaching, in creating dictionaries, in word sense disambiguation, in various language technologies, and in other branches of linguistics. Lastly, the book sheds light on the status quo of corpus generation in Indian languages and identifies current and future needs. Discussing various technical issues in the field in a lucid manner, providing extensive new diagrams and charts for easy comprehension, and using simplified English, the book is an ideal resource for non-native English readers. Written by academics with many years of experience teaching and researching corpus linguistics, its focus on Indian languages and on English corpora makes it applicable to graduate and postgraduate students of applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.
Utility and Application of Language Corpora
Author: Niladri Sekhar Dash
Publisher: Springer
ISBN: 9811318018
Category : Language Arts & Disciplines
Languages : en
Pages : 308
Book Description
This book discusses some of the basic issues relating to corpus generation and the methods normally used to generate a corpus. Since corpus-related research goes beyond corpus generation, the book also addresses other major topics connected with the use and application of language corpora, namely, corpus readiness in the context of corpus sanitation and pre-editing of corpus texts; the application of statistical methods; and various text processing techniques. Importantly, it explores how corpora can be used as a primary or secondary resource in English language teaching, in creating dictionaries, in word sense disambiguation, in various language technologies, and in other branches of linguistics. Lastly, the book sheds light on the status quo of corpus generation in Indian languages and identifies current and future needs. Discussing various technical issues in the field in a lucid manner, providing extensive new diagrams and charts for easy comprehension, and using simplified English, the book is an ideal resource for non-native English readers. Written by academics with many years of experience teaching and researching corpus linguistics, its focus on Indian languages and on English corpora makes it applicable to graduate and postgraduate students of applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.
Publisher: Springer
ISBN: 9811318018
Category : Language Arts & Disciplines
Languages : en
Pages : 308
Book Description
This book discusses some of the basic issues relating to corpus generation and the methods normally used to generate a corpus. Since corpus-related research goes beyond corpus generation, the book also addresses other major topics connected with the use and application of language corpora, namely, corpus readiness in the context of corpus sanitation and pre-editing of corpus texts; the application of statistical methods; and various text processing techniques. Importantly, it explores how corpora can be used as a primary or secondary resource in English language teaching, in creating dictionaries, in word sense disambiguation, in various language technologies, and in other branches of linguistics. Lastly, the book sheds light on the status quo of corpus generation in Indian languages and identifies current and future needs. Discussing various technical issues in the field in a lucid manner, providing extensive new diagrams and charts for easy comprehension, and using simplified English, the book is an ideal resource for non-native English readers. Written by academics with many years of experience teaching and researching corpus linguistics, its focus on Indian languages and on English corpora makes it applicable to graduate and postgraduate students of applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.
History, Features, and Typology of Language Corpora
Author: Niladri Sekhar Dash
Publisher: Springer
ISBN: 9811074585
Category : Language Arts & Disciplines
Languages : en
Pages : 311
Book Description
This book discusses key issues of corpus linguistics like the definition of the corpus, primary features of a corpus, and utilization and limitations of corpora. It presents a unique classification scheme of language corpora to show how they can be studied from the perspective of genre, nature, text type, purpose, and application. A reference to parallel translation corpus is mandatory in the discussion of corpus generation, which the authors thoroughly address here, with a focus on Indian language corpora and English. Web-text corpus, a new development in corpus linguistics, is also discussed with elaborate reference to Indian web text corpora. The book also presents a short history of corpus generation and provides scenarios before and after the advent of computer-generated digital corpora. This book has several important features: it discusses many technical issues of the field in a lucid manner; contains extensive new diagrams and charts for easy comprehension; and presents discussions in simplified English to cater to the needs of non-native English readers. This is an important resource authored by academics who have many years of experience teaching and researching corpus linguistics. Its focus on Indian languages and on English corpora makes it applicable to students of graduate and postgraduate courses in applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.
Publisher: Springer
ISBN: 9811074585
Category : Language Arts & Disciplines
Languages : en
Pages : 311
Book Description
This book discusses key issues of corpus linguistics like the definition of the corpus, primary features of a corpus, and utilization and limitations of corpora. It presents a unique classification scheme of language corpora to show how they can be studied from the perspective of genre, nature, text type, purpose, and application. A reference to parallel translation corpus is mandatory in the discussion of corpus generation, which the authors thoroughly address here, with a focus on Indian language corpora and English. Web-text corpus, a new development in corpus linguistics, is also discussed with elaborate reference to Indian web text corpora. The book also presents a short history of corpus generation and provides scenarios before and after the advent of computer-generated digital corpora. This book has several important features: it discusses many technical issues of the field in a lucid manner; contains extensive new diagrams and charts for easy comprehension; and presents discussions in simplified English to cater to the needs of non-native English readers. This is an important resource authored by academics who have many years of experience teaching and researching corpus linguistics. Its focus on Indian languages and on English corpora makes it applicable to students of graduate and postgraduate courses in applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.
Language Corpora Annotation and Processing
Author: Niladri Sekhar Dash
Publisher: Springer Nature
ISBN: 9811629609
Category : Language Arts & Disciplines
Languages : en
Pages : 292
Book Description
This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.
Publisher: Springer Nature
ISBN: 9811629609
Category : Language Arts & Disciplines
Languages : en
Pages : 292
Book Description
This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.
Developing Linguistic Corpora
Author: Martin Wynne
Publisher: Oxbow Books Limited
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 100
Book Description
A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.
Publisher: Oxbow Books Limited
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 100
Book Description
A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.
English Language Corpora
Author:
Publisher: BRILL
ISBN: 9004653554
Category : Computers
Languages : en
Pages : 336
Book Description
Publisher: BRILL
ISBN: 9004653554
Category : Computers
Languages : en
Pages : 336
Book Description
Corpus Linguistics: An Introduction
Author: Dash, Niladri Sekhar
Publisher: Pearson Education India
ISBN: 8131752623
Category :
Languages : en
Pages : 208
Book Description
Corpus Linguistics: An Introduction will appeal to a wide spectrum of scholars, researchers, and particularly to students of linguistics. It offers guidelines for the creation and usage of corpora in the form of empirical language databases with direct functional and theoretical interpretation of a natural language. Drawn from original research and written in an accessible language and style, this book will create avenues for further advancements in mainstream and applied linguistics and language technology.
Publisher: Pearson Education India
ISBN: 8131752623
Category :
Languages : en
Pages : 208
Book Description
Corpus Linguistics: An Introduction will appeal to a wide spectrum of scholars, researchers, and particularly to students of linguistics. It offers guidelines for the creation and usage of corpora in the form of empirical language databases with direct functional and theoretical interpretation of a natural language. Drawn from original research and written in an accessible language and style, this book will create avenues for further advancements in mainstream and applied linguistics and language technology.
Contemporary Corpus Linguistics
Author: Paul Baker
Publisher: A&C Black
ISBN: 1441181334
Category : Language Arts & Disciplines
Languages : en
Pages : 370
Book Description
Acts as a one-volume resource, providing an introduction to every aspect of corpus linguistics as it is being used at the moment.
Publisher: A&C Black
ISBN: 1441181334
Category : Language Arts & Disciplines
Languages : en
Pages : 370
Book Description
Acts as a one-volume resource, providing an introduction to every aspect of corpus linguistics as it is being used at the moment.
Routledge Encyclopedia of Technology and the Humanities
Author: Chan Sin-wai
Publisher: Taylor & Francis
ISBN: 1040005829
Category : Language Arts & Disciplines
Languages : en
Pages : 389
Book Description
Routledge Encyclopedia of Technology and the Humanities is a pioneer attempt to introduce a wide range of disciplines in the emerging field of techno-humanities to the English-reading world. This book covers topics such as archaeology, cultural heritage, design, fashion, linguistics, music, philosophy, and translation. It has 20 chapters, contributed by 26 local and international scholars. Each chapter has its own theme and addresses issues of significant interest in the respective disciplines. References are provided at the end of each chapter for further exploration into the literature of the relevant areas. To facilitate an easy reading of the information presented in this volume, chapters have been arranged according to the alphabetical order of the topics covered. This Encyclopedia will appeal to researchers and professionals in the field of technology and the humanities, and can be used by undergraduate and graduate students studying the humanities.
Publisher: Taylor & Francis
ISBN: 1040005829
Category : Language Arts & Disciplines
Languages : en
Pages : 389
Book Description
Routledge Encyclopedia of Technology and the Humanities is a pioneer attempt to introduce a wide range of disciplines in the emerging field of techno-humanities to the English-reading world. This book covers topics such as archaeology, cultural heritage, design, fashion, linguistics, music, philosophy, and translation. It has 20 chapters, contributed by 26 local and international scholars. Each chapter has its own theme and addresses issues of significant interest in the respective disciplines. References are provided at the end of each chapter for further exploration into the literature of the relevant areas. To facilitate an easy reading of the information presented in this volume, chapters have been arranged according to the alphabetical order of the topics covered. This Encyclopedia will appeal to researchers and professionals in the field of technology and the humanities, and can be used by undergraduate and graduate students studying the humanities.
Language corpora : past, present and future
Author: Niladri Sekhar Dash
Publisher: Mittal Publications
ISBN: 9788183242554
Category : Computational linguistics
Languages : en
Pages : 204
Book Description
Publisher: Mittal Publications
ISBN: 9788183242554
Category : Computational linguistics
Languages : en
Pages : 204
Book Description
Exploring Linguistic Science
Author: Allison Burkette
Publisher:
ISBN: 1108424805
Category : Language Arts & Disciplines
Languages : en
Pages : 253
Book Description
Introduces students to the scientific study of language, using the basic principles of complexity theory.
Publisher:
ISBN: 1108424805
Category : Language Arts & Disciplines
Languages : en
Pages : 253
Book Description
Introduces students to the scientific study of language, using the basic principles of complexity theory.