Author: A. Abeillé
Publisher: Springer Science & Business Media
ISBN: 9401002010
Category : Psychology
Languages : en
Pages : 411
Book Description
This book provides a state of the art on work being done with parsed corpora. It gathers 21 papers on building and using parsed corpora raising many relevant questions, and deals with a variety of languages and a variety of corpora. It is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.
Treebanks
Author: A. Abeillé
Publisher: Springer Science & Business Media
ISBN: 9401002010
Category : Psychology
Languages : en
Pages : 411
Book Description
This book provides a state of the art on work being done with parsed corpora. It gathers 21 papers on building and using parsed corpora raising many relevant questions, and deals with a variety of languages and a variety of corpora. It is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.
Publisher: Springer Science & Business Media
ISBN: 9401002010
Category : Psychology
Languages : en
Pages : 411
Book Description
This book provides a state of the art on work being done with parsed corpora. It gathers 21 papers on building and using parsed corpora raising many relevant questions, and deals with a variety of languages and a variety of corpora. It is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.
Treebanks
Author: A. Abeillé
Publisher: Springer Science & Business Media
ISBN: 9781402013348
Category : Psychology
Languages : en
Pages : 442
Book Description
This book provides a state of the art on work being done with parsed corpora. It gathers 21 papers on building and using parsed corpora raising many relevant questions, and deals with a variety of languages and a variety of corpora. It is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.
Publisher: Springer Science & Business Media
ISBN: 9781402013348
Category : Psychology
Languages : en
Pages : 442
Book Description
This book provides a state of the art on work being done with parsed corpora. It gathers 21 papers on building and using parsed corpora raising many relevant questions, and deals with a variety of languages and a variety of corpora. It is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.
Diachronic Treebanks for Historical Linguistics
Author: Hanne Martine Eckhoff
Publisher: John Benjamins Publishing Company
ISBN: 9027260451
Category : Language Arts & Disciplines
Languages : en
Pages : 162
Book Description
Over the last few decades, the widespread diffusion of digital technology has increased availability of primary textual sources, radically changing the everyday life of scholars in the humanities, who are now able to access, query and process a wealth of empirical evidence in ways not possible before. Also for ancient languages, corpora enhanced with increasingly complex layers of metalinguistic information, such as part-of-speech tagging and syntactic annotation (called 'treebanks') are now available. In particular, diachronic treebanks, which provide data for a language across several historical stages of a given language, allow for a new approach to diachronic studies of syntactic phenomena where scholars previously had to content themselves with empirical work on a much smaller scale. This volume brings together a set of papers that report research on various diachronic matters supported by evidence from diachronic treebanks. The contents of the papers cover a wide range of languages, including English, French, Russian, Old Church Slavonic, Latin and Ancient Greek. Originally published as special issue of Diachronica 35:3 (2018).
Publisher: John Benjamins Publishing Company
ISBN: 9027260451
Category : Language Arts & Disciplines
Languages : en
Pages : 162
Book Description
Over the last few decades, the widespread diffusion of digital technology has increased availability of primary textual sources, radically changing the everyday life of scholars in the humanities, who are now able to access, query and process a wealth of empirical evidence in ways not possible before. Also for ancient languages, corpora enhanced with increasingly complex layers of metalinguistic information, such as part-of-speech tagging and syntactic annotation (called 'treebanks') are now available. In particular, diachronic treebanks, which provide data for a language across several historical stages of a given language, allow for a new approach to diachronic studies of syntactic phenomena where scholars previously had to content themselves with empirical work on a much smaller scale. This volume brings together a set of papers that report research on various diachronic matters supported by evidence from diachronic treebanks. The contents of the papers cover a wide range of languages, including English, French, Russian, Old Church Slavonic, Latin and Ancient Greek. Originally published as special issue of Diachronica 35:3 (2018).
Introduction to Machine Learning and Natural Language Processing
Author: Dr.Kongara Srinivasa Rao
Publisher: Leilani Katie Publication
ISBN: 9363484823
Category : Computers
Languages : en
Pages : 219
Book Description
Dr.Kongara Srinivasa Rao, Assistant Professor, Department of Computer Science and Engineering, Faculty of Science and Technology (ICFAI Tech), ICFAI Foundation for Higher Education (IFHE), Hyderabad, Telangana, India. Dr.K.Sreeramamurthy, Professor, Department of Computer Science Engineering, Koneru Lakshmaiah Education Foundation, Bowrampet, Hyderabad, Telangana, India. Dr.Yaswanth Kumar Alapati, Associate Professor, Department of Information Technology, R.V.R. & J.C. College of Engineering, Guntur, Andhra Pradesh, India.
Publisher: Leilani Katie Publication
ISBN: 9363484823
Category : Computers
Languages : en
Pages : 219
Book Description
Dr.Kongara Srinivasa Rao, Assistant Professor, Department of Computer Science and Engineering, Faculty of Science and Technology (ICFAI Tech), ICFAI Foundation for Higher Education (IFHE), Hyderabad, Telangana, India. Dr.K.Sreeramamurthy, Professor, Department of Computer Science Engineering, Koneru Lakshmaiah Education Foundation, Bowrampet, Hyderabad, Telangana, India. Dr.Yaswanth Kumar Alapati, Associate Professor, Department of Information Technology, R.V.R. & J.C. College of Engineering, Guntur, Andhra Pradesh, India.
Encyclopedia of Language and Linguistics
Author:
Publisher: Elsevier
ISBN: 0080547842
Category : Language Arts & Disciplines
Languages : en
Pages : 26924
Book Description
The first edition of ELL (1993, Ron Asher, Editor) was hailed as "the field's standard reference work for a generation". Now the all-new second edition matches ELL's comprehensiveness and high quality, expanded for a new generation, while being the first encyclopedia to really exploit the multimedia potential of linguistics. * The most authoritative, up-to-date, comprehensive, and international reference source in its field * An entirely new work, with new editors, new authors, new topics and newly commissioned articles with a handful of classic articles * The first Encyclopedia to exploit the multimedia potential of linguistics through the online edition * Ground-breaking and International in scope and approach * Alphabetically arranged with extensive cross-referencing * Available in print and online, priced separately. The online version will include updates as subjects develop ELL2 includes: * c. 7,500,000 words * c. 11,000 pages * c. 3,000 articles * c. 1,500 figures: 130 halftones and 150 colour * Supplementary audio, video and text files online * c. 3,500 glossary definitions * c. 39,000 references * Extensive list of commonly used abbreviations * List of languages of the world (including information on no. of speakers, language family, etc.) * Approximately 700 biographical entries (now includes contemporary linguists) * 200 language maps in print and online Also available online via ScienceDirect – featuring extensive browsing, searching, and internal cross-referencing between articles in the work, plus dynamic linking to journal articles and abstract databases, making navigation flexible and easy. For more information, pricing options and availability visit www.info.sciencedirect.com. The first Encyclopedia to exploit the multimedia potential of linguistics Ground-breaking in scope - wider than any predecessor An invaluable resource for researchers, academics, students and professionals in the fields of: linguistics, anthropology, education, psychology, language acquisition, language pathology, cognitive science, sociology, the law, the media, medicine & computer science. The most authoritative, up-to-date, comprehensive, and international reference source in its field
Publisher: Elsevier
ISBN: 0080547842
Category : Language Arts & Disciplines
Languages : en
Pages : 26924
Book Description
The first edition of ELL (1993, Ron Asher, Editor) was hailed as "the field's standard reference work for a generation". Now the all-new second edition matches ELL's comprehensiveness and high quality, expanded for a new generation, while being the first encyclopedia to really exploit the multimedia potential of linguistics. * The most authoritative, up-to-date, comprehensive, and international reference source in its field * An entirely new work, with new editors, new authors, new topics and newly commissioned articles with a handful of classic articles * The first Encyclopedia to exploit the multimedia potential of linguistics through the online edition * Ground-breaking and International in scope and approach * Alphabetically arranged with extensive cross-referencing * Available in print and online, priced separately. The online version will include updates as subjects develop ELL2 includes: * c. 7,500,000 words * c. 11,000 pages * c. 3,000 articles * c. 1,500 figures: 130 halftones and 150 colour * Supplementary audio, video and text files online * c. 3,500 glossary definitions * c. 39,000 references * Extensive list of commonly used abbreviations * List of languages of the world (including information on no. of speakers, language family, etc.) * Approximately 700 biographical entries (now includes contemporary linguists) * 200 language maps in print and online Also available online via ScienceDirect – featuring extensive browsing, searching, and internal cross-referencing between articles in the work, plus dynamic linking to journal articles and abstract databases, making navigation flexible and easy. For more information, pricing options and availability visit www.info.sciencedirect.com. The first Encyclopedia to exploit the multimedia potential of linguistics Ground-breaking in scope - wider than any predecessor An invaluable resource for researchers, academics, students and professionals in the fields of: linguistics, anthropology, education, psychology, language acquisition, language pathology, cognitive science, sociology, the law, the media, medicine & computer science. The most authoritative, up-to-date, comprehensive, and international reference source in its field
Corpus Linguistics. Volume 1
Author: Anke Lüdeling
Publisher: Walter de Gruyter
ISBN: 3110211424
Category : Language Arts & Disciplines
Languages : en
Pages : 797
Book Description
This volume provides an up-to-date survey of the field of corpus linguistics, a field whose methodology has revolutionized much of the empirical work done in most fields of linguistic study over the past decade. Corpus linguistics investigates human language by starting out from large collections of texts - spoken, written, or recorded. These language corpora, which are now regularly available in electronic form, are the basis for quantitative and qualitative research on almost any question of linguistic interest. Many techniques that are in use in corpus linguistics today are rooted in the tradition of the late 18th and 19th century, when linguistics began to make use of mathematical and empirical methods. Modern corpus linguistics has used and developed these methods in close connection with computer science and computational linguistics. The handbook sketches the history of corpus linguistics, shows its potential, discusses its problems, and describes various methods of collecting, annotating, and searching corpora as well as processing corpus data. It also reports case studies that illustrate the wide range of linguistic research questions addressed in corpus linguistics. The over 60 articles included in the handbook are divided into five sections: (1) the origins and history of corpus linguistics and surveys of its relationship to central fields of linguistics (2) corpus compilation (3) corpus types (4) preprocessing of corpora (5) the use and exploitation of corpora. The final section gives an overview of the results of corpus studies obtained in phonetics, phonology, morphology, syntax, semantics, sociolinguistics, historical linguistics, stylometry, dialectology, and discourse analysis. It also reports on recent advances made in human and machine translation, contrastive studies, computer-assisted language learning, and automatic summarization. The contributors to the volume are internationally known experts in their respective fields. The handbook is intended for a wide audience ranging from teachers, university students, and scholars to anyone interested in the use of computers in linguistic analyses and applications.
Publisher: Walter de Gruyter
ISBN: 3110211424
Category : Language Arts & Disciplines
Languages : en
Pages : 797
Book Description
This volume provides an up-to-date survey of the field of corpus linguistics, a field whose methodology has revolutionized much of the empirical work done in most fields of linguistic study over the past decade. Corpus linguistics investigates human language by starting out from large collections of texts - spoken, written, or recorded. These language corpora, which are now regularly available in electronic form, are the basis for quantitative and qualitative research on almost any question of linguistic interest. Many techniques that are in use in corpus linguistics today are rooted in the tradition of the late 18th and 19th century, when linguistics began to make use of mathematical and empirical methods. Modern corpus linguistics has used and developed these methods in close connection with computer science and computational linguistics. The handbook sketches the history of corpus linguistics, shows its potential, discusses its problems, and describes various methods of collecting, annotating, and searching corpora as well as processing corpus data. It also reports case studies that illustrate the wide range of linguistic research questions addressed in corpus linguistics. The over 60 articles included in the handbook are divided into five sections: (1) the origins and history of corpus linguistics and surveys of its relationship to central fields of linguistics (2) corpus compilation (3) corpus types (4) preprocessing of corpora (5) the use and exploitation of corpora. The final section gives an overview of the results of corpus studies obtained in phonetics, phonology, morphology, syntax, semantics, sociolinguistics, historical linguistics, stylometry, dialectology, and discourse analysis. It also reports on recent advances made in human and machine translation, contrastive studies, computer-assisted language learning, and automatic summarization. The contributors to the volume are internationally known experts in their respective fields. The handbook is intended for a wide audience ranging from teachers, university students, and scholars to anyone interested in the use of computers in linguistic analyses and applications.
The Open Handbook of Linguistic Data Management
Author: Andrea L. Berez-Kroeker
Publisher: MIT Press
ISBN: 0262362171
Category : Language Arts & Disciplines
Languages : en
Pages : 687
Book Description
A guide to principles and methods for the management, archiving, sharing, and citing of linguistic research data, especially digital data. "Doing language science" depends on collecting, transcribing, annotating, analyzing, storing, and sharing linguistic research data. This volume offers a guide to linguistic data management, engaging with current trends toward the transformation of linguistics into a more data-driven and reproducible scientific endeavor. It offers both principles and methods, presenting the conceptual foundations of linguistic data management and a series of case studies, each of which demonstrates a concrete application of abstract principles in a current practice. In part 1, contributors bring together knowledge from information science, archiving, and data stewardship relevant to linguistic data management. Topics covered include implementation principles, archiving data, finding and using datasets, and the valuation of time and effort involved in data management. Part 2 presents snapshots of practices across various subfields, with each chapter presenting a unique data management project with generalizable guidance for researchers. The Open Handbook of Linguistic Data Management is an essential addition to the toolkit of every linguist, guiding researchers toward making their data FAIR: Findable, Accessible, Interoperable, and Reusable.
Publisher: MIT Press
ISBN: 0262362171
Category : Language Arts & Disciplines
Languages : en
Pages : 687
Book Description
A guide to principles and methods for the management, archiving, sharing, and citing of linguistic research data, especially digital data. "Doing language science" depends on collecting, transcribing, annotating, analyzing, storing, and sharing linguistic research data. This volume offers a guide to linguistic data management, engaging with current trends toward the transformation of linguistics into a more data-driven and reproducible scientific endeavor. It offers both principles and methods, presenting the conceptual foundations of linguistic data management and a series of case studies, each of which demonstrates a concrete application of abstract principles in a current practice. In part 1, contributors bring together knowledge from information science, archiving, and data stewardship relevant to linguistic data management. Topics covered include implementation principles, archiving data, finding and using datasets, and the valuation of time and effort involved in data management. Part 2 presents snapshots of practices across various subfields, with each chapter presenting a unique data management project with generalizable guidance for researchers. The Open Handbook of Linguistic Data Management is an essential addition to the toolkit of every linguist, guiding researchers toward making their data FAIR: Findable, Accessible, Interoperable, and Reusable.
Handbook of Linguistic Annotation
Author: Nancy Ide
Publisher: Springer
ISBN: 9402408819
Category : Language Arts & Disciplines
Languages : en
Pages : 1440
Book Description
This handbook offers a thorough treatment of the science of linguistic annotation. Leaders in the field guide the reader through the process of modeling, creating an annotation language, building a corpus and evaluating it for correctness. Essential reading for both computer scientists and linguistic researchers.Linguistic annotation is an increasingly important activity in the field of computational linguistics because of its critical role in the development of language models for natural language processing applications. Part one of this book covers all phases of the linguistic annotation process, from annotation scheme design and choice of representation format through both the manual and automatic annotation process, evaluation, and iterative improvement of annotation accuracy. The second part of the book includes case studies of annotation projects across the spectrum of linguistic annotation types, including morpho-syntactic tagging, syntactic analyses, a range of semantic analyses (semantic roles, named entities, sentiment and opinion), time and event and spatial analyses, and discourse level analyses including discourse structure, co-reference, etc. Each case study addresses the various phases and processes discussed in the chapters of part one.
Publisher: Springer
ISBN: 9402408819
Category : Language Arts & Disciplines
Languages : en
Pages : 1440
Book Description
This handbook offers a thorough treatment of the science of linguistic annotation. Leaders in the field guide the reader through the process of modeling, creating an annotation language, building a corpus and evaluating it for correctness. Essential reading for both computer scientists and linguistic researchers.Linguistic annotation is an increasingly important activity in the field of computational linguistics because of its critical role in the development of language models for natural language processing applications. Part one of this book covers all phases of the linguistic annotation process, from annotation scheme design and choice of representation format through both the manual and automatic annotation process, evaluation, and iterative improvement of annotation accuracy. The second part of the book includes case studies of annotation projects across the spectrum of linguistic annotation types, including morpho-syntactic tagging, syntactic analyses, a range of semantic analyses (semantic roles, named entities, sentiment and opinion), time and event and spatial analyses, and discourse level analyses including discourse structure, co-reference, etc. Each case study addresses the various phases and processes discussed in the chapters of part one.
Language Processing and Knowledge in the Web
Author: Iryna Gurevych
Publisher: Springer
ISBN: 3642407226
Category : Computers
Languages : en
Pages : 227
Book Description
This book constitutes the refereed conference proceedings of the 25th International Conference on Language Processing and Knowledge in the Web, GSCL 2013, held in Darmstadt, Germany, in September 2013. The 20 revised full papers were carefully selected from numerous submissions and cover topics on language processing and knowledge in the Web on several important dimensions, such as computational linguistics, language technology, and processing of unstructured textual content in the Web.
Publisher: Springer
ISBN: 3642407226
Category : Computers
Languages : en
Pages : 227
Book Description
This book constitutes the refereed conference proceedings of the 25th International Conference on Language Processing and Knowledge in the Web, GSCL 2013, held in Darmstadt, Germany, in September 2013. The 20 revised full papers were carefully selected from numerous submissions and cover topics on language processing and knowledge in the Web on several important dimensions, such as computational linguistics, language technology, and processing of unstructured textual content in the Web.
Inductive Dependency Parsing
Author: Joakim Nivre
Publisher: Springer Science & Business Media
ISBN: 1402048890
Category : Computers
Languages : en
Pages : 224
Book Description
This book describes the framework of inductive dependency parsing, a methodology for robust and efficient syntactic analysis of unrestricted natural language text. Coverage includes a theoretical analysis of central models and algorithms, and an empirical evaluation of memory-based dependency parsing using data from Swedish and English. A one-stop reference to dependency-based parsing of natural language, it will interest researchers and system developers in language technology, and is suitable for graduate or advanced undergraduate courses.
Publisher: Springer Science & Business Media
ISBN: 1402048890
Category : Computers
Languages : en
Pages : 224
Book Description
This book describes the framework of inductive dependency parsing, a methodology for robust and efficient syntactic analysis of unrestricted natural language text. Coverage includes a theoretical analysis of central models and algorithms, and an empirical evaluation of memory-based dependency parsing using data from Swedish and English. A one-stop reference to dependency-based parsing of natural language, it will interest researchers and system developers in language technology, and is suitable for graduate or advanced undergraduate courses.