Author: van Halteren
Publisher: BRILL
ISBN: 9004653503
Category : Computers
Languages : en
Pages : 217
Book Description
Linguistic Exploitation of Syntactic Databases
Author: van Halteren
Publisher: BRILL
ISBN: 9004653503
Category : Computers
Languages : en
Pages : 217
Book Description
Publisher: BRILL
ISBN: 9004653503
Category : Computers
Languages : en
Pages : 217
Book Description
Excursions Into Syntactic Databases
Author: Hans van Halteren
Publisher: Rodopi
ISBN: 9789042003736
Category : Computers
Languages : en
Pages : 268
Book Description
This book is about syntactic databases (a.k.a. treebanks), collections of text material in which the syntactic relations have been made visible. It starts off with a general intro-duction to the subject and then continues with three in-depth investigations of more specialized aspects. In the introduction, syntactic databases are first placed in the larger context of linguistic databases, text collections with a broader linguistic annotation than just a syntactic one. Then some examples of syntactic databases are given, illustrating the range of annotation actually encountered. The introduction is completed with an investigation of database management systems for syntactic databases. The first in-depth investigation concerns the treatment of ambiguous structures in syntactic analysis trees, focussing on a very efficient representation for such structures and the means to create this representation. Next, classroom use of syntactic databases is examined. A computer program for this purpose, CLUES, is discussed, along with a suggested series of syntax exercises. The final subject is the importance of including function and attribute information in the annotation of texts. The central line of investigation here is a probabilistic parsing experiment in which the use of function and attribute information is the main variable.
Publisher: Rodopi
ISBN: 9789042003736
Category : Computers
Languages : en
Pages : 268
Book Description
This book is about syntactic databases (a.k.a. treebanks), collections of text material in which the syntactic relations have been made visible. It starts off with a general intro-duction to the subject and then continues with three in-depth investigations of more specialized aspects. In the introduction, syntactic databases are first placed in the larger context of linguistic databases, text collections with a broader linguistic annotation than just a syntactic one. Then some examples of syntactic databases are given, illustrating the range of annotation actually encountered. The introduction is completed with an investigation of database management systems for syntactic databases. The first in-depth investigation concerns the treatment of ambiguous structures in syntactic analysis trees, focussing on a very efficient representation for such structures and the means to create this representation. Next, classroom use of syntactic databases is examined. A computer program for this purpose, CLUES, is discussed, along with a suggested series of syntax exercises. The final subject is the importance of including function and attribute information in the annotation of texts. The central line of investigation here is a probabilistic parsing experiment in which the use of function and attribute information is the main variable.
Corpus Linguistics and Linguistic Theory
Author:
Publisher: BRILL
ISBN: 9004490752
Category : Language Arts & Disciplines
Languages : en
Pages : 403
Book Description
From being the occupation of a marginal (and frequently marginalised) group of researchers, the linguistic analysis of machine-readable language corpora has moved to the mainstream of research on the English language. In this process an impressive body of results has accumulated which, over and above the intrinsic descriptive interest it holds for students of the English language, forces a major and systematic re-thinking of foundational issues in linguistic theory. Corpus linguistics and linguistic theory was accordingly chosen as the motto for the twentieth annual gathering of ICAME, the International Computer Archive of Modern/ Medieval English, which was hosted by the University of Freiburg (Germany) in 1999. The present volume, which presents selected papers from this conference, thus builds on previous successful work in the computer-aided description of English and at the same time represents an attempt at stock-taking and methodological reflection in a linguistic subdiscipline that has clearly come of age. Contributions cover all levels of linguistic description - from phonology/ prosody, through grammar and semantics to discourse-analytical issues such as genre or gender-specific linguistic usage. They are united by a desire to further the dialogue between the corpus-linguistic community and researchers working in other traditions. Thereby, the atmosphere ranges from undisguised skepticism (as expressed by Noam Chomsky in an interview which is part of the opening contribution by Bas Aarts) to empirically substantiated optimism (as, for example, in Bernadette Vine's significantly titled contribution Getting things done).
Publisher: BRILL
ISBN: 9004490752
Category : Language Arts & Disciplines
Languages : en
Pages : 403
Book Description
From being the occupation of a marginal (and frequently marginalised) group of researchers, the linguistic analysis of machine-readable language corpora has moved to the mainstream of research on the English language. In this process an impressive body of results has accumulated which, over and above the intrinsic descriptive interest it holds for students of the English language, forces a major and systematic re-thinking of foundational issues in linguistic theory. Corpus linguistics and linguistic theory was accordingly chosen as the motto for the twentieth annual gathering of ICAME, the International Computer Archive of Modern/ Medieval English, which was hosted by the University of Freiburg (Germany) in 1999. The present volume, which presents selected papers from this conference, thus builds on previous successful work in the computer-aided description of English and at the same time represents an attempt at stock-taking and methodological reflection in a linguistic subdiscipline that has clearly come of age. Contributions cover all levels of linguistic description - from phonology/ prosody, through grammar and semantics to discourse-analytical issues such as genre or gender-specific linguistic usage. They are united by a desire to further the dialogue between the corpus-linguistic community and researchers working in other traditions. Thereby, the atmosphere ranges from undisguised skepticism (as expressed by Noam Chomsky in an interview which is part of the opening contribution by Bas Aarts) to empirically substantiated optimism (as, for example, in Bernadette Vine's significantly titled contribution Getting things done).
Text Databases: One Database Model and Several Retrive Al Languages Paper
Author: Doedens
Publisher: BRILL
ISBN: 9004653570
Category : Computers
Languages : en
Pages : 316
Book Description
Manipulation of text by means of the computer is well-established. Everybody has a word processor on his or her desk, and electronic mail, desk top publishing, text interchange languages, hypertext and multimedia are technologies many will be aware of. However, the full potential of the computer for the management and use of textual information has not been tapped yet. Far from it. For this a more principled approach is necessary, which will create a framework on which existing technologies, and technologies-yet-to-come can build and in which they can be integrated. This book can be seen as one step on this road. It employs the experience gained in working with a rich electronic linguistic corpus, the ECA database. A basic text database model is put forward and several text database retrieval languages are defined and analysed. A clear direction for further research is given. Therefore, the book is of relevance to researchers and developers in the field of corpus linguistics and in the more general field of electronic text.
Publisher: BRILL
ISBN: 9004653570
Category : Computers
Languages : en
Pages : 316
Book Description
Manipulation of text by means of the computer is well-established. Everybody has a word processor on his or her desk, and electronic mail, desk top publishing, text interchange languages, hypertext and multimedia are technologies many will be aware of. However, the full potential of the computer for the management and use of textual information has not been tapped yet. Far from it. For this a more principled approach is necessary, which will create a framework on which existing technologies, and technologies-yet-to-come can build and in which they can be integrated. This book can be seen as one step on this road. It employs the experience gained in working with a rich electronic linguistic corpus, the ECA database. A basic text database model is put forward and several text database retrieval languages are defined and analysed. A clear direction for further research is given. Therefore, the book is of relevance to researchers and developers in the field of corpus linguistics and in the more general field of electronic text.
Corpus-Based Research Into Language
Author: Oostdijk
Publisher: BRILL
ISBN: 9004653562
Category : Computers
Languages : en
Pages : 287
Book Description
For over two decades Jan Aarts has been actively involved in corpus linguistic research. He was the instigator of a large number of projects, and he was responsible for what has become known as the Nijmegen approach to corpus linguistics. It is thanks to him that words like TOSCA and LDB have become household names in the corpus linguistic community. The present volume has been collected in his honour. The contributions in it cover a wide range of topics in the field of corpus linguistic research, especially those in which Jan Aarts takes a keen interest: corpus encoding and tagging, parsing and databases, and the linguistic exploration of corpus data. The contributions in this volume discuss work done in this field outside Nijmegen, for the obvious reason that we do not wish to present him with a report on work in which he is himself involved.
Publisher: BRILL
ISBN: 9004653562
Category : Computers
Languages : en
Pages : 287
Book Description
For over two decades Jan Aarts has been actively involved in corpus linguistic research. He was the instigator of a large number of projects, and he was responsible for what has become known as the Nijmegen approach to corpus linguistics. It is thanks to him that words like TOSCA and LDB have become household names in the corpus linguistic community. The present volume has been collected in his honour. The contributions in it cover a wide range of topics in the field of corpus linguistic research, especially those in which Jan Aarts takes a keen interest: corpus encoding and tagging, parsing and databases, and the linguistic exploration of corpus data. The contributions in this volume discuss work done in this field outside Nijmegen, for the obvious reason that we do not wish to present him with a report on work in which he is himself involved.
Corpus Linguistics, Hard and Soft
Author: Merja Kytö
Publisher: Rodopi
ISBN: 9789051830248
Category : Computers
Languages : en
Pages : 308
Book Description
Publisher: Rodopi
ISBN: 9789051830248
Category : Computers
Languages : en
Pages : 308
Book Description
An Introduction to Corpus Linguistics
Author: Graeme Kennedy
Publisher: Routledge
ISBN: 1317892577
Category : Language Arts & Disciplines
Languages : en
Pages : 334
Book Description
The use of large, computerized bodies of text for linguistic analysis and description has emerged in recent years as one of the most significant and rapidly-developing fields of activity in the study of language. This book provides a comprehensive introduction and guide to Corpus Linguistics. All aspects of the field are explored, from the various types of electronic corpora that are available to instructions on how to design and compile a corpus. Graeme Kennedy surveys the development of corpora for use in linguistic research, looking back to the pre-electronic age as well as to the massive growth of computer corpora in the electronic age.
Publisher: Routledge
ISBN: 1317892577
Category : Language Arts & Disciplines
Languages : en
Pages : 334
Book Description
The use of large, computerized bodies of text for linguistic analysis and description has emerged in recent years as one of the most significant and rapidly-developing fields of activity in the study of language. This book provides a comprehensive introduction and guide to Corpus Linguistics. All aspects of the field are explored, from the various types of electronic corpora that are available to instructions on how to design and compile a corpus. Graeme Kennedy surveys the development of corpora for use in linguistic research, looking back to the pre-electronic age as well as to the massive growth of computer corpora in the electronic age.
English Corpus Linguistics
Author: Karin Aijmer
Publisher: Routledge
ISBN: 1317899237
Category : Language Arts & Disciplines
Languages : en
Pages : 305
Book Description
This collection of articles form a tribute to Jan Svartvik and his pioneering work in the field. Covers corpus studies, problematic grammar, institution-based and observation-based grammars and the design and development of spoken and written text corpora in different varieties of English.
Publisher: Routledge
ISBN: 1317899237
Category : Language Arts & Disciplines
Languages : en
Pages : 305
Book Description
This collection of articles form a tribute to Jan Svartvik and his pioneering work in the field. Covers corpus studies, problematic grammar, institution-based and observation-based grammars and the design and development of spoken and written text corpora in different varieties of English.
Finite-state Methods and Natural Language Processing
Author: Jakub Piskorski
Publisher: IOS Press
ISBN: 158603975X
Category : Computers
Languages : en
Pages : 248
Book Description
Contains papers that cover a range of Natural Language Processing (NLP) applications, including machine learning and translation, logic, computational phonology, morphology and semantics, data mining, information extraction and disambiguation, as well as programming, optimization and compression of finite-state networks.
Publisher: IOS Press
ISBN: 158603975X
Category : Computers
Languages : en
Pages : 248
Book Description
Contains papers that cover a range of Natural Language Processing (NLP) applications, including machine learning and translation, logic, computational phonology, morphology and semantics, data mining, information extraction and disambiguation, as well as programming, optimization and compression of finite-state networks.
Linguistics and Language Behavior Abstracts
Author:
Publisher:
ISBN:
Category : Language and languages
Languages : en
Pages : 558
Book Description
Publisher:
ISBN:
Category : Language and languages
Languages : en
Pages : 558
Book Description