Author: Tony Berber Sardinha
Publisher: A&C Black
ISBN: 1472570014
Category : Language Arts & Disciplines
Languages : en
Pages : 347
Book Description
Although Portuguese is one of the main world languages and researchers have been working on Portuguese electronic text collections for decades (e.g. Kelly, 1970; Biderman, 1978; Bacelar do Nascimento et al., 1984; see Berber Sardinha, 2005), this is the first volume in English that encapsulates the exciting and cutting-edge corpus linguistic work being done with Portuguese language corpora on different continents. The book includes chapters by leading corpus linguists dealing with Portuguese corpora across the world, and their contributions explore various methods and how they are applicable to a wide range of language issues. The book is divided into six sections, each covering a key issue in Corpus Linguistics: lexis and grammar, lexicography, language teaching and terminology, translation, corpus building and sharing, and parsing and annotation. Together these sections present the reader with a broad picture of the field.
Working with Portuguese Corpora
Author: Tony Berber Sardinha
Publisher: A&C Black
ISBN: 1472570014
Category : Language Arts & Disciplines
Languages : en
Pages : 347
Book Description
Although Portuguese is one of the main world languages and researchers have been working on Portuguese electronic text collections for decades (e.g. Kelly, 1970; Biderman, 1978; Bacelar do Nascimento et al., 1984; see Berber Sardinha, 2005), this is the first volume in English that encapsulates the exciting and cutting-edge corpus linguistic work being done with Portuguese language corpora on different continents. The book includes chapters by leading corpus linguists dealing with Portuguese corpora across the world, and their contributions explore various methods and how they are applicable to a wide range of language issues. The book is divided into six sections, each covering a key issue in Corpus Linguistics: lexis and grammar, lexicography, language teaching and terminology, translation, corpus building and sharing, and parsing and annotation. Together these sections present the reader with a broad picture of the field.
Publisher: A&C Black
ISBN: 1472570014
Category : Language Arts & Disciplines
Languages : en
Pages : 347
Book Description
Although Portuguese is one of the main world languages and researchers have been working on Portuguese electronic text collections for decades (e.g. Kelly, 1970; Biderman, 1978; Bacelar do Nascimento et al., 1984; see Berber Sardinha, 2005), this is the first volume in English that encapsulates the exciting and cutting-edge corpus linguistic work being done with Portuguese language corpora on different continents. The book includes chapters by leading corpus linguists dealing with Portuguese corpora across the world, and their contributions explore various methods and how they are applicable to a wide range of language issues. The book is divided into six sections, each covering a key issue in Corpus Linguistics: lexis and grammar, lexicography, language teaching and terminology, translation, corpus building and sharing, and parsing and annotation. Together these sections present the reader with a broad picture of the field.
Linguistic Corpora and Big Data in Spanish and Portuguese
Author: Miguel Calderón Campos
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3110781468
Category : Language Arts & Disciplines
Languages : en
Pages : 238
Book Description
In recent decades, corpus linguistics has experienced tremendous development in the Hispanic world, along two opposite but complementary approaches: increase in corpus size (corpus linguistics as Big Data) and improvement in document selection and data annotation (corpus linguistics as High Quality Data). The first approach has led to the creation of massive corpora such as EsTenTen; at the same time, it has promoted the use of the web and social networks as corpora. The second perspective gives rise to specialized corpora such as Post Scriptum or Oralia Diacrónica del español (ODE). The contributions gathered in this volume combine both methods in order to exploit their advantages and to overcome their possible limitations. On the one hand, it addresses the creation and design of small corpora focused on data quality; on the other hand, it offers case studies that make use of both specialized corpora and massive data extracted from the web. Highlighting the complementary nature of both methods is the main idea of this book.
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3110781468
Category : Language Arts & Disciplines
Languages : en
Pages : 238
Book Description
In recent decades, corpus linguistics has experienced tremendous development in the Hispanic world, along two opposite but complementary approaches: increase in corpus size (corpus linguistics as Big Data) and improvement in document selection and data annotation (corpus linguistics as High Quality Data). The first approach has led to the creation of massive corpora such as EsTenTen; at the same time, it has promoted the use of the web and social networks as corpora. The second perspective gives rise to specialized corpora such as Post Scriptum or Oralia Diacrónica del español (ODE). The contributions gathered in this volume combine both methods in order to exploit their advantages and to overcome their possible limitations. On the one hand, it addresses the creation and design of small corpora focused on data quality; on the other hand, it offers case studies that make use of both specialized corpora and massive data extracted from the web. Highlighting the complementary nature of both methods is the main idea of this book.
Multi-Dimensional Analysis, 25 years on
Author: Tony Berber Sardinha
Publisher: John Benjamins Publishing Company
ISBN: 9027270155
Category : Language Arts & Disciplines
Languages : en
Pages : 368
Book Description
Approximately a quarter of a century ago, the Multi-Dimensional (MD) approach—one of the most powerful (and controversial) methods in Corpus Linguistics—saw its first book-length treatment. In its eleven chapters, this volume presents all new contributions covering a wide range of written and spoken registers, such as movies, music, magazine texts, student writing, social media, letters to the editor, and reports, in different languages (English, Spanish, Portuguese) and contexts (engineering, journalism, the classroom, the entertainment industry, the Internet, etc.). The book also includes a personal account of the development of the method by its creator, Doug Biber, an introduction to MD statistics, as well as an application of MD analysis to corpus design. The book should be essential reading to anyone with an interest in how texts, genres, and registers are used in society, what their lexis and grammar look like, and how they are interrelated.
Publisher: John Benjamins Publishing Company
ISBN: 9027270155
Category : Language Arts & Disciplines
Languages : en
Pages : 368
Book Description
Approximately a quarter of a century ago, the Multi-Dimensional (MD) approach—one of the most powerful (and controversial) methods in Corpus Linguistics—saw its first book-length treatment. In its eleven chapters, this volume presents all new contributions covering a wide range of written and spoken registers, such as movies, music, magazine texts, student writing, social media, letters to the editor, and reports, in different languages (English, Spanish, Portuguese) and contexts (engineering, journalism, the classroom, the entertainment industry, the Internet, etc.). The book also includes a personal account of the development of the method by its creator, Doug Biber, an introduction to MD statistics, as well as an application of MD analysis to corpus design. The book should be essential reading to anyone with an interest in how texts, genres, and registers are used in society, what their lexis and grammar look like, and how they are interrelated.
Computational Processing of the Portuguese Language
Author: João Silva
Publisher: Springer
ISBN: 3319415522
Category : Computers
Languages : en
Pages : 402
Book Description
This book constitutes the refereed proceedings of the 12th International Conference on Computational Processing of the Portuguese Language, PROPOR 2016, held in Tomar, Portugal, in July 2016. The 23 full papers and 14 short papers presented in this volume were carefully reviewed and selected from 52 submissions. The papers are organized in topical sections named: language applications, language processing, and language resources.
Publisher: Springer
ISBN: 3319415522
Category : Computers
Languages : en
Pages : 402
Book Description
This book constitutes the refereed proceedings of the 12th International Conference on Computational Processing of the Portuguese Language, PROPOR 2016, held in Tomar, Portugal, in July 2016. The 23 full papers and 14 short papers presented in this volume were carefully reviewed and selected from 52 submissions. The papers are organized in topical sections named: language applications, language processing, and language resources.
Manual of Brazilian Portuguese Linguistics
Author: Johannes Kabatek
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3110405954
Category : Language Arts & Disciplines
Languages : en
Pages : 640
Book Description
This manual is the first comprehensive account of Brazilian Portuguese linguistics written in English, offering not only linguists but also historians and social scientists new insights gained from the intensive research carried out over the last decades on the linguistic reality of this vast territory. In the 20 overview chapters, internationally renowned experts give detailed yet concise information on a wide range of language-internal as well as external synchronic and diachronic topics. Most of this information is the fruit of large-scale language documentation and description projects, such as the project on the linguistic norm of educated speakers (NURC), the project “Grammar of spoken Portuguese”, and the project “Towards a History of Brazilian Portuguese” (PHPB), among others. Further chapters of high contemporary interest and relevance include the study of linguistic policies and psycholinguistics. The manual offers theoretical insights of general interest, not least since many chapters present the linguistic data in the light of a combination of formal, functional, generative and sociolinguistic approaches. This rather unique feature of the volume is achieved by the double authorship of some of the relevant chapters, thus bringing together and synthesizing different perspectives.
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3110405954
Category : Language Arts & Disciplines
Languages : en
Pages : 640
Book Description
This manual is the first comprehensive account of Brazilian Portuguese linguistics written in English, offering not only linguists but also historians and social scientists new insights gained from the intensive research carried out over the last decades on the linguistic reality of this vast territory. In the 20 overview chapters, internationally renowned experts give detailed yet concise information on a wide range of language-internal as well as external synchronic and diachronic topics. Most of this information is the fruit of large-scale language documentation and description projects, such as the project on the linguistic norm of educated speakers (NURC), the project “Grammar of spoken Portuguese”, and the project “Towards a History of Brazilian Portuguese” (PHPB), among others. Further chapters of high contemporary interest and relevance include the study of linguistic policies and psycholinguistics. The manual offers theoretical insights of general interest, not least since many chapters present the linguistic data in the light of a combination of formal, functional, generative and sociolinguistic approaches. This rather unique feature of the volume is achieved by the double authorship of some of the relevant chapters, thus bringing together and synthesizing different perspectives.
Text, Speech, and Dialogue
Author: Petr Sojka
Publisher: Springer
ISBN: 3319455109
Category : Computers
Languages : en
Pages : 565
Book Description
This book constitutes the refereed proceedings of the 19th International Conference on Text, Speech, and Dialogue, TSD 2016, held in Brno, CzechRepublic, in September 2016. The 62 papers presented together with 3 abstracts of invited talks were carefully reviewed and selected from 127 submissions. They focus on topics such as corpora and language resources; speech recognition; tagging, classification and parsing of text and speech; speech and spoken language generation; semantic processing of text and speech; integrating applications of text and speech processing; automatic dialogue systems; as well as multimodal techniques and modelling.
Publisher: Springer
ISBN: 3319455109
Category : Computers
Languages : en
Pages : 565
Book Description
This book constitutes the refereed proceedings of the 19th International Conference on Text, Speech, and Dialogue, TSD 2016, held in Brno, CzechRepublic, in September 2016. The 62 papers presented together with 3 abstracts of invited talks were carefully reviewed and selected from 127 submissions. They focus on topics such as corpora and language resources; speech recognition; tagging, classification and parsing of text and speech; speech and spoken language generation; semantic processing of text and speech; integrating applications of text and speech processing; automatic dialogue systems; as well as multimodal techniques and modelling.
Lexical Priming
Author: Michael Pace-Sigge
Publisher: John Benjamins Publishing Company
ISBN: 9027265410
Category : Language Arts & Disciplines
Languages : en
Pages : 335
Book Description
Published in 2005, Michael Hoey’s Lexical Priming – A new theory of words and language introduced a completely new theory of language based on how words are used in the real world. In the ten years that have passed, the theory has since gained traction in the field of corpus-linguistics. This volume brings together some of the most important contributions to the theory, in areas such as language teaching and learning, discourse analysis, stylistics as well as the design of language learning software. Crucially, this book introduces aspects of the language that have so far been given less focus in lexical priming, such as spoken language, figurative language, forced primings, priming as predictor of genre, and historical primings. The volume also focuses on applying the lexical priming theory to languages other than English including Mandarin Chinese and Finnish.
Publisher: John Benjamins Publishing Company
ISBN: 9027265410
Category : Language Arts & Disciplines
Languages : en
Pages : 335
Book Description
Published in 2005, Michael Hoey’s Lexical Priming – A new theory of words and language introduced a completely new theory of language based on how words are used in the real world. In the ten years that have passed, the theory has since gained traction in the field of corpus-linguistics. This volume brings together some of the most important contributions to the theory, in areas such as language teaching and learning, discourse analysis, stylistics as well as the design of language learning software. Crucially, this book introduces aspects of the language that have so far been given less focus in lexical priming, such as spoken language, figurative language, forced primings, priming as predictor of genre, and historical primings. The volume also focuses on applying the lexical priming theory to languages other than English including Mandarin Chinese and Finnish.
New Language Technologies and Linguistic Research
Author: Sandra Maria Aluisio
Publisher: Cambridge Scholars Publishing
ISBN: 1443858633
Category : Computers
Languages : en
Pages : 238
Book Description
This book is a collection of the papers presented and discussed at the 11th Corpus Linguistics Symposium (ELC 2012), held at the Instituto de Ciências Matemáticas e de Computação (Institute of Mathematics and Computer Science) of the University of São Paulo, at São Carlos, Brazil. The sessions addressed the following six topics: Corpus Linguistics and Language Description; Translation, Terminology and Corpora; Spoken Language and Corpora; Natural Language Processing and Corpora; Corpus Annotation; and Corpora and Multiple Documents. These unique studies will inspire readers with an interest in Linguistics, and will provide motivation for conducting further research in the interdisciplinary area of Language Technologies and Linguistic Research.
Publisher: Cambridge Scholars Publishing
ISBN: 1443858633
Category : Computers
Languages : en
Pages : 238
Book Description
This book is a collection of the papers presented and discussed at the 11th Corpus Linguistics Symposium (ELC 2012), held at the Instituto de Ciências Matemáticas e de Computação (Institute of Mathematics and Computer Science) of the University of São Paulo, at São Carlos, Brazil. The sessions addressed the following six topics: Corpus Linguistics and Language Description; Translation, Terminology and Corpora; Spoken Language and Corpora; Natural Language Processing and Corpora; Corpus Annotation; and Corpora and Multiple Documents. These unique studies will inspire readers with an interest in Linguistics, and will provide motivation for conducting further research in the interdisciplinary area of Language Technologies and Linguistic Research.
Computational Processing of the Portuguese Language
Author: Vládia Pinheiro
Publisher: Springer Nature
ISBN: 3030983056
Category : Computers
Languages : en
Pages : 447
Book Description
This book constitutes the proceedings of the 15th International Conference on Computational Processing of the Portuguese Language, PROPOR 2021, held in Fortaleza, Brazil, in March 2021. The 36 full papers presented together with 4 short papers were carefully reviewed and selected from 88 submissions. They are grouped in topical sections on speech processing; resources and evaluation; natural language processing applications; semantics; natural language processing tasks; and multilinguality.
Publisher: Springer Nature
ISBN: 3030983056
Category : Computers
Languages : en
Pages : 447
Book Description
This book constitutes the proceedings of the 15th International Conference on Computational Processing of the Portuguese Language, PROPOR 2021, held in Fortaleza, Brazil, in March 2021. The 36 full papers presented together with 4 short papers were carefully reviewed and selected from 88 submissions. They are grouped in topical sections on speech processing; resources and evaluation; natural language processing applications; semantics; natural language processing tasks; and multilinguality.
Formalising Natural Languages: Applications to Natural Language Processing and Digital Humanities
Author: Božo Bekavac
Publisher: Springer Nature
ISBN: 303070629X
Category : Computers
Languages : en
Pages : 253
Book Description
This book constitutes selected revised papers of the 14th International Conference, NooJ 2020, held Zagreb, Croatia, in June 2020. Due to the COVID-19 pandemic the conference was held online. NooJ is a linguistic development environment that allows linguists to formalize several levels of linguistic phenomena. NooJ provides linguists with tools to develop dictionaries, regular grammars, context-free grammars, context-sensitive grammars and unrestricted grammars as well as their graphical equivalent to formalize each linguistic phenomenon. The 20 full papers presented were carefully reviewed and selected from 68 submissions. The papers are organized in the following topics: Linguistic Formalization; Digital Humanities and Teaching with NooJ; Natural Language Processing Applications.
Publisher: Springer Nature
ISBN: 303070629X
Category : Computers
Languages : en
Pages : 253
Book Description
This book constitutes selected revised papers of the 14th International Conference, NooJ 2020, held Zagreb, Croatia, in June 2020. Due to the COVID-19 pandemic the conference was held online. NooJ is a linguistic development environment that allows linguists to formalize several levels of linguistic phenomena. NooJ provides linguists with tools to develop dictionaries, regular grammars, context-free grammars, context-sensitive grammars and unrestricted grammars as well as their graphical equivalent to formalize each linguistic phenomenon. The 20 full papers presented were carefully reviewed and selected from 68 submissions. The papers are organized in the following topics: Linguistic Formalization; Digital Humanities and Teaching with NooJ; Natural Language Processing Applications.