From Complex Sentences to a Formal Semantic Representation using Syntactic Text Simplification and Open Information Extraction

Author: Christina Niklaus
Publisher: Springer Nature
ISBN: 3658386975
Category : Language Arts & Disciplines
Languages : en
Pages : 340

Book Description
This work presents a discourse-aware Text Simplification approach that splits and rephrases complex English sentences within the semantic context in which they occur. Based on a linguistically grounded transformation stage, complex sentences are transformed into shorter utterances with a simple canonical structure that can be easily analyzed by downstream applications. To avoid breaking down the input into a disjointed sequence of statements that is difficult to interpret, the author incorporates the semantic context between the split propositions in the form of hierarchical structures and semantic relationships, thus generating a novel representation of complex assertions that puts a semantic layer on top of the simplified sentences. In a second step, she leverages the semantic hierarchy of minimal propositions to improve the performance of Open IE frameworks. She shows that such systems benefit in two dimensions. First, the canonical structure of the simplified sentences facilitates the extraction of relational tuples, leading to an improved precision and recall of the extracted relations. Second, the semantic hierarchy can be leveraged to enrich the output of existing Open IE approaches with additional meta-information, resulting in a novel lightweight semantic representation for complex text data in the form of normalized and context-preserving relational tuples.

From Complex Sentences to a Formal Semantic Representation Using Syntactic Text Simplification and Open Information Extraction

Author: Christina Niklaus
Publisher:
ISBN: 9783658386986
Category :
Languages : en
Pages : 0

Book Description
This work presents a discourse-aware Text Simplification approach that splits and rephrases complex English sentences within the semantic context in which they occur. Based on a linguistically grounded transformation stage, complex sentences are transformed into shorter utterances with a simple canonical structure that can be easily analyzed by downstream applications. To avoid breaking down the input into a disjointed sequence of statements that is difficult to interpret, the author incorporates the semantic context between the split propositions in the form of hierarchical structures and semantic relationships, thus generating a novel representation of complex assertions that puts a semantic layer on top of the simplified sentences. In a second step, she leverages the semantic hierarchy of minimal propositions to improve the performance of Open IE frameworks. She shows that such systems benefit in two dimensions. First, the canonical structure of the simplified sentences facilitates the extraction of relational tuples, leading to an improved precision and recall of the extracted relations. Second, the semantic hierarchy can be leveraged to enrich the output of existing Open IE approaches with additional meta-information, resulting in a novel lightweight semantic representation for complex text data in the form of normalized and context-preserving relational tuples.

About the Author
Christina Niklaus is an Assistant Professor in Computer Science at the University of St.Gallen with a focus on Data Science and NLP.

Automatic Text Simplification

Author: Horacio Saggion
Publisher: Springer Nature
ISBN: 3031021665
Category : Computers
Languages : en
Pages : 121

Book Description
Thanks to the availability of texts on the Web in recent years, increased knowledge and information have been made available to broader audiences. However, the way in which a text is written—its vocabulary, its syntax—can be difficult to read and understand for many people, especially those with poor literacy, cognitive or linguistic impairment, or those with limited knowledge of the language of the text. Texts containing uncommon words or long and complicated sentences can be difficult for people to read and understand as well as difficult for machines to analyze. Automatic text simplification is the process of transforming a text into another text which, ideally conveying the same message, will be easier to read and understand by a broader audience. The process usually involves the replacement of difficult or unknown phrases with simpler equivalents and the transformation of long and syntactically complex sentences into shorter and less complex ones. Automatic text simplification, a research topic which started 20 years ago, has now taken on a central role in natural language processing research not only because of the interesting challenges it poses but also because of its social implications. This book presents past and current research in text simplification, exploring key issues including automatic readability assessment, lexical simplification, and syntactic simplification. It also provides a detailed account of machine learning techniques currently used in simplification, describes full systems designed for specific languages and target audiences, and offers available resources for research and development together with text simplification evaluation techniques.

An Introduction to Syntactic Analysis and Theory

Author: Dominique Sportiche
Publisher: John Wiley & Sons
ISBN: 1118470478
Category : Language Arts & Disciplines
Languages : en
Pages : 483

Book Description
An Introduction to Syntactic Analysis and Theory offers beginning students a comprehensive overview of and introduction to our current understanding of the rules and principles that govern the syntax of natural languages. Includes numerous pedagogical features such as 'practice' boxes and sidebars, designed to facilitate understanding of both the 'hows' and the 'whys' of sentence structure. Guides readers through syntactic and morphological structures in a progressive manner. Takes the mystery out of one of the most crucial aspects of the workings of language – the principles and processes behind the structure of sentences. Ideal for students with minimal knowledge of current syntactic research, it progresses in theoretical difficulty from basic ideas and theories to more complex and advanced, up-to-date concepts in syntactic theory.

The Oxford Handbook of Computational Linguistics

Author: Ruslan Mitkov
Publisher: Oxford University Press
ISBN: 019927634X
Category : Computers
Languages : en
Pages : 808

Book Description
This handbook of computational linguistics, written for academics, graduate students and researchers, provides a state-of-the-art reference to one of the most active and productive fields in linguistics.

Natural Language Processing with Python

Author: Steven Bird
Publisher: "O'Reilly Media, Inc."
ISBN: 0596555717
Category : Computers
Languages : en
Pages : 506

Book Description
This book offers a highly accessible introduction to natural language processing, the field that supports a variety of language technologies, from predictive text and email filtering to automatic summarization and translation. With it, you'll learn how to write Python programs that work with large collections of unstructured text. You'll access richly annotated datasets using a comprehensive range of linguistic data structures, and you'll understand the main algorithms for analyzing the content and structure of written communication. Packed with examples and exercises, Natural Language Processing with Python will help you: extract information from unstructured text, either to guess the topic or identify "named entities"; analyze linguistic structure in text, including parsing and semantic analysis; access popular linguistic databases, including WordNet and treebanks; and integrate techniques drawn from fields as diverse as linguistics and artificial intelligence. This book will help you gain practical skills in natural language processing using the Python programming language and the Natural Language Toolkit (NLTK) open source library. If you're interested in developing web applications, analyzing multilingual news sources, or documenting endangered languages -- or if you're simply curious to have a programmer's perspective on how human language works -- you'll find Natural Language Processing with Python both fascinating and immensely useful.

The Cambridge Handbook of Psycholinguistics

Author: Michael Spivey
Publisher: Cambridge University Press
ISBN: 1139536141
Category : Psychology
Languages : en
Pages : 1297

Book Description
Our ability to speak, write, understand speech and read is critical to our ability to function in today's society. As such, psycholinguistics, or the study of how humans learn and use language, is a central topic in cognitive science. This comprehensive handbook is a collection of chapters written not by practitioners in the field, who can summarize the work going on around them, but by trailblazers from a wide array of subfields, who have been shaping the field of psycholinguistics over the last decade. Some topics discussed include how children learn language, how average adults understand and produce language, how language is represented in the brain, how brain-damaged individuals perform in terms of their language abilities and computer-based models of language and meaning. This is required reading for advanced researchers, graduate students and upper-level undergraduates who are interested in the recent developments and the future of psycholinguistics.

Representation Learning for Natural Language Processing

Author: Zhiyuan Liu
Publisher: Springer Nature
ISBN: 9811555737
Category : Computers
Languages : en
Pages : 319

Book Description
This open access book provides an overview of the recent advances in representation learning theory, algorithms and applications for natural language processing (NLP). It is divided into three parts. Part I presents the representation learning techniques for multiple language entries, including words, phrases, sentences and documents. Part II then introduces the representation techniques for those objects that are closely related to NLP, including entity-based world knowledge, sememe-based linguistic knowledge, networks, and cross-modal entries. Lastly, Part III provides open resource tools for representation learning techniques, and discusses the remaining challenges and future research directions. The theories and algorithms of representation learning presented can also benefit other related domains such as machine learning, social network analysis, semantic Web, information retrieval, data mining and computational biology. This book is intended for advanced undergraduate and graduate students, post-doctoral fellows, researchers, lecturers, and industrial engineers, as well as anyone interested in representation learning and natural language processing.

Linguistic Databases

Author: John A. Nerbonne
Publisher: Center for the Study of Language and Information Publications
ISBN: 9781575860930
Category : Language Arts & Disciplines
Languages : en
Pages : 255

Book Description
Linguistic Databases explores the increasing use of databases in linguistics. The enormous potential in linguistic data - billions of utterances and messages daily - has been difficult to exploit. Many linguists have had to concentrate on introspective data with its inevitable blinders toward frequency, variation, and naturalness. Applications of linguistics have been handicapped. This volume explores the potential advantages of database applications to linguistics. Included in this volume are reports on database activities in phonetics, phonology, lexicography and syntax, comparative grammar, second-language acquisition, linguistic fieldwork, and language pathology. The book presents the specialized problems of multi-media (especially audio) and multi-lingual texts, including those in exotic writing systems. Implemented solutions are also discussed. The opportunities to use existing, minimally structured text repositories are presented.

The Cambridge Handbook of Generative Syntax

Author: Marcel den Dikken
Publisher: Cambridge University Press
ISBN: 1107354587
Category : Language Arts & Disciplines
Languages : en
Pages : 1412

Book Description
Syntax – the study of sentence structure – has been at the centre of generative linguistics from its inception and has developed rapidly and in various directions. The Cambridge Handbook of Generative Syntax provides a historical context for what is happening in the field of generative syntax today, a survey of the various generative approaches to syntactic structure available in the literature and an overview of the state of the art in the principal modules of the theory and the interfaces with semantics, phonology, information structure and sentence processing, as well as linguistic variation and language acquisition. This indispensable resource for advanced students, professional linguists (generative and non-generative alike) and scholars in related fields of inquiry presents a comprehensive survey of the field of generative syntactic research in all its variety, written by leading experts and providing a proper sense of the range of syntactic theories calling themselves generative.