Author: M. Färber
Publisher: IOS Press
ISBN: 1614997756
Category : Computers
Languages : en
Pages : 214
Book Description
In this book, new approaches are presented for detecting and extracting simultaneously relevant and novel information from unstructured text documents. A major contribution of these approaches is that the information already provided and the extracted information are modeled semantically. This leads to the following benefits: (a) ambiguities in the language can be resolved; (b) the exact information needs regarding relevance and novelty can be specified; and (c) knowledge graphs can be incorporated. More specifically, this book presents the following scientific contributions: 1. An assessment of the suitability of existing large knowledge graphs (namely, DBpedia, Freebase, OpenCyc, Wikidata, and YAGO) for the task of detecting novel information in text documents. 2. A description of an approach by which emerging entities that are missing in a knowledge graph are detected in a stream of text documents. 3. A suggestion for an approach to extracting novel, relevant, semantically-structured statements from text documents. The developed approaches are suitable for the recommendation of emerging entities and novel statements respectively, for the purpose of knowledge graph population, and for providing assistance to users requiring novel information, such as journalists and technology scouts.
Semantic Search for Novel Information
Author: M. Färber
Publisher: IOS Press
ISBN: 1614997756
Category : Computers
Languages : en
Pages : 214
Book Description
In this book, new approaches are presented for detecting and extracting simultaneously relevant and novel information from unstructured text documents. A major contribution of these approaches is that the information already provided and the extracted information are modeled semantically. This leads to the following benefits: (a) ambiguities in the language can be resolved; (b) the exact information needs regarding relevance and novelty can be specified; and (c) knowledge graphs can be incorporated. More specifically, this book presents the following scientific contributions: 1. An assessment of the suitability of existing large knowledge graphs (namely, DBpedia, Freebase, OpenCyc, Wikidata, and YAGO) for the task of detecting novel information in text documents. 2. A description of an approach by which emerging entities that are missing in a knowledge graph are detected in a stream of text documents. 3. A suggestion for an approach to extracting novel, relevant, semantically-structured statements from text documents. The developed approaches are suitable for the recommendation of emerging entities and novel statements respectively, for the purpose of knowledge graph population, and for providing assistance to users requiring novel information, such as journalists and technology scouts.
Publisher: IOS Press
ISBN: 1614997756
Category : Computers
Languages : en
Pages : 214
Book Description
In this book, new approaches are presented for detecting and extracting simultaneously relevant and novel information from unstructured text documents. A major contribution of these approaches is that the information already provided and the extracted information are modeled semantically. This leads to the following benefits: (a) ambiguities in the language can be resolved; (b) the exact information needs regarding relevance and novelty can be specified; and (c) knowledge graphs can be incorporated. More specifically, this book presents the following scientific contributions: 1. An assessment of the suitability of existing large knowledge graphs (namely, DBpedia, Freebase, OpenCyc, Wikidata, and YAGO) for the task of detecting novel information in text documents. 2. A description of an approach by which emerging entities that are missing in a knowledge graph are detected in a stream of text documents. 3. A suggestion for an approach to extracting novel, relevant, semantically-structured statements from text documents. The developed approaches are suitable for the recommendation of emerging entities and novel statements respectively, for the purpose of knowledge graph population, and for providing assistance to users requiring novel information, such as journalists and technology scouts.
Google Semantic Search
Author: Dave Amerland
Publisher: Pearson Education
ISBN: 0789751348
Category : Business & Economics
Languages : en
Pages : 240
Book Description
Optimize Your Sites for Today's Radically New Semantic Search Breakthrough "semantic search" techniques are already transforming Google(tm)'s search results. If you want to be found, yesterday's SEO techniques won't cut it anymore. Google Semantic Search tells you what to do instead--in plain English. David Amerland demystifies Knowledge Graph(tm), TrustRank(tm), AuthorityRank(tm), personalized and mobile search, social media activity, and much more. Drawing on deep knowledge of Google's internal workings and newest patents, he also reveals the growing impact of social networks on your SEO performance. Whether you do it yourself or supervise an agency, this is your complete playbook for next-generation SEO! * Learn how Google is delivering answers, not just links--and what it means to you * Profit from Google Now(tm) and the fragmented, personalized future of search * Prepare for Knowledge Graph(tm) by growing your online reputation, authority, and trust * Stop using 10 common SEO techniques that no longer work * Discover the truth about Trust Ranking(tm)--and 10 steps to take right now * Go way beyond keywords in today's new era of content marketing * Strengthen the "social signal" you create on Twitter, Facebook, Google+, and LinkedIn * See why the "First Page of Google" is rapidly become obsolete * Drive unprecedented business value from your online identity and influence * Learn how Google captures meaning in unstructured data--and give it what it wants * Plan for all "4 Vs" of semantic search: Volume, Velocity, Variety, and Veracity * Rapidly transition from technical to strategic search optimization http://helpmyseo.com/google-semantic-search.html
Publisher: Pearson Education
ISBN: 0789751348
Category : Business & Economics
Languages : en
Pages : 240
Book Description
Optimize Your Sites for Today's Radically New Semantic Search Breakthrough "semantic search" techniques are already transforming Google(tm)'s search results. If you want to be found, yesterday's SEO techniques won't cut it anymore. Google Semantic Search tells you what to do instead--in plain English. David Amerland demystifies Knowledge Graph(tm), TrustRank(tm), AuthorityRank(tm), personalized and mobile search, social media activity, and much more. Drawing on deep knowledge of Google's internal workings and newest patents, he also reveals the growing impact of social networks on your SEO performance. Whether you do it yourself or supervise an agency, this is your complete playbook for next-generation SEO! * Learn how Google is delivering answers, not just links--and what it means to you * Profit from Google Now(tm) and the fragmented, personalized future of search * Prepare for Knowledge Graph(tm) by growing your online reputation, authority, and trust * Stop using 10 common SEO techniques that no longer work * Discover the truth about Trust Ranking(tm)--and 10 steps to take right now * Go way beyond keywords in today's new era of content marketing * Strengthen the "social signal" you create on Twitter, Facebook, Google+, and LinkedIn * See why the "First Page of Google" is rapidly become obsolete * Drive unprecedented business value from your online identity and influence * Learn how Google captures meaning in unstructured data--and give it what it wants * Plan for all "4 Vs" of semantic search: Volume, Velocity, Variety, and Veracity * Rapidly transition from technical to strategic search optimization http://helpmyseo.com/google-semantic-search.html
Information Sharing on the Semantic Web
Author: Heiner Stuckenschmidt
Publisher: Springer Science & Business Media
ISBN: 9783540205944
Category : Business & Economics
Languages : en
Pages : 300
Book Description
The large-scale and almost ubiquitous availability of information has become as much of a curse as it is a blessing. The more information is available, the harder it is to locate any particular piece of it. And even when it has been successfully found, it is even harder still to usefully combine it with other information we may already possess. This problem occurs at many different levels, ranging from the overcrowded disks of our own PCs to the mass of unstructured information on the World Wide Web.It is commonly understood that this problem of information sharing can only be solved by giving computers better access to the semantics of the information. While it has been recognized that ontologies play a crucial role in solving the open problems, most approaches rely on the existence of well-established data structures. To overcome these shortcomings, Stuckenschmidt and van Harmelen describe ontology-based approaches for resolving semantic heterogeneity in weakly structured environments, in particular the World Wide Web. Addressing problems like missing conceptual models, unclear system boundaries, and heterogeneous representations, they design a framework for ontology-based information sharing in weakly structured environments like the Semantic Web.For researchers and students in areas related to the Semantic Web, the authors provide not only a comprehensive overview of the State of the art, but also present in detail recent research in areas like ontology design for information integration, metadata generation and management, and representation and management of distributed ontologies. For professionals in areas such as e-commerce (e.g., the exchange of product knowledge) and knowledge management (e.g., in large and distributed organizations), the book provides decision support on the use of novel technologies, information about potential problems, and guidelines for the successful application of existing technologies.
Publisher: Springer Science & Business Media
ISBN: 9783540205944
Category : Business & Economics
Languages : en
Pages : 300
Book Description
The large-scale and almost ubiquitous availability of information has become as much of a curse as it is a blessing. The more information is available, the harder it is to locate any particular piece of it. And even when it has been successfully found, it is even harder still to usefully combine it with other information we may already possess. This problem occurs at many different levels, ranging from the overcrowded disks of our own PCs to the mass of unstructured information on the World Wide Web.It is commonly understood that this problem of information sharing can only be solved by giving computers better access to the semantics of the information. While it has been recognized that ontologies play a crucial role in solving the open problems, most approaches rely on the existence of well-established data structures. To overcome these shortcomings, Stuckenschmidt and van Harmelen describe ontology-based approaches for resolving semantic heterogeneity in weakly structured environments, in particular the World Wide Web. Addressing problems like missing conceptual models, unclear system boundaries, and heterogeneous representations, they design a framework for ontology-based information sharing in weakly structured environments like the Semantic Web.For researchers and students in areas related to the Semantic Web, the authors provide not only a comprehensive overview of the State of the art, but also present in detail recent research in areas like ontology design for information integration, metadata generation and management, and representation and management of distributed ontologies. For professionals in areas such as e-commerce (e.g., the exchange of product knowledge) and knowledge management (e.g., in large and distributed organizations), the book provides decision support on the use of novel technologies, information about potential problems, and guidelines for the successful application of existing technologies.
Entity-Oriented Search
Author: Krisztian Balog
Publisher: Springer
ISBN: 3319939351
Category : Computers
Languages : en
Pages : 358
Book Description
This open access book covers all facets of entity-oriented search—where “search” can be interpreted in the broadest sense of information access—from a unified point of view, and provides a coherent and comprehensive overview of the state of the art. It represents the first synthesis of research in this broad and rapidly developing area. Selected topics are discussed in-depth, the goal being to establish fundamental techniques and methods as a basis for future research and development. Additional topics are treated at a survey level only, containing numerous pointers to the relevant literature. A roadmap for future research, based on open issues and challenges identified along the way, rounds out the book. The book is divided into three main parts, sandwiched between introductory and concluding chapters. The first two chapters introduce readers to the basic concepts, provide an overview of entity-oriented search tasks, and present the various types and sources of data that will be used throughout the book. Part I deals with the core task of entity ranking: given a textual query, possibly enriched with additional elements or structural hints, return a ranked list of entities. This core task is examined in a number of different variants, using both structured and unstructured data collections, and numerous query formulations. In turn, Part II is devoted to the role of entities in bridging unstructured and structured data. Part III explores how entities can enable search engines to understand the concepts, meaning, and intent behind the query that the user enters into the search box, and how they can provide rich and focused responses (as opposed to merely a list of documents)—a process known as semantic search. The final chapter concludes the book by discussing the limitations of current approaches, and suggesting directions for future research. Researchers and graduate students are the primary target audience of this book. A general background in information retrieval is sufficient to follow the material, including an understanding of basic probability and statistics concepts as well as a basic knowledge of machine learning concepts and supervised learning algorithms.
Publisher: Springer
ISBN: 3319939351
Category : Computers
Languages : en
Pages : 358
Book Description
This open access book covers all facets of entity-oriented search—where “search” can be interpreted in the broadest sense of information access—from a unified point of view, and provides a coherent and comprehensive overview of the state of the art. It represents the first synthesis of research in this broad and rapidly developing area. Selected topics are discussed in-depth, the goal being to establish fundamental techniques and methods as a basis for future research and development. Additional topics are treated at a survey level only, containing numerous pointers to the relevant literature. A roadmap for future research, based on open issues and challenges identified along the way, rounds out the book. The book is divided into three main parts, sandwiched between introductory and concluding chapters. The first two chapters introduce readers to the basic concepts, provide an overview of entity-oriented search tasks, and present the various types and sources of data that will be used throughout the book. Part I deals with the core task of entity ranking: given a textual query, possibly enriched with additional elements or structural hints, return a ranked list of entities. This core task is examined in a number of different variants, using both structured and unstructured data collections, and numerous query formulations. In turn, Part II is devoted to the role of entities in bridging unstructured and structured data. Part III explores how entities can enable search engines to understand the concepts, meaning, and intent behind the query that the user enters into the search box, and how they can provide rich and focused responses (as opposed to merely a list of documents)—a process known as semantic search. The final chapter concludes the book by discussing the limitations of current approaches, and suggesting directions for future research. Researchers and graduate students are the primary target audience of this book. A general background in information retrieval is sufficient to follow the material, including an understanding of basic probability and statistics concepts as well as a basic knowledge of machine learning concepts and supervised learning algorithms.
Semantic Search on Text and Knowledge Bases
Author: Hannah Bast
Publisher:
ISBN: 9781680831641
Category : Computers
Languages : en
Pages : 170
Book Description
Provides a comprehensive overview of the broad area of semantic search on text and knowledge bases. It is as self-contained as possible, and serves as a good tutorial for newcomers to this fascinating and highly topical field.
Publisher:
ISBN: 9781680831641
Category : Computers
Languages : en
Pages : 170
Book Description
Provides a comprehensive overview of the broad area of semantic search on text and knowledge bases. It is as self-contained as possible, and serves as a good tutorial for newcomers to this fascinating and highly topical field.
Neural Generation of Textual Summaries from Knowledge Base Triples
Author: P. Vougiouklis
Publisher: IOS Press
ISBN: 1643680676
Category : Computers
Languages : en
Pages : 174
Book Description
Most people need textual or visual interfaces to help them make sense of Semantic Web data. In this book, the author investigates the problems associated with generating natural language summaries for structured data encoded as triples using deep neural networks. An end-to-end trainable architecture is proposed, which encodes the information from a set of knowledge graph triples into a vector of fixed dimensionality, and generates a textual summary by conditioning the output on this encoded vector. Different methodologies for building the required data-to-text corpora are explored to train and evaluate the performance of the approach. Attention is first focused on generating biographies, and the author demonstrates that the technique is capable of scaling to domains with larger and more challenging vocabularies. The applicability of the technique for the generation of open-domain Wikipedia summaries in Arabic and Esperanto – two under-resourced languages – is then discussed, and a set of community studies, devised to measure the usability of the automatically generated content by Wikipedia readers and editors, is described. Finally, the book explains an extension of the original model with a pointer mechanism that enables it to learn to verbalise in a different number of ways the content from the triples while retaining the capacity to generate words from a fixed target vocabulary. The evaluation of performance using a dataset encompassing all of English Wikipedia is described, with results from both automatic and human evaluation both of which highlight the superiority of the latter approach as compared to the original architecture.
Publisher: IOS Press
ISBN: 1643680676
Category : Computers
Languages : en
Pages : 174
Book Description
Most people need textual or visual interfaces to help them make sense of Semantic Web data. In this book, the author investigates the problems associated with generating natural language summaries for structured data encoded as triples using deep neural networks. An end-to-end trainable architecture is proposed, which encodes the information from a set of knowledge graph triples into a vector of fixed dimensionality, and generates a textual summary by conditioning the output on this encoded vector. Different methodologies for building the required data-to-text corpora are explored to train and evaluate the performance of the approach. Attention is first focused on generating biographies, and the author demonstrates that the technique is capable of scaling to domains with larger and more challenging vocabularies. The applicability of the technique for the generation of open-domain Wikipedia summaries in Arabic and Esperanto – two under-resourced languages – is then discussed, and a set of community studies, devised to measure the usability of the automatically generated content by Wikipedia readers and editors, is described. Finally, the book explains an extension of the original model with a pointer mechanism that enables it to learn to verbalise in a different number of ways the content from the triples while retaining the capacity to generate words from a fixed target vocabulary. The evaluation of performance using a dataset encompassing all of English Wikipedia is described, with results from both automatic and human evaluation both of which highlight the superiority of the latter approach as compared to the original architecture.
Semantic Search over the Web
Author: Roberto De Virgilio
Publisher: Springer Science & Business Media
ISBN: 3642250084
Category : Computers
Languages : en
Pages : 418
Book Description
The Web has become the world’s largest database, with search being the main tool that allows organizations and individuals to exploit its huge amount of information. Search on the Web has been traditionally based on textual and structural similarities, ignoring to a large degree the semantic dimension, i.e., understanding the meaning of the query and of the document content. Combining search and semantics gives birth to the idea of semantic search. Traditional search engines have already advertised some semantic dimensions. Some of them, for instance, can enhance their generated result sets with documents that are semantically related to the query terms even though they may not include these terms. Nevertheless, the exploitation of the semantic search has not yet reached its full potential. In this book, Roberto De Virgilio, Francesco Guerra and Yannis Velegrakis present an extensive overview of the work done in Semantic Search and other related areas. They explore different technologies and solutions in depth, making their collection a valuable and stimulating reading for both academic and industrial researchers. The book is divided into three parts. The first introduces the readers to the basic notions of the Web of Data. It describes the different kinds of data that exist, their topology, and their storing and indexing techniques. The second part is dedicated to Web Search. It presents different types of search, like the exploratory or the path-oriented, alongside methods for their efficient and effective implementation. Other related topics included in this part are the use of uncertainty in query answering, the exploitation of ontologies, and the use of semantics in mashup design and operation. The focus of the third part is on linked data, and more specifically, on applying ideas originating in recommender systems on linked data management, and on techniques for the efficiently querying answering on linked data.
Publisher: Springer Science & Business Media
ISBN: 3642250084
Category : Computers
Languages : en
Pages : 418
Book Description
The Web has become the world’s largest database, with search being the main tool that allows organizations and individuals to exploit its huge amount of information. Search on the Web has been traditionally based on textual and structural similarities, ignoring to a large degree the semantic dimension, i.e., understanding the meaning of the query and of the document content. Combining search and semantics gives birth to the idea of semantic search. Traditional search engines have already advertised some semantic dimensions. Some of them, for instance, can enhance their generated result sets with documents that are semantically related to the query terms even though they may not include these terms. Nevertheless, the exploitation of the semantic search has not yet reached its full potential. In this book, Roberto De Virgilio, Francesco Guerra and Yannis Velegrakis present an extensive overview of the work done in Semantic Search and other related areas. They explore different technologies and solutions in depth, making their collection a valuable and stimulating reading for both academic and industrial researchers. The book is divided into three parts. The first introduces the readers to the basic notions of the Web of Data. It describes the different kinds of data that exist, their topology, and their storing and indexing techniques. The second part is dedicated to Web Search. It presents different types of search, like the exploratory or the path-oriented, alongside methods for their efficient and effective implementation. Other related topics included in this part are the use of uncertainty in query answering, the exploitation of ontologies, and the use of semantics in mashup design and operation. The focus of the third part is on linked data, and more specifically, on applying ideas originating in recommender systems on linked data management, and on techniques for the efficiently querying answering on linked data.
Identity of Long-tail Entities in Text
Author: F. Ilievski
Publisher: IOS Press
ISBN: 1643680439
Category : Computers
Languages : en
Pages : 229
Book Description
The digital era has generated a huge amount of data on the identities (profiles) of people, organizations and other entities in a digital format, largely consisting of textual documents such as news articles, encyclopedias, personal websites, books, and social media. Identity has thus been transformed from a philosophical to a societal issue, one requiring robust computational tools to determine entity identity in text. Computational systems developed to establish identity in text often struggle with long-tail cases. This book investigates how Natural Language Processing (NLP) techniques for establishing the identity of long-tail entities – which are all infrequent in communication, hardly represented in knowledge bases, and potentially very ambiguous – can be improved through the use of background knowledge. Topics covered include: distinguishing tail entities from head entities; assessing whether current evaluation datasets and metrics are representative for long-tail cases; improving evaluation of long-tail cases; accessing and enriching knowledge on long-tail entities in the Linked Open Data cloud; and investigating the added value of background knowledge (“profiling”) models for establishing the identity of NIL entities. Providing novel insights into an under-explored and difficult NLP challenge, the book will be of interest to all those working in the field of entity identification in text.
Publisher: IOS Press
ISBN: 1643680439
Category : Computers
Languages : en
Pages : 229
Book Description
The digital era has generated a huge amount of data on the identities (profiles) of people, organizations and other entities in a digital format, largely consisting of textual documents such as news articles, encyclopedias, personal websites, books, and social media. Identity has thus been transformed from a philosophical to a societal issue, one requiring robust computational tools to determine entity identity in text. Computational systems developed to establish identity in text often struggle with long-tail cases. This book investigates how Natural Language Processing (NLP) techniques for establishing the identity of long-tail entities – which are all infrequent in communication, hardly represented in knowledge bases, and potentially very ambiguous – can be improved through the use of background knowledge. Topics covered include: distinguishing tail entities from head entities; assessing whether current evaluation datasets and metrics are representative for long-tail cases; improving evaluation of long-tail cases; accessing and enriching knowledge on long-tail entities in the Linked Open Data cloud; and investigating the added value of background knowledge (“profiling”) models for establishing the identity of NIL entities. Providing novel insights into an under-explored and difficult NLP challenge, the book will be of interest to all those working in the field of entity identification in text.
Innovations, Developments, and Applications of Semantic Web and Information Systems
Author: Lytras, Miltiadis D.
Publisher: IGI Global
ISBN: 1522550437
Category : Computers
Languages : en
Pages : 493
Book Description
In the last few years, there has been an increased advancement and evolution in semantic web and information systems in a variety of fields. The integration of these approaches to ontology engineering, sophisticated methods and algorithms for open linked data extraction, and advanced decision-making creates new opportunities for a bright future. Innovations, Developments, and Applications of Semantic Web and Information Systems is a critical scholarly resource that discusses integrated methods of research and analytics in information technology. Featuring coverage on a broad range of topics, such as cognitive computing, artificial intelligence, machine learning, data analysis, and algorithms, this book is geared towards researchers, academicians, and professionals seeking current information on semantic web and information systems.
Publisher: IGI Global
ISBN: 1522550437
Category : Computers
Languages : en
Pages : 493
Book Description
In the last few years, there has been an increased advancement and evolution in semantic web and information systems in a variety of fields. The integration of these approaches to ontology engineering, sophisticated methods and algorithms for open linked data extraction, and advanced decision-making creates new opportunities for a bright future. Innovations, Developments, and Applications of Semantic Web and Information Systems is a critical scholarly resource that discusses integrated methods of research and analytics in information technology. Featuring coverage on a broad range of topics, such as cognitive computing, artificial intelligence, machine learning, data analysis, and algorithms, this book is geared towards researchers, academicians, and professionals seeking current information on semantic web and information systems.
Advances in Ontology Design and Patterns
Author: K. Hammar
Publisher: IOS Press
ISBN: 1614998264
Category : Computers
Languages : en
Pages : 162
Book Description
The study of patterns in the context of ontology engineering for the semantic web was pioneered more than a decade ago by Blomqvist, Sandkuhl and Gangemi. Since then, this line of research has flourished and led to the development of ontology design patterns, knowledge patterns, and linked data patterns: the patterns as they are known by ontology designers, knowledge engineers, and linked data publishers, respectively. A key characteristic of those patterns is that they are modular and reusable solutions to recurrent problems in ontology engineering and linked data publishing. This book contains recent contributions which advance the state of the art on theory and use of ontology design patterns. The papers collected in this book cover a range of topics, from a method to instantiate content patterns, a proposal on how to document a content pattern, to a number of patterns emerging in ontology modeling in various situations.
Publisher: IOS Press
ISBN: 1614998264
Category : Computers
Languages : en
Pages : 162
Book Description
The study of patterns in the context of ontology engineering for the semantic web was pioneered more than a decade ago by Blomqvist, Sandkuhl and Gangemi. Since then, this line of research has flourished and led to the development of ontology design patterns, knowledge patterns, and linked data patterns: the patterns as they are known by ontology designers, knowledge engineers, and linked data publishers, respectively. A key characteristic of those patterns is that they are modular and reusable solutions to recurrent problems in ontology engineering and linked data publishing. This book contains recent contributions which advance the state of the art on theory and use of ontology design patterns. The papers collected in this book cover a range of topics, from a method to instantiate content patterns, a proposal on how to document a content pattern, to a number of patterns emerging in ontology modeling in various situations.