Query Processing over Incomplete Databases

Author: Yunjun Gao
Publisher: Springer Nature
ISBN: 303101863X
Category: Computers
Languages: en
Pages: 106

Book Description
Incomplete data is a part of life and of almost all areas of scientific study. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is deliberately left incomplete in order to protect sensitive attribute values. Query processing is a fundamental problem in computer science and is useful in a variety of applications. In this book, we focus mostly on query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate on the three general kinds of methods for handling incomplete data: (i) discarding the data with missing values, (ii) imputing the missing values, and (iii) relying only on the observed data values. For the third kind of method, we introduce the semantics of k-nearest neighbor (kNN) search, skyline queries, and top-k dominating queries on incomplete data. For these three representative queries, we investigate advanced processing techniques, including indexing, pruning, and crowdsourcing.
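
To make the three kinds of methods named in the description concrete, here is a minimal Python sketch. It is not taken from the book. The toy dataset, the function names, the use of mean imputation, and the choice of Euclidean distance over the dimensions observed in both objects are all assumptions made for illustration; the book's own definitions may differ.

```python
import math

# Hypothetical toy dataset: None marks a missing attribute value.
DATA = {
    "a": (1.0, 2.0, None),
    "b": (4.0, None, 6.0),
    "c": (2.0, 3.0, 4.0),
}

def discard_incomplete(data):
    """(i) Keep only the objects that have no missing values."""
    return {name: obj for name, obj in data.items() if None not in obj}

def impute_mean(data):
    """(ii) Fill each missing value with the mean of the observed values
    in the same dimension (assumes every dimension has at least one
    observed value)."""
    dims = len(next(iter(data.values())))
    means = []
    for d in range(dims):
        observed = [obj[d] for obj in data.values() if obj[d] is not None]
        means.append(sum(observed) / len(observed))
    return {
        name: tuple(means[d] if x is None else x for d, x in enumerate(obj))
        for name, obj in data.items()
    }

def observed_distance(p, q):
    """(iii) Euclidean distance restricted to the dimensions observed in
    both p and q; infinite if the two share no observed dimension."""
    shared = [(x, y) for x, y in zip(p, q) if x is not None and y is not None]
    if not shared:
        return math.inf
    return math.sqrt(sum((x - y) ** 2 for x, y in shared))

def knn(data, query, k):
    """Return the names of the k objects closest to query, using only
    observed values (illustrative kNN semantics, an assumption)."""
    return sorted(data, key=lambda name: observed_distance(data[name], query))[:k]
```

With this sketch, `knn(DATA, (1.0, 2.0, 3.0), 2)` ranks the objects without discarding or imputing anything, whereas `discard_incomplete(DATA)` keeps only object `c`.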

Query Processing over Incomplete Databases

Author: Yunjun Gao
Publisher: Morgan & Claypool Publishers
ISBN: 1681734214
Category: Computers
Languages: en
Pages: 124

Book Description
Incomplete data is a part of life and of almost all areas of scientific study. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is deliberately left incomplete in order to protect sensitive attribute values. Query processing is a fundamental problem in computer science and is useful in a variety of applications. In this book, we focus mostly on query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate on the three general kinds of methods for handling incomplete data: (i) discarding the data with missing values, (ii) imputing the missing values, and (iii) relying only on the observed data values. For the third kind of method, we introduce the semantics of k-nearest neighbor (kNN) search, skyline queries, and top-k dominating queries on incomplete data. For these three representative queries, we investigate advanced processing techniques, including indexing, pruning, and crowdsourcing.

Knowledge Graphs and Big Data Processing

Author: Valentina Janev
Publisher: Springer Nature
ISBN: 3030531996
Category: Computers
Languages: en
Pages: 212

Book Description
This open access book is part of the LAMBDA Project (Learning, Applying, Multiplying Big Data Analytics), funded by the European Union, GA No. 809965. Data Analytics involves applying algorithmic processes to derive insights. Nowadays it is used in many industries to allow organizations and companies to make better decisions as well as to verify or disprove existing theories or models. The term data analytics is often used interchangeably with intelligence, statistics, reasoning, data mining, knowledge discovery, and others. The goal of this book is to introduce some of the definitions, methods, tools, frameworks, and solutions for big data processing, starting from the process of information extraction and knowledge representation, via knowledge processing and analytics to visualization, sense-making, and practical applications. Each chapter in this book addresses some pertinent aspect of the data processing chain, with a specific focus on understanding Enterprise Knowledge Graphs, Semantic Big Data Architectures, and Smart Data Analytics solutions. This book is addressed to graduate students from technical disciplines, to professional audiences following continuous education short courses, and to researchers from diverse areas following self-study courses. Basic skills in computer science, mathematics, and statistics are required.

Database Systems for Advanced Applications

Author: Hwanjo Yu
Publisher: Springer
ISBN: 364229023X
Category: Computers
Languages: en
Pages: 357

Book Description
This book constitutes the workshop proceedings of the 17th International Conference on Database Systems for Advanced Applications, DASFAA 2012, held in Busan, South Korea, in April 2012. The volume contains five workshops, each focusing on a specific area that contributes to the main themes of the DASFAA conference: the Second International Workshop on Flash-based Database Systems (FlashDB 2012), the First International Workshop on Information Technologies for Maritime and Logistics (ITEMS 2012), the Third International Workshop on Social Networks and Social Media Mining on the Web (SNSMW 2012), the Second International Workshop on Spatial Information Modeling, Management and Mining (SIM3 2012), and the Fifth International Workshop on Data Quality in Integration Systems (DQIS 2012).

Advanced Database Systems For Integration Of Media And User Environments '98: Advanced Database Research

Author: Yahiko Kambayashi
Publisher: World Scientific
ISBN: 9814545031
Category:
Languages: en
Pages: 366

Book Description
This volume is a progress report on the project Research and Development of Advanced Database Systems for Integration of Media and User Environments, supported by the Ministry of Education, Science, Sports and Culture of Japan. It examines research on new database systems prompted by the recent development of network technology, and as a result draws a clearer picture of integration through database technology.

Non-Volatile Memory Database Management Systems

Author: Joy Arulraj
Publisher: Springer Nature
ISBN: 3031018680
Category: Computers
Languages: en
Pages: 173

Book Description
This book explores the implications of non-volatile memory (NVM) for database management systems (DBMSs). The advent of NVM will fundamentally change the dichotomy between volatile memory and durable storage in DBMSs. These new NVM devices are almost as fast as volatile memory, but all writes to them are persistent even after power loss. Existing DBMSs are unable to take full advantage of this technology because their internal architectures are predicated on the assumption that memory is volatile. With NVM, many of the components of legacy DBMSs are unnecessary and will degrade the performance of data-intensive applications. We present the design and implementation of DBMS architectures that are explicitly tailored for NVM. The book focuses on three aspects of a DBMS: (1) logging and recovery, (2) storage and buffer management, and (3) indexing. First, we present a logging and recovery protocol that enables the DBMS to support near-instantaneous recovery. Second, we propose a storage engine architecture and buffer management policy that leverage the durability and byte-addressability properties of NVM to reduce data duplication and data migration. Third, the book presents the design of a range index tailored for NVM that is latch-free yet simple to implement. Altogether, the work described in this book illustrates that rethinking the fundamental algorithms and data structures employed in a DBMS for NVM improves performance and availability, reduces operational cost, and simplifies software development.

Databases Theory and Applications

Author: Junhu Wang
Publisher: Springer
ISBN: 3319920138
Category: Computers
Languages: en
Pages: 374

Book Description
This book constitutes the refereed proceedings of the 29th Australasian Database Conference, ADC 2018, held in Gold Coast, QLD, Australia, in May 2018. The 23 full papers plus 6 short papers presented together with 3 demo papers were carefully reviewed and selected from 53 submissions. The Australasian Database Conference is an annual international forum for sharing the latest research advancements and novel applications of database systems, data-driven applications, and data analytics between researchers and practitioners from around the globe, particularly Australia and New Zealand.

Query Processing over Graph-structured Data on the Web

Author: M. Acosta Deibe
Publisher: IOS Press
ISBN: 1614999163
Category: Computers
Languages: en
Pages: 244

Book Description
In recent years, Linked Data initiatives have encouraged the publication of large graph-structured datasets using the Resource Description Framework (RDF). Due to the constant growth of RDF data on the web, more flexible data management infrastructures must be able to efficiently and effectively exploit the vast amount of knowledge accessible on the web. This book presents flexible query processing strategies over RDF graphs on the web using the SPARQL query language. In this work, we show how query engines can change plans on the fly with adaptive techniques to cope with unpredictable conditions and to reduce execution time. Furthermore, this work investigates the application of crowdsourcing in query processing, where engines are able to contact humans to enhance the quality of query answers. The theoretical and empirical results presented in this book indicate that flexible techniques allow for querying RDF data sources efficiently and effectively.

Database and Expert Systems Applications

Author: Sven Hartmann
Publisher: Springer Nature
ISBN: 3030590038
Category: Computers
Languages: en
Pages: 469

Book Description
The two-volume set LNCS 12391-12392 constitutes the papers of the 31st International Conference on Database and Expert Systems Applications, DEXA 2020, held online in September 2020. The 38 full papers presented together with 20 short papers plus 1 keynote paper in these volumes were carefully reviewed and selected from a total of 190 submissions.

Transactions on Large-Scale Data- and Knowledge-Centered Systems III

Author: Abdelkader Hameurlain
Publisher: Springer
ISBN: 3642230741
Category: Computers
Languages: en
Pages: 246

Book Description
The LNCS journal Transactions on Large-Scale Data- and Knowledge-Centered Systems focuses on data management, knowledge discovery, and knowledge processing, which are core and hot topics in computer science. Since the 1990s, the Internet has become the main driving force behind application development in all domains. An increase in the demand for resource sharing across different sites connected through networks has led to an evolution of data- and knowledge-management systems from centralized systems to decentralized systems enabling large-scale distributed applications providing high scalability. Current decentralized systems still focus on data and knowledge as their main resource. The feasibility of these systems relies mainly on P2P (peer-to-peer) techniques and the support of agent systems with scaling and decentralized control. Synergy between Grids, P2P systems, and agent technologies is the key to data- and knowledge-centered systems in large-scale environments. This, the third issue of Transactions on Large-Scale Data- and Knowledge-Centered Systems, contains two kinds of papers: first, a selection of the best papers from the Third International Conference on Data Management in Grid and Peer-to-Peer Systems, Globe 2010, and second, a selection of 6 papers from the 18 papers submitted in response to the call for papers for this issue. The topics covered by this special issue include replication, the semantic web, information retrieval, data storage, source selection, and large-scale distributed applications.