Interoperable Query Processing Among Heterogeneous Databases

Interoperable Query Processing Among Heterogeneous Databases PDF Author: Yahui Chang
Publisher:
ISBN:
Category : Database management
Languages : en
Pages : 36

Get Book Here

Book Description
Abstract: "The proliferation of database systems based on different data models and query languages has created the need for techniques that support interoperability so that information may be shared among the database systems. This dissertation proposal describes a technique to support interoperable query processing when multiple heterogeneous databases are accessed. We will focus on the problem of supporting query transformation transparently, so a user can pose queries locally, without any need of global knowledge about different data models and schema. To support interoperable query transformation, we need to resolve the conflicts (i.e., heterogeneities) between different databases. The conflicts exist because each database has been designed and populated independently of one another, so semantically equivalent concepts may have been defined in many different ways in each model/schema. Furthermore, we also need to consider the different query language utilized by each database to provide interoperability. We propose two kinds of parameterized canonical representations as a means of classifying and resolving heterogeneities. The first canonical form resolves heterogeneity based on the query language. The second canonical form resolves representation heterogeneity based on using two different schema. We initially focus on object and relational schema. The query transformation mechanism can operate on these canonical forms to produce the proper query for a heterogeneous remote database. We will describe an architecture for supporting interoperability and certain functional modules. The first module described is an extractor module (EM) which parses a source query and extracts semantics, corresponding to some primitive expressions in the query which are represented using a canonical form. The second is a heterogeneous mapping module (HTM) which maps among entities in different schema; this module uses mapping rules which reflect global knowledge of the models, schema, and query languages. We use F-logic, a high-level logic representation, to represent the global dictionary knowledge in canonical form. We wil provide some examples of transforming queries from a relational schema to an equivalent object schema. This proposal outlines several topics that need to be studied in order to solve the problems described above. The first area of investigation is the extension of the current canonical representation for source and target (transformed) queries to cover more constructs in the SQL and XSQL query languages. A second area is the extension of the canonical representation for mapping information to cover further solutions for schema conflict, in particular, conflicts among several object schemas. We describe our current prototype implementation which supports the EM and HTM modules for transforming an XSQL query to an SQL query (and vice versa). We expect to extend the functionality of these modules as we progress in our outlined research."

Interoperable Query Processing Among Heterogeneous Databases

Interoperable Query Processing Among Heterogeneous Databases PDF Author: Yahui Chang
Publisher:
ISBN:
Category : Database management
Languages : en
Pages : 36

Get Book Here

Book Description
Abstract: "The proliferation of database systems based on different data models and query languages has created the need for techniques that support interoperability so that information may be shared among the database systems. This dissertation proposal describes a technique to support interoperable query processing when multiple heterogeneous databases are accessed. We will focus on the problem of supporting query transformation transparently, so a user can pose queries locally, without any need of global knowledge about different data models and schema. To support interoperable query transformation, we need to resolve the conflicts (i.e., heterogeneities) between different databases. The conflicts exist because each database has been designed and populated independently of one another, so semantically equivalent concepts may have been defined in many different ways in each model/schema. Furthermore, we also need to consider the different query language utilized by each database to provide interoperability. We propose two kinds of parameterized canonical representations as a means of classifying and resolving heterogeneities. The first canonical form resolves heterogeneity based on the query language. The second canonical form resolves representation heterogeneity based on using two different schema. We initially focus on object and relational schema. The query transformation mechanism can operate on these canonical forms to produce the proper query for a heterogeneous remote database. We will describe an architecture for supporting interoperability and certain functional modules. The first module described is an extractor module (EM) which parses a source query and extracts semantics, corresponding to some primitive expressions in the query which are represented using a canonical form. The second is a heterogeneous mapping module (HTM) which maps among entities in different schema; this module uses mapping rules which reflect global knowledge of the models, schema, and query languages. We use F-logic, a high-level logic representation, to represent the global dictionary knowledge in canonical form. We wil provide some examples of transforming queries from a relational schema to an equivalent object schema. This proposal outlines several topics that need to be studied in order to solve the problems described above. The first area of investigation is the extension of the current canonical representation for source and target (transformed) queries to cover more constructs in the SQL and XSQL query languages. A second area is the extension of the canonical representation for mapping information to cover further solutions for schema conflict, in particular, conflicts among several object schemas. We describe our current prototype implementation which supports the EM and HTM modules for transforming an XSQL query to an SQL query (and vice versa). We expect to extend the functionality of these modules as we progress in our outlined research."

Interconnecting Heterogeneous Information Systems

Interconnecting Heterogeneous Information Systems PDF Author: Athman Bouguettaya
Publisher: Springer Science & Business Media
ISBN: 1461555671
Category : Business & Economics
Languages : en
Pages : 229

Get Book Here

Book Description
Information systems are the backbone of many of today's computerized applications. Distributed databases and the infrastructure needed to support them have been well studied. However, this book is the first to address distributed database interoperability by examining the successes and failures, various approaches, infrastructures, and trends of the field. A gap exists in the way that these systems have been investigated by real practitioners. This gap is more pronounced than usual, partly because of the way businesses operate, the systems they have, and the difficulties created by systems' autonomy and heterogeneity. Telecommunications firms, for example, must deal with an increased demand for automation while at the same time continuing to function at their current level. While academics are focusing on investigating differences between distributed databases, federated databases, heterogeneous databases, and, more generally, among loosely connected and tightly coupled systems, those who have to deal with real problems right away know that the only relevant research is the one that will ensure that their system works to produce reasonably correct results. Interconnecting Heterogeneous Information Systems covers the underlying principles and infrastructures needed to realize truly global information systems. The book discusses technologies related to middleware, the Web, workflows, transactions, and data warehousing. It also overviews architectures with a discussion of critical issues. The book gives an overview of systems that can be viewed as learning platforms. While these systems do not translate to successful commercial realities, they push the envelope in terms of research. Successful commercial systems have benefited from the experiments conducted in these prototypes. The book includes two case studies based on the authors' own work. Interconnecting Heterogeneous Information Systems is suitable as a textbook for a graduate-level course on Interconnecting Heterogeneous Information Systems, as well as a secondary text for a graduate-level course on database or information systems, and as a reference for researchers and practitioners in industry.

Interoperable Query Processing for Heterogeneous Applications in a Mixed Object and Relational Database Environment

Interoperable Query Processing for Heterogeneous Applications in a Mixed Object and Relational Database Environment PDF Author: Ho-Chuan Huang
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description


Data Management Systems

Data Management Systems PDF Author: Bhavani Thuraisingham
Publisher: CRC Press
ISBN: 9780849394935
Category : Computers
Languages : en
Pages : 276

Get Book Here

Book Description
As the information contained in databases has become a critical resource in organizations, efficient access to that information and the ability to share it among different users and across different systems has become an urgent need. The interoperability of heterogeneous database systems-literally, the ability to access information between or among differing types of databases, is the topic of this timely book. In the last two decades, tremendous improvements in tools and technologies have resulted in new products that provide distributed data processing capabilities. This book describes these tools and emerging technologies, explaining the essential concepts behind the topics but focusing on practical applications. Selected products are discussed to illustrate the characteristics of the different technologies. This is an ideal source for anyone who needs a broad perspective on heterogeneous database integration and related technologies.

Heterogeneous Information Exchange and Organizational Hubs

Heterogeneous Information Exchange and Organizational Hubs PDF Author: H. Bestougeff
Publisher: Springer Science & Business Media
ISBN: 9401717699
Category : Computers
Languages : en
Pages : 253

Get Book Here

Book Description
Helene Bestougeff, Universite de Marne Ia Vallee, France Jacques-Emile Dubois, Universite Paris VII-Denis Diderot, France Bhavani Thuraisingham, MITRE Corporation, USA The last fifty years promoted the conceptual trio: Knowledge, Information and Data (KID) to the center of our present scientific technological and human activities. The intrusion of the Internet drastically modified the historical cycles of communication between authors, providers and users. Today, information is often the result of the interaction between data and the knowledge based on their comprehension, interpretation and prediction. Nowadays important goals involve the exchange of heterogeneous information, as many real life and even specific scientific and technological problems are all interdisciplinary by nature. For a specific project, this signifies extracting information, data and even knowledge from many different sources that must be addressed by interoperable programs. Another important challenge is that of corporations collaborating with each other and forming coalitions and partnerships. One development towards achieving this challenge is organizational hubs. This concept is new and still evolving. Much like an airport hub serving air traffic needs, organizational hubs are central platforms that provide information and collaboration specific to a group of users' needs. Now companies are creating hubs particular to certain types of industries. The users of hubs are seen as communities for which all related information is directly available without further searching efforts and often with value-added services.

Ad Hoc Integration and Querying of Heterogeneous Online Distributed Databases

Ad Hoc Integration and Querying of Heterogeneous Online Distributed Databases PDF Author: Liangyou Chen
Publisher:
ISBN:
Category : Databases
Languages : en
Pages :

Get Book Here

Book Description
This dissertation provides an ad hoc integration methodology to manage and integrate heterogeneous online distributed databases on demand. The problem arises from an impending demand from scientific users to conveniently manage existing Web data along with the complexity involved in the construction of a functional data federation system using existing data integration technologies. We close this gap with a databases management framework accompanying novel Web data specification languages, wrapper generation technologies, and distributed query processing techniques. A major achievement of this dissertation is the establishment of a sound relational data model for Web data. Under this model, the Web becomes a synthetic extension of the traditional database systems. Consequently, a novice user of our system can cheaply integrate a large number of distributed Web sources with in-house databases for daily scientific data analysis purpose. The relational Web modeling leads to a practical ad hoc integration system - the Meteoroid system (a MEthodology for ad hoc inTEgration of Online distributed heteROgeneous Internet Data) - in the context of biological data interoperability. We identify that a main difficulty for ad hoc integration lies in the lack of a fully automated wrapper generation and maintenance technique for general semi-structured data such as HTML, XML and plain text documents. We address this issue through a thorough study of characteristics of online Web data and devise various automated wrapper techniques to facilitate robust data wrapping tasks. With this technique, form-based Web data and table-based Web data can be treated like traditional relational databases. A seamless interoperation environment for Web data and in-house databases is possible. Another difficulty impeding ad hoc integration is in the query processing for heterogeneous distributed sources, where conflict of data is common and on demand mediation of distributed sources is desirable. The dynamicity and unpredictability of Web data further complicate the query processing task. We studied limitations posed by the Web environment for integration query processing and developed innovative techniques to expedite the early appearance of available results. Finally we demonstrate a prototype system for ad hoc integration of heterogeneous biological data. In the system, visual Web-based interfaces guide the integration of heterogeneous data for novice users. A declarative environment is supported for ad hoc querying and management of distributed data sources.

Advanced Query Processing

Advanced Query Processing PDF Author: Barbara Catania
Publisher: Springer Science & Business Media
ISBN: 3642283233
Category : Technology & Engineering
Languages : en
Pages : 355

Get Book Here

Book Description
This research book presents key developments, directions, and challenges concerning advanced query processing for both traditional and non-traditional data. A special emphasis is devoted to approximation and adaptivity issues as well as to the integration of heterogeneous data sources. The book will prove useful as a reference book for senior undergraduate or graduate courses on advanced data management issues, which have a special focus on query processing and data integration. It is aimed for technologists, managers, and developers who want to know more about emerging trends in advanced query processing.

Query Processing on Heterogeneous Database Systems

Query Processing on Heterogeneous Database Systems PDF Author: Teri Moore
Publisher:
ISBN:
Category : Database management
Languages : en
Pages : 192

Get Book Here

Book Description


Query Processing over Incomplete Databases

Query Processing over Incomplete Databases PDF Author: Yunjun Gao
Publisher: Morgan & Claypool Publishers
ISBN: 1681734214
Category : Computers
Languages : en
Pages : 124

Get Book Here

Book Description
Incomplete data is part of life and almost all areas of scientific studies. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is incomplete deliberately in order to preserve the sensitivity of some attribute values. Query processing is a fundamental problem in computer science, and is useful in a variety of applications. In this book, we mostly focus on the query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate the three general kinds of methods of handling incomplete data, including (i) discarding the data with missing values, (ii) imputation for the missing values, and (iii) just depending on the observed data values. For the third method type, we introduce the semantics of k-nearest neighbor (kNN) search, skyline query, and top-k dominating query on incomplete data, respectively. In terms of the three representative queries over incomplete data, we investigate some advanced techniques to process incomplete data queries, including indexing, pruning as well as crowdsourcing techniques.

Interoperable Database Systems (DS-5)

Interoperable Database Systems (DS-5) PDF Author: D.K. Hsiao
Publisher: Elsevier
ISBN: 1483298477
Category : Computers
Languages : en
Pages : 368

Get Book Here

Book Description
The proliferation of databases within organizations have made it imperative to allow effective sharing of information from these disparate database systems. In addition, it is desirable that the individual systems must maintain a certain degree of autonomy over their data in order to continue to provide for their existing applications and to support controlled access to their information. Thus it becomes necessary to develop new techniques and build new functionality to interoperate these autonomous database systems and to integrate them into an overall information system. Research into interoperable database systems has advanced substantially over recent years in response to this need. The papers presented in this volume cover a wide spectrum of both theoretical and pragmatic issues related to the semantics of interoperable database systems. Topics covered include techniques to support the translation between database schema and between database languages; object oriented frameworks for supporting interoperability of heterogeneous databases, knowledge base integration and techniques for overcoming schematic discrepancies in interoperable databases. In addition, there are papers addressing issues of security transaction processing, data modelling and object identification in interoperable database systems. It is hoped the publication will represent a valuable collective contribution to research and development in the field for database researchers, implementors, designers, application builders and users alike.