Provenance in Data Science

Provenance in Data Science PDF Author: Leslie F. Sikos
Publisher: Springer Nature
ISBN: 3030676811
Category : Computers
Languages : en
Pages : 110

Get Book Here

Book Description
RDF-based knowledge graphs require additional formalisms to be fully context-aware, which is presented in this book. This book also provides a collection of provenance techniques and state-of-the-art metadata-enhanced, provenance-aware, knowledge graph-based representations across multiple application domains, in order to demonstrate how to combine graph-based data models and provenance representations. This is important to make statements authoritative, verifiable, and reproducible, such as in biomedical, pharmaceutical, and cybersecurity applications, where the data source and generator can be just as important as the data itself. Capturing provenance is critical to ensure sound experimental results and rigorously designed research studies for patient and drug safety, pathology reports, and medical evidence generation. Similarly, provenance is needed for cyberthreat intelligence dashboards and attack maps that aggregate and/or fuse heterogeneous data from disparate data sources to differentiate between unimportant online events and dangerous cyberattacks, which is demonstrated in this book. Without provenance, data reliability and trustworthiness might be limited, causing data reuse, trust, reproducibility and accountability issues. This book primarily targets researchers who utilize knowledge graphs in their methods and approaches (this includes researchers from a variety of domains, such as cybersecurity, eHealth, data science, Semantic Web, etc.). This book collects core facts for the state of the art in provenance approaches and techniques, complemented by a critical review of existing approaches. New research directions are also provided that combine data science and knowledge graphs, for an increasingly important research topic.

Provenance in Data Science

Provenance in Data Science PDF Author: Leslie F. Sikos
Publisher: Springer Nature
ISBN: 3030676811
Category : Computers
Languages : en
Pages : 110

Get Book Here

Book Description
RDF-based knowledge graphs require additional formalisms to be fully context-aware, which is presented in this book. This book also provides a collection of provenance techniques and state-of-the-art metadata-enhanced, provenance-aware, knowledge graph-based representations across multiple application domains, in order to demonstrate how to combine graph-based data models and provenance representations. This is important to make statements authoritative, verifiable, and reproducible, such as in biomedical, pharmaceutical, and cybersecurity applications, where the data source and generator can be just as important as the data itself. Capturing provenance is critical to ensure sound experimental results and rigorously designed research studies for patient and drug safety, pathology reports, and medical evidence generation. Similarly, provenance is needed for cyberthreat intelligence dashboards and attack maps that aggregate and/or fuse heterogeneous data from disparate data sources to differentiate between unimportant online events and dangerous cyberattacks, which is demonstrated in this book. Without provenance, data reliability and trustworthiness might be limited, causing data reuse, trust, reproducibility and accountability issues. This book primarily targets researchers who utilize knowledge graphs in their methods and approaches (this includes researchers from a variety of domains, such as cybersecurity, eHealth, data science, Semantic Web, etc.). This book collects core facts for the state of the art in provenance approaches and techniques, complemented by a critical review of existing approaches. New research directions are also provided that combine data science and knowledge graphs, for an increasingly important research topic.

Encyclopedia of Database Systems

Encyclopedia of Database Systems PDF Author: Ling Liu
Publisher:
ISBN: 9781489979933
Category : Database management
Languages : en
Pages :

Get Book Here

Book Description


Towards Interoperable Research Infrastructures for Environmental and Earth Sciences

Towards Interoperable Research Infrastructures for Environmental and Earth Sciences PDF Author: Zhiming Zhao
Publisher: Springer Nature
ISBN: 3030528294
Category : Computers
Languages : en
Pages : 375

Get Book Here

Book Description
This open access book summarises the latest developments on data management in the EU H2020 ENVRIplus project, which brought together more than 20 environmental and Earth science research infrastructures into a single community. It provides readers with a systematic overview of the common challenges faced by research infrastructures and how a ‘reference model guided’ engineering approach can be used to achieve greater interoperability among such infrastructures in the environmental and earth sciences. The 20 contributions in this book are structured in 5 parts on the design, development, deployment, operation and use of research infrastructures. Part one provides an overview of the state of the art of research infrastructure and relevant e-Infrastructure technologies, part two discusses the reference model guided engineering approach, the third part presents the software and tools developed for common data management challenges, the fourth part demonstrates the software via several use cases, and the last part discusses the sustainability and future directions.

Encyclopedia of Big Data

Encyclopedia of Big Data PDF Author: Laurie A. Schintler
Publisher: Springer
ISBN: 9783319320090
Category : Business & Economics
Languages : en
Pages : 0

Get Book Here

Book Description
This encyclopedia will be an essential resource for our times, reflecting the fact that we currently are living in an expanding data-driven world. Technological advancements and other related trends are contributing to the production of an astoundingly large and exponentially increasing collection of data and information, referred to in popular vernacular as “Big Data.” Social media and crowdsourcing platforms and various applications ― “apps” ― are producing reams of information from the instantaneous transactions and input of millions and millions of people around the globe. The Internet-of-Things (IoT), which is expected to comprise tens of billions of objects by the end of this decade, is actively sensing real-time intelligence on nearly every aspect of our lives and environment. The Global Positioning System (GPS) and other location-aware technologies are producing data that is specific down to particular latitude and longitude coordinates and seconds of the day. Large-scale instruments, such as the Large Hadron Collider (LHC), are collecting massive amounts of data on our planet and even distant corners of the visible universe. Digitization is being used to convert large collections of documents from print to digital format, giving rise to large archives of unstructured data. Innovations in technology, in the areas of Cloud and molecular computing, Artificial Intelligence/Machine Learning, and Natural Language Processing (NLP), to name only a few, also are greatly expanding our capacity to store, manage, and process Big Data. In this context, the Encyclopedia of Big Data is being offered in recognition of a world that is rapidly moving from gigabytes to terabytes to petabytes and beyond. While indeed large data sets have long been around and in use in a variety of fields, the era of Big Data in which we now live departs from the past in a number of key respects and with this departure comes a fresh set of challenges and opportunities that cut across and affect multiple sectors and disciplines, and the public at large. With expanded analytical capacities at hand, Big Data is now being used for scientific inquiry and experimentation in nearly every (if not all) disciplines, from the social sciences to the humanities to the natural sciences, and more. Moreover, the use of Big Data has been well established beyond the Ivory Tower. In today’s economy, businesses simply cannot be competitive without engaging Big Data in one way or another in support of operations, management, planning, or simply basic hiring decisions. In all levels of government, Big Data is being used to engage citizens and to guide policy making in pursuit of the interests of the public and society in general. Moreover, the changing nature of Big Data also raises new issues and concerns related to, for example, privacy, liability, security, access, and even the veracity of the data itself. Given the complex issues attending Big Data, there is a real need for a reference book that covers the subject from a multi-disciplinary, cross-sectoral, comprehensive, and international perspective. The Encyclopedia of Big Data will address this need and will be the first of such reference books to do so. Featuring some 500 entries, from "Access" to "Zillow," the Encyclopedia will serve as a fundamental resource for researchers and students, for decision makers and leaders, and for business analysts and purveyors. Developed for those in academia, industry, and government, and others with a general interest in Big Data, the encyclopedia will be aimed especially at those involved in its collection, analysis, and use. Ultimately, the Encyclopedia of Big Data will provide a common platform and language covering the breadth and depth of the topic for different segments, sectors, and disciplines.

Provenance and Annotation of Data and Processes

Provenance and Annotation of Data and Processes PDF Author: Juliana Freire
Publisher: Springer Science & Business Media
ISBN: 3540899642
Category : Business & Economics
Languages : en
Pages : 339

Get Book Here

Book Description
This book constitutes the thoroughly refereed post-conference proceedings of the Second International Provenance and Annotation Workshop, IPAW 2008, held in Salt Lake City, UT, USA, in June 2007. The 14 revised full papers and 15 revised short and demo papers presented together with 2 keynote lectures were carefully reviewed and selected from 40 submissions. The paper are organized in topical sections on provenance: models and querying; provenance: visualization, failures, identity; provenance and workflows; provenance for streams and collaboration; and applications.

The Foundations for Provenance on the Web

The Foundations for Provenance on the Web PDF Author: Luc Moreau
Publisher: Now Publishers Inc
ISBN: 1601983867
Category : Computers
Languages : en
Pages : 159

Get Book Here

Book Description
Provenance, i.e., the origin or source of something, is becoming an important concern, since it offers the means to verify data products, to infer their quality, and to decide whether they can be trusted. For instance, provenance enables the reproducibility of scientific results; provenance is necessary to track attribution and credit in curated databases; and, it is essential for reasoners to make trust judgements about the information they use over the Semantic Web. As the Web allows information sharing, discovery, aggregation, filtering and flow in an unprecedented manner, it also becomes difficult to identify the original source that produced information on the Web. This survey contends that provenance can and should reliably be tracked and exploited on the Web, and investigates the necessary foundations to achieve such a vision.

Principles of Data Integration

Principles of Data Integration PDF Author: AnHai Doan
Publisher: Elsevier
ISBN: 0123914795
Category : Computers
Languages : en
Pages : 522

Get Book Here

Book Description
Principles of Data Integration is the first comprehensive textbook of data integration, covering theoretical principles and implementation issues as well as current challenges raised by the semantic web and cloud computing. The book offers a range of data integration solutions enabling you to focus on what is most relevant to the problem at hand. Readers will also learn how to build their own algorithms and implement their own data integration application. Written by three of the most respected experts in the field, this book provides an extensive introduction to the theory and concepts underlying today's data integration techniques, with detailed, instruction for their application using concrete examples throughout to explain the concepts. This text is an ideal resource for database practitioners in industry, including data warehouse engineers, database system designers, data architects/enterprise architects, database researchers, statisticians, and data analysts; students in data analytics and knowledge discovery; and other data professionals working at the R&D and implementation levels. - Offers a range of data integration solutions enabling you to focus on what is most relevant to the problem at hand - Enables you to build your own algorithms and implement your own data integration applications

Active Conceptual Modeling of Learning

Active Conceptual Modeling of Learning PDF Author: Peter P. Chen
Publisher: Springer
ISBN: 354077503X
Category : Computers
Languages : en
Pages : 234

Get Book Here

Book Description
This volume is a collection of papers presented during the first International ACM-L Workshop, which was held in Tucson, Arizona, during the 25th International Conference on Conceptual Modeling, ER 2006. Included in this state-of-the-art survey are 11 revised full papers, carefully reviewed and selected from the workshop presentations. These are rounded off with four invited lectures and an introductory overview, and represent the current thinking in conceptual modeling research.

Provenance in Databases

Provenance in Databases PDF Author: James Cheney
Publisher: Now Publishers Inc
ISBN: 1601982321
Category : Computers
Languages : en
Pages : 111

Get Book Here

Book Description
Reviews research over the past ten years on why, how, and where provenance, clarifies the relationships among these notions of provenance, and describes some of their applications in confidence computation, view maintenance and update, debugging, and annotation propagation

Provenance and Annotation of Data and Processes

Provenance and Annotation of Data and Processes PDF Author: Khalid Belhajjame
Publisher: Springer
ISBN: 3319983792
Category : Computers
Languages : en
Pages : 272

Get Book Here

Book Description
This book constitutes the refereed proceedings of the 7th International Provenance and Annotation Workshop, IPAW 2018, held in London, UK, in July 2018. The 12 revised full papers, 19 poster papers, and 2 demonstration papers presented were carefully reviewed and selected from 50 submissions. The papers feature a variety of provenance-related topics ranging from the capture and inference of provenance to its use and application.They are organized in topical sections on reproducibility; modeling, simulating and capturing provenance; PROV extensions; scientific workflows; applications; and system demonstrations.