Author: Matthias Templ
Publisher: Springer Nature
ISBN: 3031300734
Category : Mathematics
Languages : en
Pages : 478
Book Description
This book explores visualization and imputation techniques for missing values and presents practical applications using the statistical software R. It explains the concepts of common imputation methods with a focus on visualization, description of data problems and practical solutions using R, including modern methods of robust imputation, imputation based on deep learning and imputation for complex data. By describing the advantages, disadvantages and pitfalls of each method, the book presents a clear picture of which imputation methods are applicable given a specific data set at hand. The material covered includes the pre-analysis of data, visualization of missing values in incomplete data, single and multiple imputation, deductive imputation and outlier replacement, model-based methods including methods based on robust estimates, non-linear methods such as tree-based and deep learning methods, imputation of compositional data, imputation quality evaluation from visual diagnostics to precision measures, coverage rates and prediction performance and a description of different model- and design-based simulation designs for the evaluation. The book also features a topic-focused introduction to R and R code is provided in each chapter to explain the practical application of the described methodology. Addressed to researchers, practitioners and students who work with incomplete data, the book offers an introduction to the subject as well as a discussion of recent developments in the field. It is suitable for beginners to the topic and advanced readers alike.
Visualization and Imputation of Missing Values
Author: Matthias Templ
Publisher: Springer Nature
ISBN: 3031300734
Category : Mathematics
Languages : en
Pages : 478
Book Description
This book explores visualization and imputation techniques for missing values and presents practical applications using the statistical software R. It explains the concepts of common imputation methods with a focus on visualization, description of data problems and practical solutions using R, including modern methods of robust imputation, imputation based on deep learning and imputation for complex data. By describing the advantages, disadvantages and pitfalls of each method, the book presents a clear picture of which imputation methods are applicable given a specific data set at hand. The material covered includes the pre-analysis of data, visualization of missing values in incomplete data, single and multiple imputation, deductive imputation and outlier replacement, model-based methods including methods based on robust estimates, non-linear methods such as tree-based and deep learning methods, imputation of compositional data, imputation quality evaluation from visual diagnostics to precision measures, coverage rates and prediction performance and a description of different model- and design-based simulation designs for the evaluation. The book also features a topic-focused introduction to R and R code is provided in each chapter to explain the practical application of the described methodology. Addressed to researchers, practitioners and students who work with incomplete data, the book offers an introduction to the subject as well as a discussion of recent developments in the field. It is suitable for beginners to the topic and advanced readers alike.
Publisher: Springer Nature
ISBN: 3031300734
Category : Mathematics
Languages : en
Pages : 478
Book Description
This book explores visualization and imputation techniques for missing values and presents practical applications using the statistical software R. It explains the concepts of common imputation methods with a focus on visualization, description of data problems and practical solutions using R, including modern methods of robust imputation, imputation based on deep learning and imputation for complex data. By describing the advantages, disadvantages and pitfalls of each method, the book presents a clear picture of which imputation methods are applicable given a specific data set at hand. The material covered includes the pre-analysis of data, visualization of missing values in incomplete data, single and multiple imputation, deductive imputation and outlier replacement, model-based methods including methods based on robust estimates, non-linear methods such as tree-based and deep learning methods, imputation of compositional data, imputation quality evaluation from visual diagnostics to precision measures, coverage rates and prediction performance and a description of different model- and design-based simulation designs for the evaluation. The book also features a topic-focused introduction to R and R code is provided in each chapter to explain the practical application of the described methodology. Addressed to researchers, practitioners and students who work with incomplete data, the book offers an introduction to the subject as well as a discussion of recent developments in the field. It is suitable for beginners to the topic and advanced readers alike.
Flexible Imputation of Missing Data, Second Edition
Author: Stef van Buuren
Publisher: CRC Press
ISBN: 0429960352
Category : Mathematics
Languages : en
Pages : 444
Book Description
Missing data pose challenges to real-life data analysis. Simple ad-hoc fixes, like deletion or mean imputation, only work under highly restrictive conditions, which are often not met in practice. Multiple imputation replaces each missing value by multiple plausible values. The variability between these replacements reflects our ignorance of the true (but missing) value. Each of the completed data set is then analyzed by standard methods, and the results are pooled to obtain unbiased estimates with correct confidence intervals. Multiple imputation is a general approach that also inspires novel solutions to old problems by reformulating the task at hand as a missing-data problem. This is the second edition of a popular book on multiple imputation, focused on explaining the application of methods through detailed worked examples using the MICE package as developed by the author. This new edition incorporates the recent developments in this fast-moving field. This class-tested book avoids mathematical and technical details as much as possible: formulas are accompanied by verbal statements that explain the formula in accessible terms. The book sharpens the reader’s intuition on how to think about missing data, and provides all the tools needed to execute a well-grounded quantitative analysis in the presence of missing data.
Publisher: CRC Press
ISBN: 0429960352
Category : Mathematics
Languages : en
Pages : 444
Book Description
Missing data pose challenges to real-life data analysis. Simple ad-hoc fixes, like deletion or mean imputation, only work under highly restrictive conditions, which are often not met in practice. Multiple imputation replaces each missing value by multiple plausible values. The variability between these replacements reflects our ignorance of the true (but missing) value. Each of the completed data set is then analyzed by standard methods, and the results are pooled to obtain unbiased estimates with correct confidence intervals. Multiple imputation is a general approach that also inspires novel solutions to old problems by reformulating the task at hand as a missing-data problem. This is the second edition of a popular book on multiple imputation, focused on explaining the application of methods through detailed worked examples using the MICE package as developed by the author. This new edition incorporates the recent developments in this fast-moving field. This class-tested book avoids mathematical and technical details as much as possible: formulas are accompanied by verbal statements that explain the formula in accessible terms. The book sharpens the reader’s intuition on how to think about missing data, and provides all the tools needed to execute a well-grounded quantitative analysis in the presence of missing data.
Feature Engineering and Selection
Author: Max Kuhn
Publisher: CRC Press
ISBN: 1351609467
Category : Business & Economics
Languages : en
Pages : 266
Book Description
The process of developing predictive models includes many stages. Most resources focus on the modeling algorithms but neglect other critical aspects of the modeling process. This book describes techniques for finding the best representations of predictors for modeling and for nding the best subset of predictors for improving model performance. A variety of example data sets are used to illustrate the techniques along with R programs for reproducing the results.
Publisher: CRC Press
ISBN: 1351609467
Category : Business & Economics
Languages : en
Pages : 266
Book Description
The process of developing predictive models includes many stages. Most resources focus on the modeling algorithms but neglect other critical aspects of the modeling process. This book describes techniques for finding the best representations of predictors for modeling and for nding the best subset of predictors for improving model performance. A variety of example data sets are used to illustrate the techniques along with R programs for reproducing the results.
Large-scale Numerical Optimization
Author: Thomas Frederick Coleman
Publisher: SIAM
ISBN: 9780898712681
Category : Mathematics
Languages : en
Pages : 278
Book Description
Papers from a workshop held at Cornell University, Oct. 1989, and sponsored by Cornell's Mathematical Sciences Institute. Annotation copyright Book News, Inc. Portland, Or.
Publisher: SIAM
ISBN: 9780898712681
Category : Mathematics
Languages : en
Pages : 278
Book Description
Papers from a workshop held at Cornell University, Oct. 1989, and sponsored by Cornell's Mathematical Sciences Institute. Annotation copyright Book News, Inc. Portland, Or.
Qlik Sense: Advanced Data Visualization for Your Organization
Author: Ferran Garcia Pagans
Publisher: Packt Publishing Ltd
ISBN: 1788998723
Category : Computers
Languages : en
Pages : 765
Book Description
Perform Interactive Data Analysis with Smarter Visualizations and Support your Enterprise-wide Analytical Needs Key Features Get a practical demonstration of discovering data for sales, human resources, and more using Qlik Sense Create dynamic dashboards for business intelligence and predictive analytics Create and collaborate comprehensive analytical solutions using Rattle and Qlik Sense Book Description Qlik Sense is powerful and creative visual analytics software that allows users to discover data, explore it, and dig out meaningful insights in order to make a profit and make decisions for your business. This course begins by introducing you to the features and functions of the most modern edition of Qlik Sense so you get to grips with the application. The course will teach you how to administer the data architecture in Qlik Sense, enabling you to customize your own Qlik Sense application for your business intelligence needs. It also contains numerous recipes to help you overcome challenging situations while creating fully featured desktop applications in Qlik Sense. It explains how to combine Rattle and Qlik Sense Desktop to apply predictive analytics to your data to develop real-world interactive data applications. The course includes premium content from three of our most popular books: [*] Learning Qlik Sense: The Official Guide Second Edition [*] Qlik Sense Cookbook [*] Predictive Analytics using Rattle and Qlik Sense On completion of this course, you will be self-sufficient in improving your data analysis and will know how to apply predictive analytics to your datasets. Through this course, you will be able to create predictive models and data applications, allowing you to explore your data insights much deeper. What you will learn Build simple visualization models with Rattle and Qlik Sense Desktop Get to grips with the life cycle and new visualization functions of a Qlik Sense application Discover simple ways to examine data and get it ready for analysis Visualize your data with Qlik Sense's engaging and informative graphs Build efficient and responsive Associative Models Optimize Qlik Sense for sales, human resources, and demographic data discovery Explore various tips and tricks of navigation for the Qlik Sense® front end Develop creative extensions for your Qlik Sense® dashboard Who this book is for This course is for anyone who wishes to understand and utilize the various new approaches to business intelligence actively in their business practice. Knowing the basics of business intelligence concepts would be helpful when picking up this course, but is not mandatory.
Publisher: Packt Publishing Ltd
ISBN: 1788998723
Category : Computers
Languages : en
Pages : 765
Book Description
Perform Interactive Data Analysis with Smarter Visualizations and Support your Enterprise-wide Analytical Needs Key Features Get a practical demonstration of discovering data for sales, human resources, and more using Qlik Sense Create dynamic dashboards for business intelligence and predictive analytics Create and collaborate comprehensive analytical solutions using Rattle and Qlik Sense Book Description Qlik Sense is powerful and creative visual analytics software that allows users to discover data, explore it, and dig out meaningful insights in order to make a profit and make decisions for your business. This course begins by introducing you to the features and functions of the most modern edition of Qlik Sense so you get to grips with the application. The course will teach you how to administer the data architecture in Qlik Sense, enabling you to customize your own Qlik Sense application for your business intelligence needs. It also contains numerous recipes to help you overcome challenging situations while creating fully featured desktop applications in Qlik Sense. It explains how to combine Rattle and Qlik Sense Desktop to apply predictive analytics to your data to develop real-world interactive data applications. The course includes premium content from three of our most popular books: [*] Learning Qlik Sense: The Official Guide Second Edition [*] Qlik Sense Cookbook [*] Predictive Analytics using Rattle and Qlik Sense On completion of this course, you will be self-sufficient in improving your data analysis and will know how to apply predictive analytics to your datasets. Through this course, you will be able to create predictive models and data applications, allowing you to explore your data insights much deeper. What you will learn Build simple visualization models with Rattle and Qlik Sense Desktop Get to grips with the life cycle and new visualization functions of a Qlik Sense application Discover simple ways to examine data and get it ready for analysis Visualize your data with Qlik Sense's engaging and informative graphs Build efficient and responsive Associative Models Optimize Qlik Sense for sales, human resources, and demographic data discovery Explore various tips and tricks of navigation for the Qlik Sense® front end Develop creative extensions for your Qlik Sense® dashboard Who this book is for This course is for anyone who wishes to understand and utilize the various new approaches to business intelligence actively in their business practice. Knowing the basics of business intelligence concepts would be helpful when picking up this course, but is not mandatory.
Interactive and Dynamic Graphics for Data Analysis
Author: Dianne Cook
Publisher: Springer Science & Business Media
ISBN: 0387717617
Category : Computers
Languages : en
Pages : 202
Book Description
This book is about using interactive and dynamic plots on a computer screen as part of data exploration and modeling, both alone and as a partner with static graphics and non-graphical computational methods. The area of int- active and dynamic data visualization emerged within statistics as part of research on exploratory data analysis in the late 1960s, and it remains an active subject of research today, as its use in practice continues to grow. It now makes substantial contributions within computer science as well, as part of the growing ?elds of information visualization and data mining, especially visual data mining. The material in this book includes: • An introduction to data visualization, explaining how it di?ers from other types of visualization. • Adescriptionofourtoolboxofinteractiveanddynamicgraphicalmethods. • An approach for exploring missing values in data. • An explanation of the use of these tools in cluster analysis and supervised classi?cation. • An overview of additional material available on the web. • A description of the data used in the analyses and exercises. The book’s examples use the software R and GGobi. R (Ihaka & Gent- man 1996, RDevelopment CoreTeam2006) isafreesoftware environment for statistical computing and graphics; it is most often used from the command line, provides a wide variety of statistical methods, and includes high–quality staticgraphics.RaroseintheStatisticsDepartmentoftheUniversityofAu- land and is now developed and maintained by a global collaborative e?ort.
Publisher: Springer Science & Business Media
ISBN: 0387717617
Category : Computers
Languages : en
Pages : 202
Book Description
This book is about using interactive and dynamic plots on a computer screen as part of data exploration and modeling, both alone and as a partner with static graphics and non-graphical computational methods. The area of int- active and dynamic data visualization emerged within statistics as part of research on exploratory data analysis in the late 1960s, and it remains an active subject of research today, as its use in practice continues to grow. It now makes substantial contributions within computer science as well, as part of the growing ?elds of information visualization and data mining, especially visual data mining. The material in this book includes: • An introduction to data visualization, explaining how it di?ers from other types of visualization. • Adescriptionofourtoolboxofinteractiveanddynamicgraphicalmethods. • An approach for exploring missing values in data. • An explanation of the use of these tools in cluster analysis and supervised classi?cation. • An overview of additional material available on the web. • A description of the data used in the analyses and exercises. The book’s examples use the software R and GGobi. R (Ihaka & Gent- man 1996, RDevelopment CoreTeam2006) isafreesoftware environment for statistical computing and graphics; it is most often used from the command line, provides a wide variety of statistical methods, and includes high–quality staticgraphics.RaroseintheStatisticsDepartmentoftheUniversityofAu- land and is now developed and maintained by a global collaborative e?ort.
Interactive Data Visualization with Python
Author: Abha Belorkar
Publisher: Packt Publishing Ltd
ISBN: 1800201060
Category : Computers
Languages : en
Pages : 362
Book Description
Create your own clear and impactful interactive data visualizations with the powerful data visualization libraries of Python Key FeaturesStudy and use Python interactive libraries, such as Bokeh and PlotlyExplore different visualization principles and understand when to use which oneCreate interactive data visualizations with real-world dataBook Description With so much data being continuously generated, developers, who can present data as impactful and interesting visualizations, are always in demand. Interactive Data Visualization with Python sharpens your data exploration skills, tells you everything there is to know about interactive data visualization in Python. You'll begin by learning how to draw various plots with Matplotlib and Seaborn, the non-interactive data visualization libraries. You'll study different types of visualizations, compare them, and find out how to select a particular type of visualization to suit your requirements. After you get a hang of the various non-interactive visualization libraries, you'll learn the principles of intuitive and persuasive data visualization, and use Bokeh and Plotly to transform your visuals into strong stories. You'll also gain insight into how interactive data and model visualization can optimize the performance of a regression model. By the end of the course, you'll have a new skill set that'll make you the go-to person for transforming data visualizations into engaging and interesting stories. What you will learnExplore and apply different interactive data visualization techniquesManipulate plotting parameters and styles to create appealing plotsCustomize data visualization for different audiencesDesign data visualizations using interactive librariesUse Matplotlib, Seaborn, Altair and Bokeh for drawing appealing plotsCustomize data visualization for different scenariosWho this book is for This book intends to provide a solid training ground for Python developers, data analysts and data scientists to enable them to present critical data insights in a way that best captures the user's attention and imagination. It serves as a simple step-by-step guide that demonstrates the different types and components of visualization, the principles, and techniques of effective interactivity, as well as common pitfalls to avoid when creating interactive data visualizations. Students should have an intermediate level of competency in writing Python code, as well as some familiarity with using libraries such as pandas.
Publisher: Packt Publishing Ltd
ISBN: 1800201060
Category : Computers
Languages : en
Pages : 362
Book Description
Create your own clear and impactful interactive data visualizations with the powerful data visualization libraries of Python Key FeaturesStudy and use Python interactive libraries, such as Bokeh and PlotlyExplore different visualization principles and understand when to use which oneCreate interactive data visualizations with real-world dataBook Description With so much data being continuously generated, developers, who can present data as impactful and interesting visualizations, are always in demand. Interactive Data Visualization with Python sharpens your data exploration skills, tells you everything there is to know about interactive data visualization in Python. You'll begin by learning how to draw various plots with Matplotlib and Seaborn, the non-interactive data visualization libraries. You'll study different types of visualizations, compare them, and find out how to select a particular type of visualization to suit your requirements. After you get a hang of the various non-interactive visualization libraries, you'll learn the principles of intuitive and persuasive data visualization, and use Bokeh and Plotly to transform your visuals into strong stories. You'll also gain insight into how interactive data and model visualization can optimize the performance of a regression model. By the end of the course, you'll have a new skill set that'll make you the go-to person for transforming data visualizations into engaging and interesting stories. What you will learnExplore and apply different interactive data visualization techniquesManipulate plotting parameters and styles to create appealing plotsCustomize data visualization for different audiencesDesign data visualizations using interactive librariesUse Matplotlib, Seaborn, Altair and Bokeh for drawing appealing plotsCustomize data visualization for different scenariosWho this book is for This book intends to provide a solid training ground for Python developers, data analysts and data scientists to enable them to present critical data insights in a way that best captures the user's attention and imagination. It serves as a simple step-by-step guide that demonstrates the different types and components of visualization, the principles, and techniques of effective interactivity, as well as common pitfalls to avoid when creating interactive data visualizations. Students should have an intermediate level of competency in writing Python code, as well as some familiarity with using libraries such as pandas.
Encyclopedia of Survey Research Methods
Author: Paul J. Lavrakas
Publisher: SAGE Publications
ISBN: 150631788X
Category : Social Science
Languages : en
Pages : 1073
Book Description
To the uninformed, surveys appear to be an easy type of research to design and conduct, but when students and professionals delve deeper, they encounter the vast complexities that the range and practice of survey methods present. To complicate matters, technology has rapidly affected the way surveys can be conducted; today, surveys are conducted via cell phone, the Internet, email, interactive voice response, and other technology-based modes. Thus, students, researchers, and professionals need both a comprehensive understanding of these complexities and a revised set of tools to meet the challenges. In conjunction with top survey researchers around the world and with Nielsen Media Research serving as the corporate sponsor, the Encyclopedia of Survey Research Methods presents state-of-the-art information and methodological examples from the field of survey research. Although there are other "how-to" guides and references texts on survey research, none is as comprehensive as this Encyclopedia, and none presents the material in such a focused and approachable manner. With more than 600 entries, this resource uses a Total Survey Error perspective that considers all aspects of possible survey error from a cost-benefit standpoint. Key Features Covers all major facets of survey research methodology, from selecting the sample design and the sampling frame, designing and pretesting the questionnaire, data collection, and data coding, to the thorny issues surrounding diminishing response rates, confidentiality, privacy, informed consent and other ethical issues, data weighting, and data analyses Presents a Reader′s Guide to organize entries around themes or specific topics and easily guide users to areas of interest Offers cross-referenced terms, a brief listing of Further Readings, and stable Web site URLs following most entries The Encyclopedia of Survey Research Methods is specifically written to appeal to beginning, intermediate, and advanced students, practitioners, researchers, consultants, and consumers of survey-based information.
Publisher: SAGE Publications
ISBN: 150631788X
Category : Social Science
Languages : en
Pages : 1073
Book Description
To the uninformed, surveys appear to be an easy type of research to design and conduct, but when students and professionals delve deeper, they encounter the vast complexities that the range and practice of survey methods present. To complicate matters, technology has rapidly affected the way surveys can be conducted; today, surveys are conducted via cell phone, the Internet, email, interactive voice response, and other technology-based modes. Thus, students, researchers, and professionals need both a comprehensive understanding of these complexities and a revised set of tools to meet the challenges. In conjunction with top survey researchers around the world and with Nielsen Media Research serving as the corporate sponsor, the Encyclopedia of Survey Research Methods presents state-of-the-art information and methodological examples from the field of survey research. Although there are other "how-to" guides and references texts on survey research, none is as comprehensive as this Encyclopedia, and none presents the material in such a focused and approachable manner. With more than 600 entries, this resource uses a Total Survey Error perspective that considers all aspects of possible survey error from a cost-benefit standpoint. Key Features Covers all major facets of survey research methodology, from selecting the sample design and the sampling frame, designing and pretesting the questionnaire, data collection, and data coding, to the thorny issues surrounding diminishing response rates, confidentiality, privacy, informed consent and other ethical issues, data weighting, and data analyses Presents a Reader′s Guide to organize entries around themes or specific topics and easily guide users to areas of interest Offers cross-referenced terms, a brief listing of Further Readings, and stable Web site URLs following most entries The Encyclopedia of Survey Research Methods is specifically written to appeal to beginning, intermediate, and advanced students, practitioners, researchers, consultants, and consumers of survey-based information.
Data Visualization Made Easy: A Beginner's Guide for Everyone
Author: M.B. Chatfield
Publisher: M.B. Chatfield
ISBN:
Category : Computers
Languages : en
Pages : 328
Book Description
Make your data sing with data visualization. In today's data-driven world, it's more important than ever to be able to communicate data effectively. Data visualization is the art of transforming data into visually appealing charts, graphs, and dashboards that convey your message clearly and persuasively. Data Visualization Made Easy is the perfect resource for anyone who wants to learn how to use data visualization to communicate their data. This comprehensive guide covers everything you need to know, from the basics of data visualization to advanced topics such as data storytelling and dashboard design. Whether you're a student, a business professional, or a data enthusiast, Data Visualization Made Easy is the essential resource for learning how to communicate data effectively. Here are some of the key topics covered in the book: Introduction to data visualization Choosing the right data visualization Creating effective data visualizations Data storytelling Dashboard design With Data Visualization Made Easy, you'll be well on your way to becoming a data visualization expert.
Publisher: M.B. Chatfield
ISBN:
Category : Computers
Languages : en
Pages : 328
Book Description
Make your data sing with data visualization. In today's data-driven world, it's more important than ever to be able to communicate data effectively. Data visualization is the art of transforming data into visually appealing charts, graphs, and dashboards that convey your message clearly and persuasively. Data Visualization Made Easy is the perfect resource for anyone who wants to learn how to use data visualization to communicate their data. This comprehensive guide covers everything you need to know, from the basics of data visualization to advanced topics such as data storytelling and dashboard design. Whether you're a student, a business professional, or a data enthusiast, Data Visualization Made Easy is the essential resource for learning how to communicate data effectively. Here are some of the key topics covered in the book: Introduction to data visualization Choosing the right data visualization Creating effective data visualizations Data storytelling Dashboard design With Data Visualization Made Easy, you'll be well on your way to becoming a data visualization expert.
Statistical Data Cleaning with Applications in R
Author: Mark van der Loo
Publisher: John Wiley & Sons
ISBN: 1118897137
Category : Computers
Languages : en
Pages : 396
Book Description
A comprehensive guide to automated statistical data cleaning The production of clean data is a complex and time-consuming process that requires both technical know-how and statistical expertise. Statistical Data Cleaning brings together a wide range of techniques for cleaning textual, numeric or categorical data. This book examines technical data cleaning methods relating to data representation and data structure. A prominent role is given to statistical data validation, data cleaning based on predefined restrictions, and data cleaning strategy. Key features: Focuses on the automation of data cleaning methods, including both theory and applications written in R. Enables the reader to design data cleaning processes for either one-off analytical purposes or for setting up production systems that clean data on a regular basis. Explores statistical techniques for solving issues such as incompleteness, contradictions and outliers, integration of data cleaning components and quality monitoring. Supported by an accompanying website featuring data and R code. This book enables data scientists and statistical analysts working with data to deepen their understanding of data cleaning as well as to upgrade their practical data cleaning skills. It can also be used as material for a course in data cleaning and analyses.
Publisher: John Wiley & Sons
ISBN: 1118897137
Category : Computers
Languages : en
Pages : 396
Book Description
A comprehensive guide to automated statistical data cleaning The production of clean data is a complex and time-consuming process that requires both technical know-how and statistical expertise. Statistical Data Cleaning brings together a wide range of techniques for cleaning textual, numeric or categorical data. This book examines technical data cleaning methods relating to data representation and data structure. A prominent role is given to statistical data validation, data cleaning based on predefined restrictions, and data cleaning strategy. Key features: Focuses on the automation of data cleaning methods, including both theory and applications written in R. Enables the reader to design data cleaning processes for either one-off analytical purposes or for setting up production systems that clean data on a regular basis. Explores statistical techniques for solving issues such as incompleteness, contradictions and outliers, integration of data cleaning components and quality monitoring. Supported by an accompanying website featuring data and R code. This book enables data scientists and statistical analysts working with data to deepen their understanding of data cleaning as well as to upgrade their practical data cleaning skills. It can also be used as material for a course in data cleaning and analyses.