Author: Jörg Drechsler
Publisher: Springer Science & Business Media
ISBN: 146140326X
Category : Social Science
Languages : en
Pages : 148
Book Description
The aim of this book is to give the reader a detailed introduction to the different approaches to generating multiply imputed synthetic datasets. It describes all approaches that have been developed so far, provides a brief history of synthetic datasets, and gives useful hints on how to deal with real data problems like nonresponse, skip patterns, or logical constraints. Each chapter is dedicated to one approach, first describing the general concept followed by a detailed application to a real dataset providing useful guidelines on how to implement the theory in practice. The discussed multiple imputation approaches include imputation for nonresponse, generating fully synthetic datasets, generating partially synthetic datasets, generating synthetic datasets when the original data is subject to nonresponse, and a two-stage imputation approach that helps to better address the omnipresent trade-off between analytical validity and the risk of disclosure. The book concludes with a glimpse into the future of synthetic datasets, discussing the potential benefits and possible obstacles of the approach and ways to address the concerns of data users and their understandable discomfort with using data that doesn’t consist only of the originally collected values. The book is intended for researchers and practitioners alike. It helps the researcher to find the state of the art in synthetic data summarized in one book with full reference to all relevant papers on the topic. But it is also useful for the practitioner at the statistical agency who is considering the synthetic data approach for data dissemination in the future and wants to get familiar with the topic.
Synthetic Datasets for Statistical Disclosure Control
Author: Jörg Drechsler
Publisher: Springer Science & Business Media
ISBN: 146140326X
Category : Social Science
Languages : en
Pages : 148
Book Description
The aim of this book is to give the reader a detailed introduction to the different approaches to generating multiply imputed synthetic datasets. It describes all approaches that have been developed so far, provides a brief history of synthetic datasets, and gives useful hints on how to deal with real data problems like nonresponse, skip patterns, or logical constraints. Each chapter is dedicated to one approach, first describing the general concept followed by a detailed application to a real dataset providing useful guidelines on how to implement the theory in practice. The discussed multiple imputation approaches include imputation for nonresponse, generating fully synthetic datasets, generating partially synthetic datasets, generating synthetic datasets when the original data is subject to nonresponse, and a two-stage imputation approach that helps to better address the omnipresent trade-off between analytical validity and the risk of disclosure. The book concludes with a glimpse into the future of synthetic datasets, discussing the potential benefits and possible obstacles of the approach and ways to address the concerns of data users and their understandable discomfort with using data that doesn’t consist only of the originally collected values. The book is intended for researchers and practitioners alike. It helps the researcher to find the state of the art in synthetic data summarized in one book with full reference to all relevant papers on the topic. But it is also useful for the practitioner at the statistical agency who is considering the synthetic data approach for data dissemination in the future and wants to get familiar with the topic.
Publisher: Springer Science & Business Media
ISBN: 146140326X
Category : Social Science
Languages : en
Pages : 148
Book Description
The aim of this book is to give the reader a detailed introduction to the different approaches to generating multiply imputed synthetic datasets. It describes all approaches that have been developed so far, provides a brief history of synthetic datasets, and gives useful hints on how to deal with real data problems like nonresponse, skip patterns, or logical constraints. Each chapter is dedicated to one approach, first describing the general concept followed by a detailed application to a real dataset providing useful guidelines on how to implement the theory in practice. The discussed multiple imputation approaches include imputation for nonresponse, generating fully synthetic datasets, generating partially synthetic datasets, generating synthetic datasets when the original data is subject to nonresponse, and a two-stage imputation approach that helps to better address the omnipresent trade-off between analytical validity and the risk of disclosure. The book concludes with a glimpse into the future of synthetic datasets, discussing the potential benefits and possible obstacles of the approach and ways to address the concerns of data users and their understandable discomfort with using data that doesn’t consist only of the originally collected values. The book is intended for researchers and practitioners alike. It helps the researcher to find the state of the art in synthetic data summarized in one book with full reference to all relevant papers on the topic. But it is also useful for the practitioner at the statistical agency who is considering the synthetic data approach for data dissemination in the future and wants to get familiar with the topic.
Data Disclosure
Author: Moritz Hennemann
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3111010600
Category : Law
Languages : en
Pages : 228
Book Description
Data has become a key factor for the competitiveness of private and state actors alike. Personal data in particular fuels manifold corresponding data ecosystems - in many cases based on the disclosure decision of an individual. This volume presents the proceedings of the bidt "Vectors of Data Disclosure" conference held in Munich 2022. The contributions give comparative insights into the data disclosure process - combining perspectives of law, cultural studies, and business information systems. The authors thereby tackle the question in which way regulation and cultural settings shape (or do not shape) respective decisions in different parts of the world. The volume also includes interim results of the corresponding bidt research project - including in-depth reports covering the regulatory and cultural dimensions of data disclosure in eight different countries / regions worldwide, a business information systems model of the disclosure decision process, and empirical studies. The volume thereby lays the ground for interdisciplinary informed policy decisions and gives guidance to stakeholders.
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 3111010600
Category : Law
Languages : en
Pages : 228
Book Description
Data has become a key factor for the competitiveness of private and state actors alike. Personal data in particular fuels manifold corresponding data ecosystems - in many cases based on the disclosure decision of an individual. This volume presents the proceedings of the bidt "Vectors of Data Disclosure" conference held in Munich 2022. The contributions give comparative insights into the data disclosure process - combining perspectives of law, cultural studies, and business information systems. The authors thereby tackle the question in which way regulation and cultural settings shape (or do not shape) respective decisions in different parts of the world. The volume also includes interim results of the corresponding bidt research project - including in-depth reports covering the regulatory and cultural dimensions of data disclosure in eight different countries / regions worldwide, a business information systems model of the disclosure decision process, and empirical studies. The volume thereby lays the ground for interdisciplinary informed policy decisions and gives guidance to stakeholders.
Overview of the Privacy Act of 1974
Author: United States. Department of Justice. Privacy and Civil Liberties Office
Publisher: Office of Information & Privacy
ISBN:
Category : Law
Languages : en
Pages : 336
Book Description
2012 edition. Issued biennially. Contains a discussion of the Privacy Act's disclosure prohibition, its access and amendment provisions, and its agency recordkeeping requirements. Provides reference to, and legal analysis of, court decisions interpreting the Act's provisions.
Publisher: Office of Information & Privacy
ISBN:
Category : Law
Languages : en
Pages : 336
Book Description
2012 edition. Issued biennially. Contains a discussion of the Privacy Act's disclosure prohibition, its access and amendment provisions, and its agency recordkeeping requirements. Provides reference to, and legal analysis of, court decisions interpreting the Act's provisions.
Political organizations data disclosure and IRS's oversight of organizations should be improved.
Author:
Publisher: DIANE Publishing
ISBN: 1428945512
Category :
Languages : en
Pages : 69
Book Description
Publisher: DIANE Publishing
ISBN: 1428945512
Category :
Languages : en
Pages : 69
Book Description
Elements of Statistical Disclosure Control
Author: Leon Willenborg
Publisher: Springer Science & Business Media
ISBN: 1461301211
Category : Business & Economics
Languages : en
Pages : 273
Book Description
Statistical disclosure control is the discipline that deals with producing statistical data that are safe enough to be released to external researchers. This book concentrates on the methodology of the area. It deals with both microdata (individual data) and tabular (aggregated) data. The book attempts to develop the theory from what can be called the paradigm of statistical confidentiality: to modify unsafe data in such a way that safe (enough) data emerge, with minimum information loss. This book discusses what safe data, are, how information loss can be measured, and how to modify the data in a (near) optimal way. Once it has been decided how to measure safety and information loss, the production of safe data from unsafe data is often a matter of solving an optimization problem. Several such problems are discussed in the book, and most of them turn out to be hard problems that can be solved only approximately. The authors present new results that have not been published before. The book is not a description of an area that is closed, but, on the contrary, one that still has many spots awaiting to be more fully explored. Some of these are indicated in the book. The book will be useful for official, social and medical statisticians and others who are involved in releasing personal or business data for statistical use. Operations researchers may be interested in the optimization problems involved, particularly for the challenges they present. Leon Willenborg has worked at the Department of Statistical Methods at Statistics Netherlands since 1983, first as a researcher and since 1989 as a senior researcher. Since 1989 his main field of research and consultancy has been statistical disclosure control. From 1996-1998 he was the project coordinator of the EU co-funded SDC project.
Publisher: Springer Science & Business Media
ISBN: 1461301211
Category : Business & Economics
Languages : en
Pages : 273
Book Description
Statistical disclosure control is the discipline that deals with producing statistical data that are safe enough to be released to external researchers. This book concentrates on the methodology of the area. It deals with both microdata (individual data) and tabular (aggregated) data. The book attempts to develop the theory from what can be called the paradigm of statistical confidentiality: to modify unsafe data in such a way that safe (enough) data emerge, with minimum information loss. This book discusses what safe data, are, how information loss can be measured, and how to modify the data in a (near) optimal way. Once it has been decided how to measure safety and information loss, the production of safe data from unsafe data is often a matter of solving an optimization problem. Several such problems are discussed in the book, and most of them turn out to be hard problems that can be solved only approximately. The authors present new results that have not been published before. The book is not a description of an area that is closed, but, on the contrary, one that still has many spots awaiting to be more fully explored. Some of these are indicated in the book. The book will be useful for official, social and medical statisticians and others who are involved in releasing personal or business data for statistical use. Operations researchers may be interested in the optimization problems involved, particularly for the challenges they present. Leon Willenborg has worked at the Department of Statistical Methods at Statistics Netherlands since 1983, first as a researcher and since 1989 as a senior researcher. Since 1989 his main field of research and consultancy has been statistical disclosure control. From 1996-1998 he was the project coordinator of the EU co-funded SDC project.
Statistical Disclosure Control for Microdata
Author: Matthias Templ
Publisher: Springer
ISBN: 3319502727
Category : Social Science
Languages : en
Pages : 299
Book Description
This book on statistical disclosure control presents the theory, applications and software implementation of the traditional approach to (micro)data anonymization, including data perturbation methods, disclosure risk, data utility, information loss and methods for simulating synthetic data. Introducing readers to the R packages sdcMicro and simPop, the book also features numerous examples and exercises with solutions, as well as case studies with real-world data, accompanied by the underlying R code to allow readers to reproduce all results. The demand for and volume of data from surveys, registers or other sources containing sensible information on persons or enterprises have increased significantly over the last several years. At the same time, privacy protection principles and regulations have imposed restrictions on the access and use of individual data. Proper and secure microdata dissemination calls for the application of statistical disclosure control methods to the da ta before release. This book is intended for practitioners at statistical agencies and other national and international organizations that deal with confidential data. It will also be interesting for researchers working in statistical disclosure control and the health sciences.
Publisher: Springer
ISBN: 3319502727
Category : Social Science
Languages : en
Pages : 299
Book Description
This book on statistical disclosure control presents the theory, applications and software implementation of the traditional approach to (micro)data anonymization, including data perturbation methods, disclosure risk, data utility, information loss and methods for simulating synthetic data. Introducing readers to the R packages sdcMicro and simPop, the book also features numerous examples and exercises with solutions, as well as case studies with real-world data, accompanied by the underlying R code to allow readers to reproduce all results. The demand for and volume of data from surveys, registers or other sources containing sensible information on persons or enterprises have increased significantly over the last several years. At the same time, privacy protection principles and regulations have imposed restrictions on the access and use of individual data. Proper and secure microdata dissemination calls for the application of statistical disclosure control methods to the da ta before release. This book is intended for practitioners at statistical agencies and other national and international organizations that deal with confidential data. It will also be interesting for researchers working in statistical disclosure control and the health sciences.
Statistical Disclosure Control
Author: Anco Hundepool
Publisher: John Wiley & Sons
ISBN: 1118348214
Category : Mathematics
Languages : en
Pages : 308
Book Description
A reference to answer all your statistical confidentiality questions. This handbook provides technical guidance on statistical disclosure control and on how to approach the problem of balancing the need to provide users with statistical outputs and the need to protect the confidentiality of respondents. Statistical disclosure control is combined with other tools such as administrative, legal and IT in order to define a proper data dissemination strategy based on a risk management approach. The key concepts of statistical disclosure control are presented, along with the methodology and software that can be used to apply various methods of statistical disclosure control. Numerous examples and guidelines are also featured to illustrate the topics covered. Statistical Disclosure Control: Presents a combination of both theoretical and practical solutions Introduces all the key concepts and definitions involved with statistical disclosure control. Provides a high level overview of how to approach problems associated with confidentiality. Provides a broad-ranging review of the methods available to control disclosure. Explains the subtleties of group disclosure control. Features examples throughout the book along with case studies demonstrating how particular methods are used. Discusses microdata, magnitude and frequency tabular data, and remote access issues. Written by experts within leading National Statistical Institutes. Official statisticians, academics and market researchers who need to be informed and make decisions on disclosure limitation will benefit from this book.
Publisher: John Wiley & Sons
ISBN: 1118348214
Category : Mathematics
Languages : en
Pages : 308
Book Description
A reference to answer all your statistical confidentiality questions. This handbook provides technical guidance on statistical disclosure control and on how to approach the problem of balancing the need to provide users with statistical outputs and the need to protect the confidentiality of respondents. Statistical disclosure control is combined with other tools such as administrative, legal and IT in order to define a proper data dissemination strategy based on a risk management approach. The key concepts of statistical disclosure control are presented, along with the methodology and software that can be used to apply various methods of statistical disclosure control. Numerous examples and guidelines are also featured to illustrate the topics covered. Statistical Disclosure Control: Presents a combination of both theoretical and practical solutions Introduces all the key concepts and definitions involved with statistical disclosure control. Provides a high level overview of how to approach problems associated with confidentiality. Provides a broad-ranging review of the methods available to control disclosure. Explains the subtleties of group disclosure control. Features examples throughout the book along with case studies demonstrating how particular methods are used. Discusses microdata, magnitude and frequency tabular data, and remote access issues. Written by experts within leading National Statistical Institutes. Official statisticians, academics and market researchers who need to be informed and make decisions on disclosure limitation will benefit from this book.
Statistical Disclosure Control in Practice
Author: Leon Willenborg
Publisher: Springer Science & Business Media
ISBN: 146124028X
Category : Mathematics
Languages : en
Pages : 164
Book Description
The aim of this book is to discuss various aspects associated with disseminating personal or business data collected in censuses or surveys or copied from administrative sources. The problem is to present the data in such a form that they are useful for statistical research and to provide sufficient protection for the individuals or businesses to whom the data refer. The major part of this book is concerned with how to define the disclosure problem and how to deal with it in practical circumstances.
Publisher: Springer Science & Business Media
ISBN: 146124028X
Category : Mathematics
Languages : en
Pages : 164
Book Description
The aim of this book is to discuss various aspects associated with disseminating personal or business data collected in censuses or surveys or copied from administrative sources. The problem is to present the data in such a form that they are useful for statistical research and to provide sufficient protection for the individuals or businesses to whom the data refer. The major part of this book is concerned with how to define the disclosure problem and how to deal with it in practical circumstances.
Private Data and Public Value
Author: Holly Jarman
Publisher: Springer
ISBN: 3319278231
Category : Law
Languages : en
Pages : 216
Book Description
This book investigates the ways in which these systems can promote public value by encouraging the disclosure and reuse of privately-held data in ways that support collective values such as environmental sustainability. Supported by funding from the National Science Foundation, the authors' research team has been working on one such system, designed to enhance consumers ability to access information about the sustainability of the products that they buy and the supply chains that produce them. Pulled by rapidly developing technology and pushed by budget cuts, politicians and public managers are attempting to find ways to increase the public value of their actions. Policymakers are increasingly acknowledging the potential that lies in publicly disclosing more of the data that they hold, as well as incentivizing individuals and organizations to access, use, and combine it in new ways. Due to technological advances which include smarter phones, better ways to track objects and people as they travel, and more efficient data processing, it is now possible to build systems which use shared, transparent data in creative ways. The book adds to the current conversation among academics and practitioners about how to promote public value through data disclosure, focusing particularly on the roles that governments, businesses and non-profit actors can play in this process, making it of interest to both scholars and policy-makers.
Publisher: Springer
ISBN: 3319278231
Category : Law
Languages : en
Pages : 216
Book Description
This book investigates the ways in which these systems can promote public value by encouraging the disclosure and reuse of privately-held data in ways that support collective values such as environmental sustainability. Supported by funding from the National Science Foundation, the authors' research team has been working on one such system, designed to enhance consumers ability to access information about the sustainability of the products that they buy and the supply chains that produce them. Pulled by rapidly developing technology and pushed by budget cuts, politicians and public managers are attempting to find ways to increase the public value of their actions. Policymakers are increasingly acknowledging the potential that lies in publicly disclosing more of the data that they hold, as well as incentivizing individuals and organizations to access, use, and combine it in new ways. Due to technological advances which include smarter phones, better ways to track objects and people as they travel, and more efficient data processing, it is now possible to build systems which use shared, transparent data in creative ways. The book adds to the current conversation among academics and practitioners about how to promote public value through data disclosure, focusing particularly on the roles that governments, businesses and non-profit actors can play in this process, making it of interest to both scholars and policy-makers.
Open and Big Data Management and Innovation
Author: Marijn Janssen
Publisher: Springer
ISBN: 3319250132
Category : Computers
Languages : en
Pages : 520
Book Description
This book constitutes the refereed conference proceedings of the 14th IFIP WG 6.11 Conference on e-Business, e-Services and e-Society, I3E 2015, held in Delft, The Netherlands, in October 2015. The 40 revised full papers presented together with 1 keynote panel were carefully reviewed and selected from 65 submissions. They are organized in the following topical sections: adoption; big and open data; e-business, e-services,, and e-society; and witness workshop.
Publisher: Springer
ISBN: 3319250132
Category : Computers
Languages : en
Pages : 520
Book Description
This book constitutes the refereed conference proceedings of the 14th IFIP WG 6.11 Conference on e-Business, e-Services and e-Society, I3E 2015, held in Delft, The Netherlands, in October 2015. The 40 revised full papers presented together with 1 keynote panel were carefully reviewed and selected from 65 submissions. They are organized in the following topical sections: adoption; big and open data; e-business, e-services,, and e-society; and witness workshop.