Author: Paul Attewell
Publisher: Univ of California Press
ISBN: 0520280989
Category : Computers
Languages : en
Pages : 264
Book Description
"The amount of information collected on human behavior every day is staggering, and exponentially greater than at any time in the past. At the same time, we are inundated by stories of powerful algorithms capable of churning through this sea of data and uncovering patterns. These techniques go by many names - data mining, predictive analytics, machine learning - and they are being used by governments as they spy on citizens and by huge corporations are they fine-tune their advertising strategies. And yet social scientists continue mainly to employ a set of analytical tools developed in an earlier era when data was sparse and difficult to come by. In this timely book, Paul Attewell and David Monaghan provide a simple and accessible introduction to Data Mining geared towards social scientists. They discuss how the data mining approach differs substantially, and in some ways radically, from that of conventional statistical modeling familiar to most social scientists. They demystify data mining, describing the diverse set of techniques that the term covers and discussing the strengths and weaknesses of the various approaches. Finally they give practical demonstrations of how to carry out analyses using data mining tools in a number of statistical software packages. It is the hope of the authors that this book will empower social scientists to consider incorporating data mining methodologies in their analytical toolkits"--Provided by publisher.
Data Mining for the Social Sciences
Author: Paul Attewell
Publisher: Univ of California Press
ISBN: 0520280989
Category : Computers
Languages : en
Pages : 264
Book Description
"The amount of information collected on human behavior every day is staggering, and exponentially greater than at any time in the past. At the same time, we are inundated by stories of powerful algorithms capable of churning through this sea of data and uncovering patterns. These techniques go by many names - data mining, predictive analytics, machine learning - and they are being used by governments as they spy on citizens and by huge corporations are they fine-tune their advertising strategies. And yet social scientists continue mainly to employ a set of analytical tools developed in an earlier era when data was sparse and difficult to come by. In this timely book, Paul Attewell and David Monaghan provide a simple and accessible introduction to Data Mining geared towards social scientists. They discuss how the data mining approach differs substantially, and in some ways radically, from that of conventional statistical modeling familiar to most social scientists. They demystify data mining, describing the diverse set of techniques that the term covers and discussing the strengths and weaknesses of the various approaches. Finally they give practical demonstrations of how to carry out analyses using data mining tools in a number of statistical software packages. It is the hope of the authors that this book will empower social scientists to consider incorporating data mining methodologies in their analytical toolkits"--Provided by publisher.
Publisher: Univ of California Press
ISBN: 0520280989
Category : Computers
Languages : en
Pages : 264
Book Description
"The amount of information collected on human behavior every day is staggering, and exponentially greater than at any time in the past. At the same time, we are inundated by stories of powerful algorithms capable of churning through this sea of data and uncovering patterns. These techniques go by many names - data mining, predictive analytics, machine learning - and they are being used by governments as they spy on citizens and by huge corporations are they fine-tune their advertising strategies. And yet social scientists continue mainly to employ a set of analytical tools developed in an earlier era when data was sparse and difficult to come by. In this timely book, Paul Attewell and David Monaghan provide a simple and accessible introduction to Data Mining geared towards social scientists. They discuss how the data mining approach differs substantially, and in some ways radically, from that of conventional statistical modeling familiar to most social scientists. They demystify data mining, describing the diverse set of techniques that the term covers and discussing the strengths and weaknesses of the various approaches. Finally they give practical demonstrations of how to carry out analyses using data mining tools in a number of statistical software packages. It is the hope of the authors that this book will empower social scientists to consider incorporating data mining methodologies in their analytical toolkits"--Provided by publisher.
Text Mining for Qualitative Data Analysis in the Social Sciences
Author: Gregor Wiedemann
Publisher: Springer
ISBN: 3658153091
Category : Social Science
Languages : en
Pages : 307
Book Description
Gregor Wiedemann evaluates text mining applications for social science studies with respect to conceptual integration of consciously selected methods, systematic optimization of algorithms and workflows, and methodological reflections relating to empirical research. In an exemplary study, he introduces workflows to analyze a corpus of around 600,000 newspaper articles on the subject of “democratic demarcation” in Germany. He provides a valuable resource for innovative measures to social scientists and computer scientists in the field of applied natural language processing.
Publisher: Springer
ISBN: 3658153091
Category : Social Science
Languages : en
Pages : 307
Book Description
Gregor Wiedemann evaluates text mining applications for social science studies with respect to conceptual integration of consciously selected methods, systematic optimization of algorithms and workflows, and methodological reflections relating to empirical research. In an exemplary study, he introduces workflows to analyze a corpus of around 600,000 newspaper articles on the subject of “democratic demarcation” in Germany. He provides a valuable resource for innovative measures to social scientists and computer scientists in the field of applied natural language processing.
Contemporary Issues in Exploratory Data Mining in the Behavioral Sciences
Author: John J. McArdle
Publisher: Routledge
ISBN: 1135044090
Category : Psychology
Languages : en
Pages : 496
Book Description
This book reviews the latest techniques in exploratory data mining (EDM) for the analysis of data in the social and behavioral sciences to help researchers assess the predictive value of different combinations of variables in large data sets. Methodological findings and conceptual models that explain reliable EDM techniques for predicting and understanding various risk mechanisms are integrated throughout. Numerous examples illustrate the use of these techniques in practice. Contributors provide insight through hands-on experiences with their own use of EDM techniques in various settings. Readers are also introduced to the most popular EDM software programs. A related website at http://mephisto.unige.ch/pub/edm-book-supplement/offers color versions of the book’s figures, a supplemental paper to chapter 3, and R commands for some chapters. The results of EDM analyses can be perilous – they are often taken as predictions with little regard for cross-validating the results. This carelessness can be catastrophic in terms of money lost or patients misdiagnosed. This book addresses these concerns and advocates for the development of checks and balances for EDM analyses. Both the promises and the perils of EDM are addressed. Editors McArdle and Ritschard taught the "Exploratory Data Mining" Advanced Training Institute of the American Psychological Association (APA). All contributors are top researchers from the US and Europe. Organized into two parts--methodology and applications, the techniques covered include decision, regression, and SEM tree models, growth mixture modeling, and time based categorical sequential analysis. Some of the applications of EDM (and the corresponding data) explored include: selection to college based on risky prior academic profiles the decline of cognitive abilities in older persons global perceptions of stress in adulthood predicting mortality from demographics and cognitive abilities risk factors during pregnancy and the impact on neonatal development Intended as a reference for researchers, methodologists, and advanced students in the social and behavioral sciences including psychology, sociology, business, econometrics, and medicine, interested in learning to apply the latest exploratory data mining techniques. Prerequisites include a basic class in statistics.
Publisher: Routledge
ISBN: 1135044090
Category : Psychology
Languages : en
Pages : 496
Book Description
This book reviews the latest techniques in exploratory data mining (EDM) for the analysis of data in the social and behavioral sciences to help researchers assess the predictive value of different combinations of variables in large data sets. Methodological findings and conceptual models that explain reliable EDM techniques for predicting and understanding various risk mechanisms are integrated throughout. Numerous examples illustrate the use of these techniques in practice. Contributors provide insight through hands-on experiences with their own use of EDM techniques in various settings. Readers are also introduced to the most popular EDM software programs. A related website at http://mephisto.unige.ch/pub/edm-book-supplement/offers color versions of the book’s figures, a supplemental paper to chapter 3, and R commands for some chapters. The results of EDM analyses can be perilous – they are often taken as predictions with little regard for cross-validating the results. This carelessness can be catastrophic in terms of money lost or patients misdiagnosed. This book addresses these concerns and advocates for the development of checks and balances for EDM analyses. Both the promises and the perils of EDM are addressed. Editors McArdle and Ritschard taught the "Exploratory Data Mining" Advanced Training Institute of the American Psychological Association (APA). All contributors are top researchers from the US and Europe. Organized into two parts--methodology and applications, the techniques covered include decision, regression, and SEM tree models, growth mixture modeling, and time based categorical sequential analysis. Some of the applications of EDM (and the corresponding data) explored include: selection to college based on risky prior academic profiles the decline of cognitive abilities in older persons global perceptions of stress in adulthood predicting mortality from demographics and cognitive abilities risk factors during pregnancy and the impact on neonatal development Intended as a reference for researchers, methodologists, and advanced students in the social and behavioral sciences including psychology, sociology, business, econometrics, and medicine, interested in learning to apply the latest exploratory data mining techniques. Prerequisites include a basic class in statistics.
Text Mining
Author: Gabe Ignatow
Publisher: SAGE Publications
ISBN: 1483369323
Category : Social Science
Languages : en
Pages : 189
Book Description
Online communities generate massive volumes of natural language data and the social sciences continue to learn how to best make use of this new information and the technology available for analyzing it. Text Mining brings together a broad range of contemporary qualitative and quantitative methods to provide strategic and practical guidance on analyzing large text collections. This accessible book, written by a sociologist and a computer scientist, surveys the fast-changing landscape of data sources, programming languages, software packages, and methods of analysis available today. Suitable for novice and experienced researchers alike, the book will help readers use text mining techniques more efficiently and productively.
Publisher: SAGE Publications
ISBN: 1483369323
Category : Social Science
Languages : en
Pages : 189
Book Description
Online communities generate massive volumes of natural language data and the social sciences continue to learn how to best make use of this new information and the technology available for analyzing it. Text Mining brings together a broad range of contemporary qualitative and quantitative methods to provide strategic and practical guidance on analyzing large text collections. This accessible book, written by a sociologist and a computer scientist, surveys the fast-changing landscape of data sources, programming languages, software packages, and methods of analysis available today. Suitable for novice and experienced researchers alike, the book will help readers use text mining techniques more efficiently and productively.
Data Science and Social Research
Author: N. Carlo Lauro
Publisher: Springer
ISBN: 3319554778
Category : Social Science
Languages : en
Pages : 292
Book Description
This edited volume lays the groundwork for Social Data Science, addressing epistemological issues, methods, technologies, software and applications of data science in the social sciences. It presents data science techniques for the collection, analysis and use of both online and offline new (big) data in social research and related applications. Among others, the individual contributions cover topics like social media, learning analytics, clustering, statistical literacy, recurrence analysis and network analysis. Data science is a multidisciplinary approach based mainly on the methods of statistics and computer science, and its aim is to develop appropriate methodologies for forecasting and decision-making in response to an increasingly complex reality often characterized by large amounts of data (big data) of various types (numeric, ordinal and nominal variables, symbolic data, texts, images, data streams, multi-way data, social networks etc.) and from diverse sources. This book presents selected papers from the international conference on Data Science & Social Research, held in Naples, Italy in February 2016, and will appeal to researchers in the social sciences working in academia as well as in statistical institutes and offices.
Publisher: Springer
ISBN: 3319554778
Category : Social Science
Languages : en
Pages : 292
Book Description
This edited volume lays the groundwork for Social Data Science, addressing epistemological issues, methods, technologies, software and applications of data science in the social sciences. It presents data science techniques for the collection, analysis and use of both online and offline new (big) data in social research and related applications. Among others, the individual contributions cover topics like social media, learning analytics, clustering, statistical literacy, recurrence analysis and network analysis. Data science is a multidisciplinary approach based mainly on the methods of statistics and computer science, and its aim is to develop appropriate methodologies for forecasting and decision-making in response to an increasingly complex reality often characterized by large amounts of data (big data) of various types (numeric, ordinal and nominal variables, symbolic data, texts, images, data streams, multi-way data, social networks etc.) and from diverse sources. This book presents selected papers from the international conference on Data Science & Social Research, held in Naples, Italy in February 2016, and will appeal to researchers in the social sciences working in academia as well as in statistical institutes and offices.
Decoding the Social World
Author: Sandra Gonzalez-Bailon
Publisher: MIT Press
ISBN: 0262343460
Category : Social Science
Languages : en
Pages : 257
Book Description
How data science and the analysis of networks help us solve the puzzle of unintended consequences. Social life is full of paradoxes. Our intentional actions often trigger outcomes that we did not intend or even envision. How do we explain those unintended effects and what can we do to regulate them? In Decoding the Social World, Sandra González-Bailón explains how data science and digital traces help us solve the puzzle of unintended consequences—offering the solution to a social paradox that has intrigued thinkers for centuries. Communication has always been the force that makes a collection of people more than the sum of individuals, but only now can we explain why: digital technologies have made it possible to parse the information we generate by being social in new, imaginative ways. And yet we must look at that data, González-Bailón argues, through the lens of theories that capture the nature of social life. The technologies we use, in the end, are also a manifestation of the social world we inhabit. González-Bailón discusses how the unpredictability of social life relates to communication networks, social influence, and the unintended effects that derive from individual decisions. She describes how communication generates social dynamics in aggregate (leading to episodes of “collective effervescence”) and discusses the mechanisms that underlie large-scale diffusion, when information and behavior spread “like wildfire.” She applies the theory of networks to illuminate why collective outcomes can differ drastically even when they arise from the same individual actions. By opening the black box of unintended effects, González-Bailón identifies strategies for social intervention and discusses the policy implications—and how data science and evidence-based research embolden critical thinking in a world that is constantly changing.
Publisher: MIT Press
ISBN: 0262343460
Category : Social Science
Languages : en
Pages : 257
Book Description
How data science and the analysis of networks help us solve the puzzle of unintended consequences. Social life is full of paradoxes. Our intentional actions often trigger outcomes that we did not intend or even envision. How do we explain those unintended effects and what can we do to regulate them? In Decoding the Social World, Sandra González-Bailón explains how data science and digital traces help us solve the puzzle of unintended consequences—offering the solution to a social paradox that has intrigued thinkers for centuries. Communication has always been the force that makes a collection of people more than the sum of individuals, but only now can we explain why: digital technologies have made it possible to parse the information we generate by being social in new, imaginative ways. And yet we must look at that data, González-Bailón argues, through the lens of theories that capture the nature of social life. The technologies we use, in the end, are also a manifestation of the social world we inhabit. González-Bailón discusses how the unpredictability of social life relates to communication networks, social influence, and the unintended effects that derive from individual decisions. She describes how communication generates social dynamics in aggregate (leading to episodes of “collective effervescence”) and discusses the mechanisms that underlie large-scale diffusion, when information and behavior spread “like wildfire.” She applies the theory of networks to illuminate why collective outcomes can differ drastically even when they arise from the same individual actions. By opening the black box of unintended effects, González-Bailón identifies strategies for social intervention and discusses the policy implications—and how data science and evidence-based research embolden critical thinking in a world that is constantly changing.
Quantitative Social Science
Author: Kosuke Imai
Publisher: Princeton University Press
ISBN: 0691191093
Category : Political Science
Languages : en
Pages : 464
Book Description
"Princeton University Press published Imai's textbook, Quantitative Social Science: An Introduction, an introduction to quantitative methods and data science for upper level undergrads and graduates in professional programs, in February 2017. What is distinct about the book is how it leads students through a series of applied examples of statistical methods, drawing on real examples from social science research. The original book was prepared with the statistical software R, which is freely available online and has gained in popularity in recent years. But many existing courses in statistics and data sciences, particularly in some subject areas like sociology and law, use STATA, another general purpose package that has been the market leader since the 1980s. We've had several requests for STATA versions of the text as many programs use it by default. This is a "translation" of the original text, keeping all the current pedagogical text but inserting the necessary code and outputs from STATA in their place"--
Publisher: Princeton University Press
ISBN: 0691191093
Category : Political Science
Languages : en
Pages : 464
Book Description
"Princeton University Press published Imai's textbook, Quantitative Social Science: An Introduction, an introduction to quantitative methods and data science for upper level undergrads and graduates in professional programs, in February 2017. What is distinct about the book is how it leads students through a series of applied examples of statistical methods, drawing on real examples from social science research. The original book was prepared with the statistical software R, which is freely available online and has gained in popularity in recent years. But many existing courses in statistics and data sciences, particularly in some subject areas like sociology and law, use STATA, another general purpose package that has been the market leader since the 1980s. We've had several requests for STATA versions of the text as many programs use it by default. This is a "translation" of the original text, keeping all the current pedagogical text but inserting the necessary code and outputs from STATA in their place"--
Clinical Data-Mining
Author: Irwin Epstein
Publisher: Oxford University Press
ISBN: 019533552X
Category : Computers
Languages : en
Pages : 241
Book Description
Clinical Data-Mining (CDM) involves the conceptualization, extraction, analysis, and interpretation of available clinical data for practice knowledge-building, clinical decision-making and practitioner reflection. Depending upon the type of data mined, CDM can be qualitative or quantitative; it is generally retrospective, but may be meaningfully combined with original data collection.Any research method that relies on the contents of case records or information systems data inevitably has limitations, but with proper safeguards these can be minimized. Among CDM's strengths however, are that it is unobtrusive, inexpensive, presents little risk to research subjects, and is ethically compatible with practitioner value commitments. When conducted by practitioners, CDM yields conceptual as well as data-driven insight into their own practice- and program-generated questions.This pocket guide, from a seasoned practice-based researcher, covers all the basics of conducting practitioner-initiated CDM studies or CDM doctoral dissertations, drawing extensively on published CDM studies and completed CDM dissertations from multiple social work settings in the United States, Australia, Israel, Hong Kong and the United Kingdom. In addition, it describes consulting principles for researchers interested in forging collaborative university-agency CDM partnerships, making it a practical tool for novice practitioner-researchers and veteran academic-researchers alike.As such, this book is an exceptional guide both for professionals conducting practice-based research as well as for social work faculty seeking an evidence-informed approach to practice-research integration.
Publisher: Oxford University Press
ISBN: 019533552X
Category : Computers
Languages : en
Pages : 241
Book Description
Clinical Data-Mining (CDM) involves the conceptualization, extraction, analysis, and interpretation of available clinical data for practice knowledge-building, clinical decision-making and practitioner reflection. Depending upon the type of data mined, CDM can be qualitative or quantitative; it is generally retrospective, but may be meaningfully combined with original data collection.Any research method that relies on the contents of case records or information systems data inevitably has limitations, but with proper safeguards these can be minimized. Among CDM's strengths however, are that it is unobtrusive, inexpensive, presents little risk to research subjects, and is ethically compatible with practitioner value commitments. When conducted by practitioners, CDM yields conceptual as well as data-driven insight into their own practice- and program-generated questions.This pocket guide, from a seasoned practice-based researcher, covers all the basics of conducting practitioner-initiated CDM studies or CDM doctoral dissertations, drawing extensively on published CDM studies and completed CDM dissertations from multiple social work settings in the United States, Australia, Israel, Hong Kong and the United Kingdom. In addition, it describes consulting principles for researchers interested in forging collaborative university-agency CDM partnerships, making it a practical tool for novice practitioner-researchers and veteran academic-researchers alike.As such, this book is an exceptional guide both for professionals conducting practice-based research as well as for social work faculty seeking an evidence-informed approach to practice-research integration.
Big Data in Computational Social Science and Humanities
Author: Shu-Heng Chen
Publisher: Springer
ISBN: 3319954652
Category : Computers
Languages : en
Pages : 391
Book Description
This edited volume focuses on big data implications for computational social science and humanities from management to usage. The first part of the book covers geographic data, text corpus data, and social media data, and exemplifies their concrete applications in a wide range of fields including anthropology, economics, finance, geography, history, linguistics, political science, psychology, public health, and mass communications. The second part of the book provides a panoramic view of the development of big data in the fields of computational social sciences and humanities. The following questions are addressed: why is there a need for novel data governance for this new type of data?, why is big data important for social scientists?, and how will it revolutionize the way social scientists conduct research? With the advent of the information age and technologies such as Web 2.0, ubiquitous computing, wearable devices, and the Internet of Things, digital society has fundamentally changed what we now know as "data", the very use of this data, and what we now call "knowledge". Big data has become the standard in social sciences, and has made these sciences more computational. Big Data in Computational Social Science and Humanities will appeal to graduate students and researchers working in the many subfields of the social sciences and humanities.
Publisher: Springer
ISBN: 3319954652
Category : Computers
Languages : en
Pages : 391
Book Description
This edited volume focuses on big data implications for computational social science and humanities from management to usage. The first part of the book covers geographic data, text corpus data, and social media data, and exemplifies their concrete applications in a wide range of fields including anthropology, economics, finance, geography, history, linguistics, political science, psychology, public health, and mass communications. The second part of the book provides a panoramic view of the development of big data in the fields of computational social sciences and humanities. The following questions are addressed: why is there a need for novel data governance for this new type of data?, why is big data important for social scientists?, and how will it revolutionize the way social scientists conduct research? With the advent of the information age and technologies such as Web 2.0, ubiquitous computing, wearable devices, and the Internet of Things, digital society has fundamentally changed what we now know as "data", the very use of this data, and what we now call "knowledge". Big data has become the standard in social sciences, and has made these sciences more computational. Big Data in Computational Social Science and Humanities will appeal to graduate students and researchers working in the many subfields of the social sciences and humanities.
Introduction to Data Science for Social and Policy Research
Author: Jose Manuel Magallanes Reyes
Publisher: Cambridge University Press
ISBN: 1107117410
Category : Computers
Languages : en
Pages : 317
Book Description
This comprehensive guide provides a step-by-step approach to data collection, cleaning, formatting, and storage, using Python and R.
Publisher: Cambridge University Press
ISBN: 1107117410
Category : Computers
Languages : en
Pages : 317
Book Description
This comprehensive guide provides a step-by-step approach to data collection, cleaning, formatting, and storage, using Python and R.