Author: Agostino Di Ciaccio
Publisher: Springer Science & Business Media
ISBN: 3642210376
Category : Mathematics
Languages : en
Pages : 464
Book Description
The theme of the meeting was “Statistical Methods for the Analysis of Large Data-Sets”. In recent years there has been increasing interest in this subject; in fact a huge quantity of information is often available but standard statistical techniques are usually not well suited to managing this kind of data. The conference serves as an important meeting point for European researchers working on this topic and a number of European statistical societies participated in the organization of the event. The book includes 45 papers from a selection of the 156 papers accepted for presentation and discussed at the conference on “Advanced Statistical Methods for the Analysis of Large Data-sets.”
Advanced Statistical Methods for the Analysis of Large Data-Sets
Author: Agostino Di Ciaccio
Publisher: Springer Science & Business Media
ISBN: 3642210376
Category : Mathematics
Languages : en
Pages : 464
Book Description
The theme of the meeting was “Statistical Methods for the Analysis of Large Data-Sets”. In recent years there has been increasing interest in this subject; in fact a huge quantity of information is often available but standard statistical techniques are usually not well suited to managing this kind of data. The conference serves as an important meeting point for European researchers working on this topic and a number of European statistical societies participated in the organization of the event. The book includes 45 papers from a selection of the 156 papers accepted for presentation and discussed at the conference on “Advanced Statistical Methods for the Analysis of Large Data-sets.”
Publisher: Springer Science & Business Media
ISBN: 3642210376
Category : Mathematics
Languages : en
Pages : 464
Book Description
The theme of the meeting was “Statistical Methods for the Analysis of Large Data-Sets”. In recent years there has been increasing interest in this subject; in fact a huge quantity of information is often available but standard statistical techniques are usually not well suited to managing this kind of data. The conference serves as an important meeting point for European researchers working on this topic and a number of European statistical societies participated in the organization of the event. The book includes 45 papers from a selection of the 156 papers accepted for presentation and discussed at the conference on “Advanced Statistical Methods for the Analysis of Large Data-sets.”
Statistical Foundations of Data Science
Author: Jianqing Fan
Publisher: CRC Press
ISBN: 0429527616
Category : Mathematics
Languages : en
Pages : 974
Book Description
Statistical Foundations of Data Science gives a thorough introduction to commonly used statistical models, contemporary statistical machine learning techniques and algorithms, along with their mathematical insights and statistical theories. It aims to serve as a graduate-level textbook and a research monograph on high-dimensional statistics, sparsity and covariance learning, machine learning, and statistical inference. It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis. It then introduces multiple linear regression and expands the techniques of model building via nonparametric regression and kernel tricks. It provides a comprehensive account on sparsity explorations and model selections for multiple regression, generalized linear models, quantile regression, robust regression, hazards regression, among others. High-dimensional inference is also thoroughly addressed and so is feature screening. The book also provides a comprehensive account on high-dimensional covariance estimation, learning latent factors and hidden structures, as well as their applications to statistical estimation, inference, prediction and machine learning problems. It also introduces thoroughly statistical machine learning theory and methods for classification, clustering, and prediction. These include CART, random forests, boosting, support vector machines, clustering algorithms, sparse PCA, and deep learning.
Publisher: CRC Press
ISBN: 0429527616
Category : Mathematics
Languages : en
Pages : 974
Book Description
Statistical Foundations of Data Science gives a thorough introduction to commonly used statistical models, contemporary statistical machine learning techniques and algorithms, along with their mathematical insights and statistical theories. It aims to serve as a graduate-level textbook and a research monograph on high-dimensional statistics, sparsity and covariance learning, machine learning, and statistical inference. It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis. It then introduces multiple linear regression and expands the techniques of model building via nonparametric regression and kernel tricks. It provides a comprehensive account on sparsity explorations and model selections for multiple regression, generalized linear models, quantile regression, robust regression, hazards regression, among others. High-dimensional inference is also thoroughly addressed and so is feature screening. The book also provides a comprehensive account on high-dimensional covariance estimation, learning latent factors and hidden structures, as well as their applications to statistical estimation, inference, prediction and machine learning problems. It also introduces thoroughly statistical machine learning theory and methods for classification, clustering, and prediction. These include CART, random forests, boosting, support vector machines, clustering algorithms, sparse PCA, and deep learning.
Statistical Methods in Water Resources
Author: D.R. Helsel
Publisher: Elsevier
ISBN: 0080875084
Category : Science
Languages : en
Pages : 539
Book Description
Data on water quality and other environmental issues are being collected at an ever-increasing rate. In the past, however, the techniques used by scientists to interpret this data have not progressed as quickly. This is a book of modern statistical methods for analysis of practical problems in water quality and water resources.The last fifteen years have seen major advances in the fields of exploratory data analysis (EDA) and robust statistical methods. The 'real-life' characteristics of environmental data tend to drive analysis towards the use of these methods. These advances are presented in a practical and relevant format. Alternate methods are compared, highlighting the strengths and weaknesses of each as applied to environmental data. Techniques for trend analysis and dealing with water below the detection limit are topics covered, which are of great interest to consultants in water-quality and hydrology, scientists in state, provincial and federal water resources, and geological survey agencies.The practising water resources scientist will find the worked examples using actual field data from case studies of environmental problems, of real value. Exercises at the end of each chapter enable the mechanics of the methodological process to be fully understood, with data sets included on diskette for easy use. The result is a book that is both up-to-date and immediately relevant to ongoing work in the environmental and water sciences.
Publisher: Elsevier
ISBN: 0080875084
Category : Science
Languages : en
Pages : 539
Book Description
Data on water quality and other environmental issues are being collected at an ever-increasing rate. In the past, however, the techniques used by scientists to interpret this data have not progressed as quickly. This is a book of modern statistical methods for analysis of practical problems in water quality and water resources.The last fifteen years have seen major advances in the fields of exploratory data analysis (EDA) and robust statistical methods. The 'real-life' characteristics of environmental data tend to drive analysis towards the use of these methods. These advances are presented in a practical and relevant format. Alternate methods are compared, highlighting the strengths and weaknesses of each as applied to environmental data. Techniques for trend analysis and dealing with water below the detection limit are topics covered, which are of great interest to consultants in water-quality and hydrology, scientists in state, provincial and federal water resources, and geological survey agencies.The practising water resources scientist will find the worked examples using actual field data from case studies of environmental problems, of real value. Exercises at the end of each chapter enable the mechanics of the methodological process to be fully understood, with data sets included on diskette for easy use. The result is a book that is both up-to-date and immediately relevant to ongoing work in the environmental and water sciences.
The Statistical Analysis of Small Data Sets
Author: Markus Neuhäuser
Publisher: Oxford University Press
ISBN: 0198872992
Category : Mathematics
Languages : en
Pages : 161
Book Description
We live in the era of big data. However, small data sets are still common for ethical, financial, or practical reasons. Small sample sizes can cause researchers to seek out the most powerful methods to analyse their data, but they may also be wary that some methodologies and assumptions may not be appropriate when samples are small. The book offers advice on the statistical analysis of small data sets for various designs and levels of measurement, helping researchers to analyse such data sets, but also to evaluate and interpret others' analyses. The book discusses the potential challenges associated with a small sample, as well as the ways in which these challenges can be mitigated. General topics with strong relevance to small sample sizes such as meta-analysis, sequential and adaptive designs, and multiple testing are introduced. While the focus is on hypothesis tests and confidence intervals, Bayesian analyses are also covered. Code written in the statistical software R is presented to carry out the proposed methods, many of which are not limited to use on small data sets, and the book also discusses approaches to computing the power or the necessary sample size, respectively.
Publisher: Oxford University Press
ISBN: 0198872992
Category : Mathematics
Languages : en
Pages : 161
Book Description
We live in the era of big data. However, small data sets are still common for ethical, financial, or practical reasons. Small sample sizes can cause researchers to seek out the most powerful methods to analyse their data, but they may also be wary that some methodologies and assumptions may not be appropriate when samples are small. The book offers advice on the statistical analysis of small data sets for various designs and levels of measurement, helping researchers to analyse such data sets, but also to evaluate and interpret others' analyses. The book discusses the potential challenges associated with a small sample, as well as the ways in which these challenges can be mitigated. General topics with strong relevance to small sample sizes such as meta-analysis, sequential and adaptive designs, and multiple testing are introduced. While the focus is on hypothesis tests and confidence intervals, Bayesian analyses are also covered. Code written in the statistical software R is presented to carry out the proposed methods, many of which are not limited to use on small data sets, and the book also discusses approaches to computing the power or the necessary sample size, respectively.
Mixed Effects Models for Complex Data
Author: Lang Wu
Publisher: CRC Press
ISBN: 9781420074086
Category : Mathematics
Languages : en
Pages : 431
Book Description
Although standard mixed effects models are useful in a range of studies, other approaches must often be used in correlation with them when studying complex or incomplete data. Mixed Effects Models for Complex Data discusses commonly used mixed effects models and presents appropriate approaches to address dropouts, missing data, measurement errors, censoring, and outliers. For each class of mixed effects model, the author reviews the corresponding class of regression model for cross-sectional data. An overview of general models and methods, along with motivating examples After presenting real data examples and outlining general approaches to the analysis of longitudinal/clustered data and incomplete data, the book introduces linear mixed effects (LME) models, generalized linear mixed models (GLMMs), nonlinear mixed effects (NLME) models, and semiparametric and nonparametric mixed effects models. It also includes general approaches for the analysis of complex data with missing values, measurement errors, censoring, and outliers. Self-contained coverage of specific topics Subsequent chapters delve more deeply into missing data problems, covariate measurement errors, and censored responses in mixed effects models. Focusing on incomplete data, the book also covers survival and frailty models, joint models of survival and longitudinal data, robust methods for mixed effects models, marginal generalized estimating equation (GEE) models for longitudinal or clustered data, and Bayesian methods for mixed effects models. Background material In the appendix, the author provides background information, such as likelihood theory, the Gibbs sampler, rejection and importance sampling methods, numerical integration methods, optimization methods, bootstrap, and matrix algebra. Failure to properly address missing data, measurement errors, and other issues in statistical analyses can lead to severely biased or misleading results. This book explores the biases that arise when naïve methods are used and shows which approaches should be used to achieve accurate results in longitudinal data analysis.
Publisher: CRC Press
ISBN: 9781420074086
Category : Mathematics
Languages : en
Pages : 431
Book Description
Although standard mixed effects models are useful in a range of studies, other approaches must often be used in correlation with them when studying complex or incomplete data. Mixed Effects Models for Complex Data discusses commonly used mixed effects models and presents appropriate approaches to address dropouts, missing data, measurement errors, censoring, and outliers. For each class of mixed effects model, the author reviews the corresponding class of regression model for cross-sectional data. An overview of general models and methods, along with motivating examples After presenting real data examples and outlining general approaches to the analysis of longitudinal/clustered data and incomplete data, the book introduces linear mixed effects (LME) models, generalized linear mixed models (GLMMs), nonlinear mixed effects (NLME) models, and semiparametric and nonparametric mixed effects models. It also includes general approaches for the analysis of complex data with missing values, measurement errors, censoring, and outliers. Self-contained coverage of specific topics Subsequent chapters delve more deeply into missing data problems, covariate measurement errors, and censored responses in mixed effects models. Focusing on incomplete data, the book also covers survival and frailty models, joint models of survival and longitudinal data, robust methods for mixed effects models, marginal generalized estimating equation (GEE) models for longitudinal or clustered data, and Bayesian methods for mixed effects models. Background material In the appendix, the author provides background information, such as likelihood theory, the Gibbs sampler, rejection and importance sampling methods, numerical integration methods, optimization methods, bootstrap, and matrix algebra. Failure to properly address missing data, measurement errors, and other issues in statistical analyses can lead to severely biased or misleading results. This book explores the biases that arise when naïve methods are used and shows which approaches should be used to achieve accurate results in longitudinal data analysis.
Probabilistic Foundations of Statistical Network Analysis
Author: Harry Crane
Publisher: CRC Press
ISBN: 1351807331
Category : Business & Economics
Languages : en
Pages : 236
Book Description
Probabilistic Foundations of Statistical Network Analysis presents a fresh and insightful perspective on the fundamental tenets and major challenges of modern network analysis. Its lucid exposition provides necessary background for understanding the essential ideas behind exchangeable and dynamic network models, network sampling, and network statistics such as sparsity and power law, all of which play a central role in contemporary data science and machine learning applications. The book rewards readers with a clear and intuitive understanding of the subtle interplay between basic principles of statistical inference, empirical properties of network data, and technical concepts from probability theory. Its mathematically rigorous, yet non-technical, exposition makes the book accessible to professional data scientists, statisticians, and computer scientists as well as practitioners and researchers in substantive fields. Newcomers and non-quantitative researchers will find its conceptual approach invaluable for developing intuition about technical ideas from statistics and probability, while experts and graduate students will find the book a handy reference for a wide range of new topics, including edge exchangeability, relative exchangeability, graphon and graphex models, and graph-valued Levy process and rewiring models for dynamic networks. The author’s incisive commentary supplements these core concepts, challenging the reader to push beyond the current limitations of this emerging discipline. With an approachable exposition and more than 50 open research problems and exercises with solutions, this book is ideal for advanced undergraduate and graduate students interested in modern network analysis, data science, machine learning, and statistics. Harry Crane is Associate Professor and Co-Director of the Graduate Program in Statistics and Biostatistics and an Associate Member of the Graduate Faculty in Philosophy at Rutgers University. Professor Crane’s research interests cover a range of mathematical and applied topics in network science, probability theory, statistical inference, and mathematical logic. In addition to his technical work on edge and relational exchangeability, relative exchangeability, and graph-valued Markov processes, Prof. Crane’s methods have been applied to domain-specific cybersecurity and counterterrorism problems at the Foreign Policy Research Institute and RAND’s Project AIR FORCE.
Publisher: CRC Press
ISBN: 1351807331
Category : Business & Economics
Languages : en
Pages : 236
Book Description
Probabilistic Foundations of Statistical Network Analysis presents a fresh and insightful perspective on the fundamental tenets and major challenges of modern network analysis. Its lucid exposition provides necessary background for understanding the essential ideas behind exchangeable and dynamic network models, network sampling, and network statistics such as sparsity and power law, all of which play a central role in contemporary data science and machine learning applications. The book rewards readers with a clear and intuitive understanding of the subtle interplay between basic principles of statistical inference, empirical properties of network data, and technical concepts from probability theory. Its mathematically rigorous, yet non-technical, exposition makes the book accessible to professional data scientists, statisticians, and computer scientists as well as practitioners and researchers in substantive fields. Newcomers and non-quantitative researchers will find its conceptual approach invaluable for developing intuition about technical ideas from statistics and probability, while experts and graduate students will find the book a handy reference for a wide range of new topics, including edge exchangeability, relative exchangeability, graphon and graphex models, and graph-valued Levy process and rewiring models for dynamic networks. The author’s incisive commentary supplements these core concepts, challenging the reader to push beyond the current limitations of this emerging discipline. With an approachable exposition and more than 50 open research problems and exercises with solutions, this book is ideal for advanced undergraduate and graduate students interested in modern network analysis, data science, machine learning, and statistics. Harry Crane is Associate Professor and Co-Director of the Graduate Program in Statistics and Biostatistics and an Associate Member of the Graduate Faculty in Philosophy at Rutgers University. Professor Crane’s research interests cover a range of mathematical and applied topics in network science, probability theory, statistical inference, and mathematical logic. In addition to his technical work on edge and relational exchangeability, relative exchangeability, and graph-valued Markov processes, Prof. Crane’s methods have been applied to domain-specific cybersecurity and counterterrorism problems at the Foreign Policy Research Institute and RAND’s Project AIR FORCE.
Applied Quantitative Finance
Author: Wolfgang Karl Härdle
Publisher: Springer
ISBN: 3662544865
Category : Business & Economics
Languages : en
Pages : 369
Book Description
This volume provides practical solutions and introduces recent theoretical developments in risk management, pricing of credit derivatives, quantification of volatility and copula modeling. This third edition is devoted to modern risk analysis based on quantitative methods and textual analytics to meet the current challenges in banking and finance. It includes 14 new contributions and presents a comprehensive, state-of-the-art treatment of cutting-edge methods and topics, such as collateralized debt obligations, the high-frequency analysis of market liquidity, and realized volatility. The book is divided into three parts: Part 1 revisits important market risk issues, while Part 2 introduces novel concepts in credit risk and its management along with updated quantitative methods. The third part discusses the dynamics of risk management and includes risk analysis of energy markets and for cryptocurrencies. Digital assets, such as blockchain-based currencies, have become popular b ut are theoretically challenging when based on conventional methods. Among others, it introduces a modern text-mining method called dynamic topic modeling in detail and applies it to the message board of Bitcoins. The unique synthesis of theory and practice supported by computational tools is reflected not only in the selection of topics, but also in the fine balance of scientific contributions on practical implementation and theoretical concepts. This link between theory and practice offers theoreticians insights into considerations of applicability and, vice versa, provides practitioners convenient access to new techniques in quantitative finance. Hence the book will appeal both to researchers, including master and PhD students, and practitioners, such as financial engineers. The results presented in the book are fully reproducible and all quantlets needed for calculations are provided on an accompanying website. The Quantlet platform quantlet.de, quantlet.com, quantlet.org is an integrated QuantNet environment consisting of different types of statistics-related documents and program codes. Its goal is to promote reproducibility and offer a platform for sharing validated knowledge native to the social web. QuantNet and the corresponding Data-Driven Documents-based visualization allows readers to reproduce the tables, pictures and calculations inside this Springer book.
Publisher: Springer
ISBN: 3662544865
Category : Business & Economics
Languages : en
Pages : 369
Book Description
This volume provides practical solutions and introduces recent theoretical developments in risk management, pricing of credit derivatives, quantification of volatility and copula modeling. This third edition is devoted to modern risk analysis based on quantitative methods and textual analytics to meet the current challenges in banking and finance. It includes 14 new contributions and presents a comprehensive, state-of-the-art treatment of cutting-edge methods and topics, such as collateralized debt obligations, the high-frequency analysis of market liquidity, and realized volatility. The book is divided into three parts: Part 1 revisits important market risk issues, while Part 2 introduces novel concepts in credit risk and its management along with updated quantitative methods. The third part discusses the dynamics of risk management and includes risk analysis of energy markets and for cryptocurrencies. Digital assets, such as blockchain-based currencies, have become popular b ut are theoretically challenging when based on conventional methods. Among others, it introduces a modern text-mining method called dynamic topic modeling in detail and applies it to the message board of Bitcoins. The unique synthesis of theory and practice supported by computational tools is reflected not only in the selection of topics, but also in the fine balance of scientific contributions on practical implementation and theoretical concepts. This link between theory and practice offers theoreticians insights into considerations of applicability and, vice versa, provides practitioners convenient access to new techniques in quantitative finance. Hence the book will appeal both to researchers, including master and PhD students, and practitioners, such as financial engineers. The results presented in the book are fully reproducible and all quantlets needed for calculations are provided on an accompanying website. The Quantlet platform quantlet.de, quantlet.com, quantlet.org is an integrated QuantNet environment consisting of different types of statistics-related documents and program codes. Its goal is to promote reproducibility and offer a platform for sharing validated knowledge native to the social web. QuantNet and the corresponding Data-Driven Documents-based visualization allows readers to reproduce the tables, pictures and calculations inside this Springer book.
Dependent Data in Social Sciences Research
Author: Mark Stemmler
Publisher: Springer Nature
ISBN: 3031563182
Category :
Languages : en
Pages : 785
Book Description
Publisher: Springer Nature
ISBN: 3031563182
Category :
Languages : en
Pages : 785
Book Description
Analysis of Mixed Data
Author: Alexander R. de Leon
Publisher: CRC Press
ISBN: 1439884714
Category : Mathematics
Languages : en
Pages : 264
Book Description
A comprehensive source on mixed data analysis, Analysis of Mixed Data: Methods & Applications summarizes the fundamental developments in the field. Case studies are used extensively throughout the book to illustrate interesting applications from economics, medicine and health, marketing, and genetics. Carefully edited for smooth readability and seamless transitions between chapters All chapters follow a common structure, with an introduction and a concluding summary, and include illustrative examples from real-life case studies in developmental toxicology, economics, medicine and health, marketing, and genetics An introductory chapter provides a "wide angle" introductory overview and comprehensive survey of mixed data analysis Blending theory and methodology, this book illustrates concepts via data from different disciplines. Analysis of Mixed Data: Methods & Applications traces important developments, collates basic results, presents terminology and methodologies, and gives an overview of statistical research applications. It is a valuable resource to methodologically interested as well as subject matter-motivated researchers in many disciplines.
Publisher: CRC Press
ISBN: 1439884714
Category : Mathematics
Languages : en
Pages : 264
Book Description
A comprehensive source on mixed data analysis, Analysis of Mixed Data: Methods & Applications summarizes the fundamental developments in the field. Case studies are used extensively throughout the book to illustrate interesting applications from economics, medicine and health, marketing, and genetics. Carefully edited for smooth readability and seamless transitions between chapters All chapters follow a common structure, with an introduction and a concluding summary, and include illustrative examples from real-life case studies in developmental toxicology, economics, medicine and health, marketing, and genetics An introductory chapter provides a "wide angle" introductory overview and comprehensive survey of mixed data analysis Blending theory and methodology, this book illustrates concepts via data from different disciplines. Analysis of Mixed Data: Methods & Applications traces important developments, collates basic results, presents terminology and methodologies, and gives an overview of statistical research applications. It is a valuable resource to methodologically interested as well as subject matter-motivated researchers in many disciplines.
Dependence in Probability and Statistics
Author: Paul Doukhan
Publisher: Springer Science & Business Media
ISBN: 3642141048
Category : Mathematics
Languages : en
Pages : 222
Book Description
This account of recent works on weakly dependent, long memory and multifractal processes introduces new dependence measures for studying complex stochastic systems and includes other topics such as the dependence structure of max-stable processes.
Publisher: Springer Science & Business Media
ISBN: 3642141048
Category : Mathematics
Languages : en
Pages : 222
Book Description
This account of recent works on weakly dependent, long memory and multifractal processes introduces new dependence measures for studying complex stochastic systems and includes other topics such as the dependence structure of max-stable processes.