Adaptive Statistical Methods for Microbiome Association Analysis

Adaptive Statistical Methods for Microbiome Association Analysis PDF Author: Kalins Banerjee
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description
The importance of human microbiome has been increasingly recognized, and substantial research is being conducted focusing on how microbial communities are associated with human health and diseases. These association studies not only can improve our understanding of the non-genetic components of complex traits and diseases, but also might open up an entirely new way of drug development. Here, we introduce two novel tests for microbiome association studies viz.; Adaptive multivariate two-sample test for Microbiome Differential Analysis (AMDA) and Adaptive Microbiome Association Test (AMAT). AMDA addresses microbiome differential abundance analysis, whereas AMAT provides a flexible microbiome association testing platform under the generalized linear model framework. Our research focuses explicitly on adaptive statistical multivariate analysis tools that are developed using data-driven learning approaches to suit a wide range of possible scenarios. Realizing the susceptibility of existing methods to the adverse effects of noise accumulation, the proposed two-stage adaptive testing frameworks incorporate feature selection as an intermediate step. Extensive simulation studies and real data applications demonstrate that both AMDA and AMAT are often more powerful than several competing methods while preserving the correct type I error rate.

Adaptive Statistical Methods for Microbiome Association Analysis

Adaptive Statistical Methods for Microbiome Association Analysis PDF Author: Kalins Banerjee
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description
The importance of human microbiome has been increasingly recognized, and substantial research is being conducted focusing on how microbial communities are associated with human health and diseases. These association studies not only can improve our understanding of the non-genetic components of complex traits and diseases, but also might open up an entirely new way of drug development. Here, we introduce two novel tests for microbiome association studies viz.; Adaptive multivariate two-sample test for Microbiome Differential Analysis (AMDA) and Adaptive Microbiome Association Test (AMAT). AMDA addresses microbiome differential abundance analysis, whereas AMAT provides a flexible microbiome association testing platform under the generalized linear model framework. Our research focuses explicitly on adaptive statistical multivariate analysis tools that are developed using data-driven learning approaches to suit a wide range of possible scenarios. Realizing the susceptibility of existing methods to the adverse effects of noise accumulation, the proposed two-stage adaptive testing frameworks incorporate feature selection as an intermediate step. Extensive simulation studies and real data applications demonstrate that both AMDA and AMAT are often more powerful than several competing methods while preserving the correct type I error rate.

Adaptive Statistical Methods for Microbiome Association Studies

Adaptive Statistical Methods for Microbiome Association Studies PDF Author: Hyunwook Koh
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Get Book Here

Book Description


Applied Microbiome Statistics

Applied Microbiome Statistics PDF Author: Yinglin Xia
Publisher: CRC Press
ISBN: 1040045669
Category : Mathematics
Languages : en
Pages : 457

Get Book Here

Book Description
This unique book officially defines microbiome statistics as a specific new field of statistics and addresses the statistical analysis of correlation, association, interaction, and composition in microbiome research. It also defines the study of the microbiome as a hypothesis-driven experimental science and describes two microbiome research themes and six unique characteristics of microbiome data, as well as investigating challenges for statistical analysis of microbiome data using the standard statistical methods. This book is useful for researchers of biostatistics, ecology, and data analysts. Presents a thorough overview of statistical methods in microbiome statistics of parametric and nonparametric correlation, association, interaction, and composition adopted from classical statistics and ecology and specifically designed for microbiome research. Performs step-by-step statistical analysis of correlation, association, interaction, and composition in microbiome data. Discusses the issues of statistical analysis of microbiome data: high dimensionality, compositionality, sparsity, overdispersion, zero-inflation, and heterogeneity. Investigates statistical methods on multiple comparisons and multiple hypothesis testing and applications to microbiome data. Introduces a series of exploratory tools to visualize composition and correlation of microbial taxa by barplot, heatmap, and correlation plot. Employs the Kruskal–Wallis rank-sum test to perform model selection for further multi-omics data integration. Offers R code and the datasets from the authors’ real microbiome research and publicly available data for the analysis used. Remarks on the advantages and disadvantages of each of the methods used.

Statistical Analysis of Microbiome Data

Statistical Analysis of Microbiome Data PDF Author: Somnath Datta
Publisher: Springer Nature
ISBN: 3030733513
Category : Medical
Languages : en
Pages : 349

Get Book Here

Book Description
Microbiome research has focused on microorganisms that live within the human body and their effects on health. During the last few years, the quantification of microbiome composition in different environments has been facilitated by the advent of high throughput sequencing technologies. The statistical challenges include computational difficulties due to the high volume of data; normalization and quantification of metabolic abundances, relative taxa and bacterial genes; high-dimensionality; multivariate analysis; the inherently compositional nature of the data; and the proper utilization of complementary phylogenetic information. This has resulted in an explosion of statistical approaches aimed at tackling the unique opportunities and challenges presented by microbiome data. This book provides a comprehensive overview of the state of the art in statistical and informatics technologies for microbiome research. In addition to reviewing demonstrably successful cutting-edge methods, particular emphasis is placed on examples in R that rely on available statistical packages for microbiome data. With its wide-ranging approach, the book benefits not only trained statisticians in academia and industry involved in microbiome research, but also other scientists working in microbiomics and in related fields.

Statistical Methods for the Analysis of Microbiome Data

Statistical Methods for the Analysis of Microbiome Data PDF Author: Anna M. Plantinga
Publisher:
ISBN:
Category :
Languages : en
Pages : 128

Get Book Here

Book Description
The human microbiome plays a vital role in maintaining health, and imbalances in the microbiome are associated with a wide variety of diseases. Understanding whether and how the microbiome is associated with particular health conditions is a focus of many modern microbiome studies, with the hope that a deeper understanding of these associations may lead to more effective prevention and treatment regimens. However, how best to analyze data from microbiome profiling studies remains unclear. The high dimensionality, compositional nature, intrinsic biological structure, and limited availability of samples pose substantial statistical challenges. To face these challenges, we propose novel analytic approaches based on sparse penalized regression strategies and distance-based global association analysis. Most distance-based methods for global microbiome association analysis are restricted to simple dichotomous or quantitative outcomes, but more complex outcomes are increasingly common in microbiome studies. In the first part of this dissertation, we introduce two distance-based methods for the analysis of entire microbial communities in modern microbiome studies. We develop a kernel machine regression-based score test for association between the microbiome and censored time-to-event outcomes. We then propose a novel longitudinal measure of dissimilarity that summarizes changes in the microbiome across time and compares these changes between subjects. Since this dissimilarity may be incorporated into any distance-based analysis framework, it is a highly flexible tool for applying a wide variety of distance-based analyses in longitudinal studies. Identification of associated taxa and detection of predictive microbial signatures are key to translation of microbiome studies. In the second part of this dissertation, we present two penalized regression methods for estimation and prediction with high-dimensional compositional data. Because phylogenetic similarity between bacteria often corresponds to shared functions, our first contribution is to incorporate phylogenetic structure into a penalized regression model for constrained data. We then propose a model that exploits phylogenetic structure to use partial information in the setting of differing feature sets between model-building and prediction datasets. We evaluate the performance of these methods through extensive simulation studies and apply them to studies investigating the association of graft-versus-host disease or body mass index with the gut microbiome.

Statistical and Computational Methods for Microbiome Multi-Omics Data

Statistical and Computational Methods for Microbiome Multi-Omics Data PDF Author: Himel Mallick
Publisher: Frontiers Media SA
ISBN: 2889660915
Category : Science
Languages : en
Pages : 170

Get Book Here

Book Description
This eBook is a collection of articles from a Frontiers Research Topic. Frontiers Research Topics are very popular trademarks of the Frontiers Journals Series: they are collections of at least ten articles, all centered on a particular subject. With their unique mix of varied contributions from Original Research to Review Articles, Frontiers Research Topics unify the most influential researchers, the latest key findings and historical advances in a hot research area! Find out more on how to host your own Frontiers Research Topic or contribute to one as an author by contacting the Frontiers Editorial Office: frontiersin.org/about/contact.

Statistical Methods for Human Microbiome Data Analysis

Statistical Methods for Human Microbiome Data Analysis PDF Author: Jun Chen
Publisher:
ISBN:
Category :
Languages : en
Pages : 107

Get Book Here

Book Description


Statistical Methods for Longitudinal Data Analysis and Reproducible Feature Selection in Human Microbiome Studies

Statistical Methods for Longitudinal Data Analysis and Reproducible Feature Selection in Human Microbiome Studies PDF Author: Lingjing Jiang
Publisher:
ISBN:
Category :
Languages : en
Pages : 101

Get Book Here

Book Description
The microbiome is inherently dynamic, driven by interactions among microbes, with the host, and with the environment. At any point in life, human microbiome can be dramatically altered, either transiently or long term, by diseases, medical interventions or even daily routines. Since the human microbiome is highly dynamic and personalized, longitudinal microbiome studies that sample human-associated microbial communities repeatedly over time provide valuable information for researchers to observe both inter- and intra-individual variability, or to measure changes in response to an intervention in real time. Despite this increasing need in longitudinal data analysis, statistical methods for analyzing sparse longitudinal microbiome data and longitudinal multi-omics data still lag behind. In this dissertation, we describe our efforts in developing two novel statistical methods, Bayesian functional principal components analysis (SFPCA) for sparse longitudinal data analysis, and multivariate sparse functional principal components analysis (mSFPCA) for longitudinal microbiome multi-omics data analysis. Beyond longitudinal data analysis, we are also interested in utilizing statistical techniques for addressing the "reproducibility crisis" in microbiome research, especially in the indispensable task of feature selection. Instead of developing "the best" feature selection method, we focus on discovering a reproducible criterion called Stability for evaluating feature selection methods in order to yield reproducible results in microbiome analysis. To set an appropriate motivation and context for our work, Chapter 1 reviews the importance of longitudinal studies in human microbiome research, and presents the crucial need of developing novel statistical methods to meet the new challenges in longitudinal microbiome data analysis, and of producing reproducible results in microbiome feature selection. Chapter 2 introduces Bayesian SFPCA, a flexible Bayesian approach to SFPCA that enables efficient model selection and graphical model diagnostics for valid longitudinal microbiome applications. Chapter 3 presents mSFPCA, an extension of Bayesian SFPCA from modeling a univariate temporal outcome to simultaneously characterizing multiple temporal measurements, and inferring their temporal associations based on mutual information estimation. Chapter 4 proposes to use reproducibility criterion such as Stability instead of popular model prediction metric such as mean squared error (MSE) to quantify the reproducibility of identified microbial features.

Some Topics on Statistical Analysis of Genetic Imprinting Data and Microbiome Compositional Data

Some Topics on Statistical Analysis of Genetic Imprinting Data and Microbiome Compositional Data PDF Author: Fan Xia
Publisher: Open Dissertation Press
ISBN: 9781361355398
Category :
Languages : en
Pages :

Get Book Here

Book Description
This dissertation, "Some Topics on Statistical Analysis of Genetic Imprinting Data and Microbiome Compositional Data" by Fan, Xia, 夏凡, was obtained from The University of Hong Kong (Pokfulam, Hong Kong) and is being sold pursuant to Creative Commons: Attribution 3.0 Hong Kong License. The content of this dissertation has not been altered in any way. We have altered the formatting in order to facilitate the ease of printing and reading of the dissertation. All rights not granted by the above license are retained by the author. Abstract: Genetic association study is a useful tool to identify the genetic component that is responsible for a disease. The phenomenon that a certain gene expresses in a parent-of-origin manner is referred to as genomic imprinting. When a gene is imprinted, the performance of the disease-association study will be affected. This thesis presents statistical testing methods developed specially for nuclear family data centering around the genetic association studies incorporating imprinting effects. For qualitative diseases with binary outcomes, a class of TDTI* type tests was proposed in a general two-stage framework, where the imprinting effects were examined prior to association testing. On quantitative trait loci, a class of Q-TDTI(c) type tests and another class of Q-MAX(c) type tests were proposed. The proposed testing methods flexibly accommodate families with missing parental genotype and with multiple siblings. The performance of all the methods was verified by simulation studies. It was found that the proposed methods improve the testing power for detecting association in the presence of imprinting. The class of TDTI* tests was applied to a rheumatoid arthritis study data. Also, the class of Q-TDTI(c) tests was applied to analyze the Framingham Heart Study data. The human microbiome is the collection of the microbiota, together with their genomes and their habitats throughout the human body. The human microbiome comprises an inalienable part of our genetic landscape and contributes to our metabolic features. Also, current studies have suggested the variety of human microbiome in human diseases. With the high-throughput DNA sequencing, the human microbiome composition can be characterized based on bacterial taxa relative abundance and the phylogenetic constraint. Such taxa data are often high-dimensional overdispersed and contain excessive number of zeros. Taking into account of these characteristics in taxa data, this thesis presents statistical methods to identify associations between covariate/outcome and the human microbiome composition. To assess environmental/biological covariate effect to microbiome composition, an additive logistic normal multinomial regression model was proposed and a group l1 penalized likelihood estimation method was further developed to facilitate selection of covariates and estimation of parameters. To identify microbiome components associated with biological/clinical outcomes, a Bayesian hierarchical regression model with spike and slab prior for variable selection was proposed and a Markov chain Monte Carlo algorithm that combines stochastic variable selection procedure and random walk metropolis-hasting steps was developed for model estimation. Both of the methods were illustrated using simulations as well as a real human gut microbiome dataset from The Penn Gut Microbiome Project. DOI: 10.5353/th_b5223971 Subjects: Genomic imprinting - Statistical methods Body, Human - Microbiology - Statistical methods

Bioinformatic and Statistical Analysis of Microbiome Data

Bioinformatic and Statistical Analysis of Microbiome Data PDF Author: Yinglin Xia
Publisher: Springer Nature
ISBN: 3031213912
Category : Science
Languages : en
Pages : 717

Get Book Here

Book Description
This unique book addresses the bioinformatic and statistical modelling and also the analysis of microbiome data using cutting-edge QIIME 2 and R software. It covers core analysis topics in both bioinformatics and statistics, which provides a complete workflow for microbiome data analysis: from raw sequencing reads to community analysis and statistical hypothesis testing. It includes real-world data from the authors’ research and from the public domain, and discusses the implementation of QIIME 2 and R for data analysis step-by-step. The data as well as QIIME 2 and R computer programs are publicly available, allowing readers to replicate the model development and data analysis presented in each chapter so that these new methods can be readily applied in their own research. Bioinformatic and Statistical Analysis of Microbiome Data is an ideal book for advanced graduate students and researchers in the clinical, biomedical, agricultural, and environmental fields, as well as those studying bioinformatics, statistics, and big data analysis.