Likelihood-based Density Estimation Using Deep Architectures

Likelihood-based Density Estimation Using Deep Architectures PDF Author: Priyank Jaini
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description
Multivariate density estimation is a central problem in unsupervised machine learning that has been studied immensely in both statistics and machine learning. Several methods have thus been proposed for density estimation including classical techniques like histograms, kernel density estimation methods, mixture models, and more recently neural density estimation that leverages the recent advances in deep learning and neural networks to tractably represent a density function. In today's age, when large amounts of data are being generated in almost every field, it is of paramount importance to develop density estimation methods that are cheap both computationally and in memory cost. The main contribution of this thesis is in providing a principled study of parametric density estimation methods using mixture models and triangular maps for neural density estimation. The first part of the thesis focuses on the compact representation of mixture models using deep architectures like latent tree models, hidden Markov models, tensorial mixture models, hierarchical tensor formats and sum-product networks. It provides a unifying view of possible representations of mixture models using such deep architectures. The unifying view allows us to prove exponential separation between deep mixture models and mixture models represented using shallow architectures, demonstrating the benefits of depth in their representation. In a surprising result thereafter, we prove that a deep mixture model can be approximated using the conditional gradient algorithm by a shallow architecture of polynomial size w.r.t. the inverse of the approximation accuracy. Next, we address the more practical problem of density estimation of mixture models for streaming data by proposing an online Bayesian Moment Matching algorithm for Gaussian mixture models that can be distributed over several processors for fast computation. Exact Bayesian learning of mixture models is intractable because the number of terms in the posterior grows exponentially w.r.t. to the number of observations. We circumvent this problem by projecting the exact posterior on to a simple family of densities by matching a set of sufficient moments. Subsequently, we extend this algorithm for sequential data modeling using transfer learning by learning a hidden Markov model over the observations with Gaussian mixtures. We apply this algorithm on three diverse applications of activity recognition based on smartphone sensors, sleep stage classification for predicting neurological disorders using electroencephalography data and network size prediction for telecommunication networks. In the second part, we focus on neural density estimation methods where we provide a unified framework for estimating densities using monotone and bijective triangular maps represented using deep neural networks. Using this unified framework we study the limitations and representation power of recent flow based and autoregressive methods. Based on this framework, we subsequently propose a novel Sum-of-Squares polynomial flow that is interpretable, universal and easy to train.

Likelihood-based Density Estimation Using Deep Architectures

Likelihood-based Density Estimation Using Deep Architectures PDF Author: Priyank Jaini
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description
Multivariate density estimation is a central problem in unsupervised machine learning that has been studied immensely in both statistics and machine learning. Several methods have thus been proposed for density estimation including classical techniques like histograms, kernel density estimation methods, mixture models, and more recently neural density estimation that leverages the recent advances in deep learning and neural networks to tractably represent a density function. In today's age, when large amounts of data are being generated in almost every field, it is of paramount importance to develop density estimation methods that are cheap both computationally and in memory cost. The main contribution of this thesis is in providing a principled study of parametric density estimation methods using mixture models and triangular maps for neural density estimation. The first part of the thesis focuses on the compact representation of mixture models using deep architectures like latent tree models, hidden Markov models, tensorial mixture models, hierarchical tensor formats and sum-product networks. It provides a unifying view of possible representations of mixture models using such deep architectures. The unifying view allows us to prove exponential separation between deep mixture models and mixture models represented using shallow architectures, demonstrating the benefits of depth in their representation. In a surprising result thereafter, we prove that a deep mixture model can be approximated using the conditional gradient algorithm by a shallow architecture of polynomial size w.r.t. the inverse of the approximation accuracy. Next, we address the more practical problem of density estimation of mixture models for streaming data by proposing an online Bayesian Moment Matching algorithm for Gaussian mixture models that can be distributed over several processors for fast computation. Exact Bayesian learning of mixture models is intractable because the number of terms in the posterior grows exponentially w.r.t. to the number of observations. We circumvent this problem by projecting the exact posterior on to a simple family of densities by matching a set of sufficient moments. Subsequently, we extend this algorithm for sequential data modeling using transfer learning by learning a hidden Markov model over the observations with Gaussian mixtures. We apply this algorithm on three diverse applications of activity recognition based on smartphone sensors, sleep stage classification for predicting neurological disorders using electroencephalography data and network size prediction for telecommunication networks. In the second part, we focus on neural density estimation methods where we provide a unified framework for estimating densities using monotone and bijective triangular maps represented using deep neural networks. Using this unified framework we study the limitations and representation power of recent flow based and autoregressive methods. Based on this framework, we subsequently propose a novel Sum-of-Squares polynomial flow that is interpretable, universal and easy to train.

Probability Density Estimation with Neural Networks and Its Application to Blind Signal Processing

Probability Density Estimation with Neural Networks and Its Application to Blind Signal Processing PDF Author: Amir Sarajedini
Publisher:
ISBN:
Category :
Languages : en
Pages : 390

Get Book Here

Book Description


Maximum Likelihood Density Estimation by Means of a Neural Network

Maximum Likelihood Density Estimation by Means of a Neural Network PDF Author: Jianxiong Wu
Publisher:
ISBN:
Category : Estimation theory
Languages : en
Pages : 21

Get Book Here

Book Description


Maximum Penalized Likelihood Estimation

Maximum Penalized Likelihood Estimation PDF Author: P.P.B. Eggermont
Publisher: Springer Nature
ISBN: 1071612441
Category : Mathematics
Languages : en
Pages : 514

Get Book Here

Book Description
This book deals with parametric and nonparametric density estimation from the maximum (penalized) likelihood point of view, including estimation under constraints. The focal points are existence and uniqueness of the estimators, almost sure convergence rates for the L1 error, and data-driven smoothing parameter selection methods, including their practical performance. The reader will gain insight into technical tools from probability theory and applied mathematics.

Combinatorial Methods in Density Estimation

Combinatorial Methods in Density Estimation PDF Author: Luc Devroye
Publisher: Springer Science & Business Media
ISBN: 1461301254
Category : Mathematics
Languages : en
Pages : 219

Get Book Here

Book Description
Density estimation has evolved enormously since the days of bar plots and histograms, but researchers and users are still struggling with the problem of the selection of the bin widths. This book is the first to explore a new paradigm for the data-based or automatic selection of the free parameters of density estimates in general so that the expected error is within a given constant multiple of the best possible error. The paradigm can be used in nearly all density estimates and for most model selection problems, both parametric and nonparametric.

Math and Architectures of Deep Learning

Math and Architectures of Deep Learning PDF Author: Krishnendu Chaudhury
Publisher: Simon and Schuster
ISBN: 1638350809
Category : Computers
Languages : en
Pages : 550

Get Book Here

Book Description
Shine a spotlight into the deep learning “black box”. This comprehensive and detailed guide reveals the mathematical and architectural concepts behind deep learning models, so you can customize, maintain, and explain them more effectively. Inside Math and Architectures of Deep Learning you will find: Math, theory, and programming principles side by side Linear algebra, vector calculus and multivariate statistics for deep learning The structure of neural networks Implementing deep learning architectures with Python and PyTorch Troubleshooting underperforming models Working code samples in downloadable Jupyter notebooks The mathematical paradigms behind deep learning models typically begin as hard-to-read academic papers that leave engineers in the dark about how those models actually function. Math and Architectures of Deep Learning bridges the gap between theory and practice, laying out the math of deep learning side by side with practical implementations in Python and PyTorch. Written by deep learning expert Krishnendu Chaudhury, you’ll peer inside the “black box” to understand how your code is working, and learn to comprehend cutting-edge research you can turn into practical applications. Foreword by Prith Banerjee. About the technology Discover what’s going on inside the black box! To work with deep learning you’ll have to choose the right model, train it, preprocess your data, evaluate performance and accuracy, and deal with uncertainty and variability in the outputs of a deployed solution. This book takes you systematically through the core mathematical concepts you’ll need as a working data scientist: vector calculus, linear algebra, and Bayesian inference, all from a deep learning perspective. About the book Math and Architectures of Deep Learning teaches the math, theory, and programming principles of deep learning models laid out side by side, and then puts them into practice with well-annotated Python code. You’ll progress from algebra, calculus, and statistics all the way to state-of-the-art DL architectures taken from the latest research. What's inside The core design principles of neural networks Implementing deep learning with Python and PyTorch Regularizing and optimizing underperforming models About the reader Readers need to know Python and the basics of algebra and calculus. About the author Krishnendu Chaudhury is co-founder and CTO of the AI startup Drishti Technologies. He previously spent a decade each at Google and Adobe. Table of Contents 1 An overview of machine learning and deep learning 2 Vectors, matrices, and tensors in machine learning 3 Classifiers and vector calculus 4 Linear algebraic tools in machine learning 5 Probability distributions in machine learning 6 Bayesian tools for machine learning 7 Function approximation: How neural networks model the world 8 Training neural networks: Forward propagation and backpropagation 9 Loss, optimization, and regularization 10 Convolutions in neural networks 11 Neural networks for image classification and object detection 12 Manifolds, homeomorphism, and neural networks 13 Fully Bayes model parameter estimation 14 Latent space and generative modeling, autoencoders, and variational autoencoders A Appendix

A Course in Density Estimation

A Course in Density Estimation PDF Author: Luc Devroye
Publisher: Birkhäuser
ISBN:
Category : Juvenile Nonfiction
Languages : en
Pages : 216

Get Book Here

Book Description


Density Ratio Estimation in Machine Learning

Density Ratio Estimation in Machine Learning PDF Author: Masashi Sugiyama
Publisher: Cambridge University Press
ISBN: 0521190177
Category : Computers
Languages : en
Pages : 343

Get Book Here

Book Description
This book introduces theories, methods and applications of density ratio estimation, a newly emerging paradigm in the machine learning community.

Nonparametric Density Estimation

Nonparametric Density Estimation PDF Author: Luc Devroye
Publisher: New York ; Toronto : Wiley
ISBN:
Category : Mathematics
Languages : en
Pages : 376

Get Book Here

Book Description
This book gives a rigorous, systematic treatment of density estimates, their construction, use and analysis with full proofs. It develops L1 theory, rather than the classical L2, showing how L1 exposes fundamental properties of density estimates masked by L2.

Deep Neural Network Based Crowd Density Estimation for Counting, Detection and Tracking

Deep Neural Network Based Crowd Density Estimation for Counting, Detection and Tracking PDF Author: 康頔
Publisher:
ISBN:
Category : Computer science
Languages : en
Pages : 124

Get Book Here

Book Description