Variable Screening and Model Selection in Censored Quantile Regression Via Sparse Penalties and Stepwise Refinement

Variable Screening and Model Selection in Censored Quantile Regression Via Sparse Penalties and Stepwise Refinement PDF Author:
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description


A Non-iterative Method for Fitting the Single Index Quantile Regression Model with Uncensored and Censored Data

A Non-iterative Method for Fitting the Single Index Quantile Regression Model with Uncensored and Censored Data PDF Author: Eliana Christou
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description
Quantile regression (QR) is becoming increasingly popular due to its relevance in many scientific investigations. Linear and nonlinear QR models have been studied extensively, while recent research focuses on the single index quantile regression (SIQR) model. Compared to the single index mean regression (SIMR) problem, the fitting and the asymptotic theory of the SIQR model are more complicated due to the lack of closed form expressions for estimators of conditional quantiles. Consequently, existing methods are necessarily iterative. We propose a non-iterative estimation algorithm, and derive the asymptotic distribution of the proposed estimator under heteroscedasticity. For identifiability, we use a parametrization that sets the first coefficient to 1 instead of the typical condition which restricts the norm of the parametric component. This distinction is more than simply cosmetic as it affects, in a critical way, the correspondence between the estimator derived and the asymptotic theory. The ubiquity of high dimensional data has led to a number of variable selection methods for linear/nonlinear QR models and, recently, for the SIQR model. We propose a new algorithm for simultaneous variable selection and parameter estimation applicable also for heteroscedastic data. The proposed algorithm, which is non-iterative, consists of two steps. Step 1 performs an initial variable selection method. Step 2 uses the results of Step 1 to obtain better estimation of the conditional quantiles and, using them, to perform simultaneous variable selection and estimation of the parametric component of the SIQR model. It is shown that the initial variable selection method of Step 1 consistently estimates the relevant variables, and that the estimated parametric component derived in Step 2 satisfies the oracle property. Furthermore, QR is particularly relevant for the analysis of censored survival data as an alternative to proportional hazards and the accelerated failure time models. Such data occur frequently in biostatistics, environmental sciences, social sciences and econometrics. There is a large body of work for linear/nonlinear QR models for censored data, but it is only recently that the SIQR model has received some attention. However, the only existing method for fitting the SIQR model uses an iterative algorithm and no asymptotic theory for the resulting estimator of the Euclidean parameter is given. We propose a new non-iterative estimation algorithm, and derive the asymptotic distribution of the proposed estimator under heteroscedasticity.

Statistical Learning with Sparsity

Statistical Learning with Sparsity PDF Author: Trevor Hastie
Publisher: CRC Press
ISBN: 1498712177
Category : Business & Economics
Languages : en
Pages : 354

Get Book Here

Book Description
Discover New Methods for Dealing with High-Dimensional DataA sparse statistical model has only a small number of nonzero parameters or weights; therefore, it is much easier to estimate and interpret than a dense model. Statistical Learning with Sparsity: The Lasso and Generalizations presents methods that exploit sparsity to help recover the underl

Seamless R and C++ Integration with Rcpp

Seamless R and C++ Integration with Rcpp PDF Author: Dirk Eddelbuettel
Publisher: Springer Science & Business Media
ISBN: 146146868X
Category : Computers
Languages : en
Pages : 236

Get Book Here

Book Description
Rcpp is the glue that binds the power and versatility of R with the speed and efficiency of C++. With Rcpp, the transfer of data between R and C++ is nearly seamless, and high-performance statistical computing is finally accessible to most R users. Rcpp should be part of every statistician's toolbox. -- Michael Braun, MIT Sloan School of Management "Seamless R and C++ integration with Rcpp" is simply a wonderful book. For anyone who uses C/C++ and R, it is an indispensable resource. The writing is outstanding. A huge bonus is the section on applications. This section covers the matrix packages Armadillo and Eigen and the GNU Scientific Library as well as RInside which enables you to use R inside C++. These applications are what most of us need to know to really do scientific programming with R and C++. I love this book. -- Robert McCulloch, University of Chicago Booth School of Business Rcpp is now considered an essential package for anybody doing serious computational research using R. Dirk's book is an excellent companion and takes the reader from a gentle introduction to more advanced applications via numerous examples and efficiency enhancing gems. The book is packed with all you might have ever wanted to know about Rcpp, its cousins (RcppArmadillo, RcppEigen .etc.), modules, package development and sugar. Overall, this book is a must-have on your shelf. -- Sanjog Misra, UCLA Anderson School of Management The Rcpp package represents a major leap forward for scientific computations with R. With very few lines of C++ code, one has R's data structures readily at hand for further computations in C++. Hence, high-level numerical programming can be made in C++ almost as easily as in R, but often with a substantial speed gain. Dirk is a crucial person in these developments, and his book takes the reader from the first fragile steps on to using the full Rcpp machinery. A very recommended book! -- Søren Højsgaard, Department of Mathematical Sciences, Aalborg University, Denmark "Seamless R and C ++ Integration with Rcpp" provides the first comprehensive introduction to Rcpp. Rcpp has become the most widely-used language extension for R, and is deployed by over one-hundred different CRAN and BioConductor packages. Rcpp permits users to pass scalars, vectors, matrices, list or entire R objects back and forth between R and C++ with ease. This brings the depth of the R analysis framework together with the power, speed, and efficiency of C++. Dirk Eddelbuettel has been a contributor to CRAN for over a decade and maintains around twenty packages. He is the Debian/Ubuntu maintainer for R and other quantitative software, edits the CRAN Task Views for Finance and High-Performance Computing, is a co-founder of the annual R/Finance conference, and an editor of the Journal of Statistical Software. He holds a Ph.D. in Mathematical Economics from EHESS (Paris), and works in Chicago as a Senior Quantitative Analyst.

The Oxford Handbook of Applied Nonparametric and Semiparametric Econometrics and Statistics

The Oxford Handbook of Applied Nonparametric and Semiparametric Econometrics and Statistics PDF Author: Jeffrey Racine
Publisher: Oxford University Press
ISBN: 0199857946
Category : Business & Economics
Languages : en
Pages : 562

Get Book Here

Book Description
This volume, edited by Jeffrey Racine, Liangjun Su, and Aman Ullah, contains the latest research on nonparametric and semiparametric econometrics and statistics. Chapters by leading international econometricians and statisticians highlight the interface between econometrics and statistical methods for nonparametric and semiparametric procedures.

Flexible Imputation of Missing Data, Second Edition

Flexible Imputation of Missing Data, Second Edition PDF Author: Stef van Buuren
Publisher: CRC Press
ISBN: 0429960352
Category : Mathematics
Languages : en
Pages : 444

Get Book Here

Book Description
Missing data pose challenges to real-life data analysis. Simple ad-hoc fixes, like deletion or mean imputation, only work under highly restrictive conditions, which are often not met in practice. Multiple imputation replaces each missing value by multiple plausible values. The variability between these replacements reflects our ignorance of the true (but missing) value. Each of the completed data set is then analyzed by standard methods, and the results are pooled to obtain unbiased estimates with correct confidence intervals. Multiple imputation is a general approach that also inspires novel solutions to old problems by reformulating the task at hand as a missing-data problem. This is the second edition of a popular book on multiple imputation, focused on explaining the application of methods through detailed worked examples using the MICE package as developed by the author. This new edition incorporates the recent developments in this fast-moving field. This class-tested book avoids mathematical and technical details as much as possible: formulas are accompanied by verbal statements that explain the formula in accessible terms. The book sharpens the reader’s intuition on how to think about missing data, and provides all the tools needed to execute a well-grounded quantitative analysis in the presence of missing data.

Applied Survival Analysis Using R

Applied Survival Analysis Using R PDF Author: Dirk F. Moore
Publisher: Springer
ISBN: 3319312456
Category : Medical
Languages : en
Pages : 245

Get Book Here

Book Description
Applied Survival Analysis Using R covers the main principles of survival analysis, gives examples of how it is applied, and teaches how to put those principles to use to analyze data using R as a vehicle. Survival data, where the primary outcome is time to a specific event, arise in many areas of biomedical research, including clinical trials, epidemiological studies, and studies of animals. Many survival methods are extensions of techniques used in linear regression and categorical data, while other aspects of this field are unique to survival data. This text employs numerous actual examples to illustrate survival curve estimation, comparison of survivals of different groups, proper accounting for censoring and truncation, model variable selection, and residual analysis. Because explaining survival analysis requires more advanced mathematics than many other statistical topics, this book is organized with basic concepts and most frequently used procedures covered in earlier chapters, with more advanced topics near the end and in the appendices. A background in basic linear regression and categorical data analysis, as well as a basic knowledge of calculus and the R system, will help the reader to fully appreciate the information presented. Examples are simple and straightforward while still illustrating key points, shedding light on the application of survival analysis in a way that is useful for graduate students, researchers, and practitioners in biostatistics.

Ecological Models and Data in R

Ecological Models and Data in R PDF Author: Benjamin M. Bolker
Publisher: Princeton University Press
ISBN: 0691125228
Category : Computers
Languages : en
Pages : 408

Get Book Here

Book Description
Introduction and background; Exploratory data analysis and graphics; Deterministic functions for ecological modeling; Probability and stochastic distributions for ecological modeling; Stochatsic simulation and power analysis; Likelihood and all that; Optimization and all that; Likelihood examples; Standar statistics revisited; Modeling variance; Dynamic models.

SAS and R

SAS and R PDF Author: Ken Kleinman
Publisher: CRC Press
ISBN: 1466584491
Category : Mathematics
Languages : en
Pages : 473

Get Book Here

Book Description
An Up-to-Date, All-in-One Resource for Using SAS and R to Perform Frequent Tasks The first edition of this popular guide provided a path between SAS and R using an easy-to-understand, dictionary-like approach. Retaining the same accessible format, SAS and R: Data Management, Statistical Analysis, and Graphics, Second Edition explains how to easily perform an analytical task in both SAS and R, without having to navigate through the extensive, idiosyncratic, and sometimes unwieldy software documentation. The book covers many common tasks, such as data management, descriptive summaries, inferential procedures, regression analysis, and graphics, along with more complex applications. New to the Second Edition This edition now covers RStudio, a powerful and easy-to-use interface for R. It incorporates a number of additional topics, including using application program interfaces (APIs), accessing data through database management systems, using reproducible analysis tools, and statistical analysis with Markov chain Monte Carlo (MCMC) methods and finite mixture models. It also includes extended examples of simulations and many new examples. Enables Easy Mobility between the Two Systems Through the extensive indexing and cross-referencing, users can directly find and implement the material they need. SAS users can look up tasks in the SAS index and then find the associated R code while R users can benefit from the R index in a similar manner. Numerous example analyses demonstrate the code in action and facilitate further exploration. The datasets and code are available for download on the book’s website.

A Handbook of Statistical Graphics Using SAS ODS

A Handbook of Statistical Graphics Using SAS ODS PDF Author: Geoff Der
Publisher: CRC Press
ISBN: 1466599030
Category : Mathematics
Languages : en
Pages : 250

Get Book Here

Book Description
Easily Use SAS to Produce Your Graphics Diagrams, plots, and other types of graphics are indispensable components in nearly all phases of statistical analysis, from the initial assessment of the data to the selection of appropriate statistical models to the diagnosis of the chosen models once they have been fitted to the data. Harnessing the full graphics capabilities of SAS, A Handbook of Statistical Graphics Using SAS ODS covers essential graphical methods needed in every statistician’s toolkit. It explains how to implement the methods using SAS 9.4. The handbook shows how to use SAS to create many types of statistical graphics for exploring data and diagnosing fitted models. It uses SAS’s newer ODS graphics throughout as this system offers a number of advantages, including ease of use, high quality of results, consistent appearance, and convenient semiautomatic graphs from the statistical procedures. Each chapter deals graphically with several sets of example data from a wide variety of areas, such as epidemiology, medicine, and psychology. These examples illustrate the use of graphic displays to give an overview of data, to suggest possible hypotheses for testing new data, and to interpret fitted statistical models. The SAS programs and data sets are available online.