Abstract
For regression problems that involve many potential predictors, the Bayesian variable selection (BVS) method is a powerful tool. This method associates each model with its posterior probability and achieves excellent prediction performance through Bayesian model averaging. The main challenges of using such models include specifying a suitable prior and computing posterior quantities for inference. We contribute to the literature of BVS modelling in the following aspects. We first propose a new family of priors, called the mnet prior, which is indexed by a few hyperparameters that allow great flexibility in the prior density. The hyperparameters can also be treated as random, so that their values need not be tuned manually, but will instead adapt to the data. Simulation studies are used to demonstrate good prediction and variable selection performances of these models. Secondly, the analytical expression of the posterior distribution is unavailable for the BVS model under the mnet prior in general, as is the case for most BVS models. We develop an adaptive Markov chain Monte Carlo algorithm that facilitates the computation in high-dimensional regression problems. We finally showcase various ways to do inference with BVS models, highlighting a new way to visualize the importance of each predictor along with estimation of the coefficients and their uncertainties. These are demonstrated through the analysis of a breast cancer gene expression dataset.
Original language | English |
---|---|
Pages (from-to) | 180-197 |
Number of pages | 18 |
Journal | Canadian Journal of Statistics |
Volume | 44 |
Issue number | 2 |
DOIs | |
Publication status | Published - 1 Jun 2016 |
Externally published | Yes |
Keywords
- Bayesian computing
- Hyperparameters
- Markov chain Monte Carlo
- Median probability model
- Penalized regression
- Posterior distribution
- Variable selection
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty