A tutorial on Bayesian multi-model linear regression with BAS and JASP.
Date
2021-12
Journal Title
Journal ISSN
Volume Title
Repository Usage Stats
views
downloads
Citation Stats
Abstract
Linear regression analyses commonly involve two consecutive stages of statistical inquiry. In the first stage, a single 'best' model is defined by a specific selection of relevant predictors; in the second stage, the regression coefficients of the winning model are used for prediction and for inference concerning the importance of the predictors. However, such second-stage inference ignores the model uncertainty from the first stage, resulting in overconfident parameter estimates that generalize poorly. These drawbacks can be overcome by model averaging, a technique that retains all models for inference, weighting each model's contribution by its posterior probability. Although conceptually straightforward, model averaging is rarely used in applied research, possibly due to the lack of easily accessible software. To bridge the gap between theory and practice, we provide a tutorial on linear regression using Bayesian model averaging in JASP, based on the BAS package in R. Firstly, we provide theoretical background on linear regression, Bayesian inference, and Bayesian model averaging. Secondly, we demonstrate the method on an example data set from the World Happiness Report. Lastly, we discuss limitations of model averaging and directions for dealing with violations of model assumptions.
Type
Department
Description
Provenance
Citation
Permalink
Published Version (Please cite this version)
Publication Info
Bergh, Don van den, Merlise A Clyde, Akash R Komarlu Narendra Gupta, Tim de Jong, Quentin F Gronau, Maarten Marsman, Alexander Ly, Eric-Jan Wagenmakers, et al. (2021). A tutorial on Bayesian multi-model linear regression with BAS and JASP. Behavior research methods, 53(6). pp. 2351–2371. 10.3758/s13428-021-01552-2 Retrieved from https://hdl.handle.net/10161/27371.
This is constructed from limited available data and may be imprecise. To cite this article, please review & use the official citation provided by the journal.
Collections
Scholars@Duke
Merlise Clyde
Model uncertainty and choice in prediction and variable selection problems for linear, generalized linear models and multivariate models. Bayesian Model Averaging. Prior distributions for model selection and model averaging. Wavelets and adaptive kernel non-parametric function estimation. Spatial statistics. Experimental design for nonlinear models. Applications in proteomics, bioinformatics, astro-statistics, air pollution and health effects, and environmental sciences.
Unless otherwise indicated, scholarly articles published by Duke faculty members are made available here with a CC-BY-NC (Creative Commons Attribution Non-Commercial) license, as enabled by the Duke Open Access Policy. If you wish to use the materials in ways not already permitted under CC-BY-NC, please consult the copyright owner. Other materials are made available here through the author’s grant of a non-exclusive license to make their work openly accessible.