Loading…
Academic Journal
Bayesian model selection via mean-field variational approximation.
Zhang, Yangfan, Yang, Yun
Journal of the Royal Statistical Society: Series B (Statistical Methodology). Jul2024, Vol. 86 Issue 3, p742-770. 29p.
Saved in:
Title | Bayesian model selection via mean-field variational approximation. |
---|---|
Authors | Zhang, Yangfan, Yang, Yun |
Source |
Journal of the Royal Statistical Society: Series B (Statistical Methodology). Jul2024, Vol. 86 Issue 3, p742-770. 29p.
|
Abstract |
This article considers Bayesian model selection via mean-field (MF) variational approximation. Towards this goal, we study the non-asymptotic properties of MF inference that allows latent variables and model misspecification. Concretely, we show a Bernstein–von Mises (BvM) theorem for the variational distribution from MF under possible model misspecification, which implies the distributional convergence of MF variational approximation to a normal distribution centring at the maximal likelihood estimator. Motivated by the BvM theorem, we propose a model selection criterion using the evidence lower bound (ELBO), and demonstrate that the model selected by ELBO tends to asymptotically agree with the one selected by the commonly used Bayesian information criterion (BIC) as the sample size tends to infinity. Compared to BIC, ELBO tends to incur smaller approximation error to the log-marginal likelihood (a.k.a. model evidence) due to a better dimension dependence and full incorporation of the prior information. Moreover, we show the geometric convergence of the coordinate ascent variational inference algorithm, which provides a practical guidance on how many iterations one typically needs to run when approximating the ELBO. These findings demonstrate that variational inference is capable of providing a computationally efficient alternative to conventional approaches in tasks beyond obtaining point estimates. [ABSTRACT FROM AUTHOR]
|
Subject Terms | |
Copyright of Journal of the Royal Statistical Society: Series B (Statistical Methodology) is the property of Oxford University Press / USA and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
|
|
This result is restricted to LU affiliated users only.
Sign in or register for an institutional account to gain full access, if eligible. |