The Zig-Zag process and super-efficient sampling for Bayesian analysis of big data

None, None; None, None; None, None

The Zig-Zag process and super-efficient sampling for Bayesian analysis of big data

Journal Article (2019)

Author(s)

G.N.J.C. Bierkens (TU Delft - Statistics)

Paul Fearnhead (Lancaster University)

Gareth Roberts (University of Warwick)

Research Group

Statistics

Copyright

DOI related publication

https://doi.org/10.1214/18-AOS1715

MCMC Piecewise deterministic Markov process Nonreversible Markov process Stochastic gradient Langevin dynamics Sub-sampling Exact sampling

To reference this document use:

https://resolver.tudelft.nl/uuid:c8aa0421-e47f-48c3-8611-9bd8a55d6f2c

More Info

expand_more

Publication Year

2019

Language

English

Copyright

Research Group

Statistics

Issue number

3

Volume number

47

Pages (from-to)

1288-1320

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Standard MCMC methods can scale poorly to big data settings due to the need to evaluate the likelihood at each iteration. There have been a number of approximate MCMC algorithms that use sub-sampling ideas to reduce this computational burden, but with the drawback that these algorithms no longer target the true posterior distribution. We introduce a new family of Monte Carlo methods based upon a multidimensional version of the Zig-Zag process of [Ann. Appl. Probab. 27 (2017) 846–882], a continuous-time piecewise deterministic Markov process. While traditional MCMC methods are reversible by construction (a property which is known to inhibit rapid convergence) the Zig-Zag process offers a flexible nonreversible alternative which we observe to often have favourable convergence properties. We show how the Zig-Zag process can be simulated without discretisation error, and give conditions for the process to be ergodic. Most importantly, we introduce a sub-sampling version of the Zig-Zag process that is an example of an exact approximate scheme, that is, the resulting approximate process still has the posterior as its stationary distribution. Furthermore, if we use a control-variate idea to reduce the variance of our unbiased estimator, then the Zig-Zag process can be super-efficient: after an initial preprocessing step, essentially independent samples from the posterior distribution are obtained at a computational cost which does not depend on the size of the data.

Files

AOS1715.pdf

(pdf | 0.758 Mb)

License info not available