Matrix Product Operator Restricted Boltzmann Machines

Conference Paper (2019)
Author(s)

Cong Chen (The University of Hong Kong)

K. Batselier (TU Delft - Team Jan-Willem van Wingerden, The University of Hong Kong)

Ching Yun Ko (The University of Hong Kong)

Ngai Wong (The University of Hong Kong)

Research Group
Team Jan-Willem van Wingerden
Copyright
© 2019 Cong Chen, K. Batselier, Ching Yun Ko, Ngai Wong
DOI related publication
https://doi.org/10.1109/IJCNN.2019.8851877
More Info
expand_more
Publication Year
2019
Language
English
Copyright
© 2019 Cong Chen, K. Batselier, Ching Yun Ko, Ngai Wong
Research Group
Team Jan-Willem van Wingerden
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.@en
ISBN (print)
978-1-7281-2009-6
ISBN (electronic)
978-1-7281-1985-4
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

A restricted Boltzmann machine (RBM) learns a probability distribution over its input samples and has numerous uses like dimensionality reduction, classification and generative modeling. Conventional RBMs accept vectorized data that dismiss potentially important structural information in the original tensor (multi-way) input. Matrix-variate and tensor-variate RBMs, named MvRBM and TvRBM, have been proposed but are all restrictive by model construction and have weak model expression power. This work presents the matrix product operator RBM (MPORBM) that utilizes a tensor network generalization of Mv/TvRBM, preserves input formats in both the visible and hidden layers, and results in higher expressive power. A novel training algorithm integrating contrastive divergence and an alternating optimization procedure is also developed. Numerical experiments compare the MPORBM with the traditional RBM and MvRBM for data classification and image completion and denoising tasks. The expressive power of the MPORBM as a function of the MPO-rank is also investigated.

Files

08851877.pdf
(pdf | 0.595 Mb)
- Embargo expired in 30-03-2020
License info not available