Bayesian factorizations of big sparse tensors.
Academic Article
Overview
Abstract
It has become routine to collect data that are structured as multiway arrays (tensors). There is an enormous literature on low rank and sparse matrix factorizations, but limited consideration of extensions to the tensor case in statistics. The most common low rank tensor factorization relies on parallel factor analysis (PARAFAC), which expresses a rank k tensor as a sum of rank one tensors. When observations are only available for a tiny subset of the cells of a big tensor, the low rank assumption is not sufficient and PARAFAC has poor performance. We induce an additional layer of dimension reduction by allowing the effective rank to vary across dimensions of the table. For concreteness, we focus on a contingency table application. Taking a Bayesian approach, we place priors on terms in the factorization and develop an efficient Gibbs sampler for posterior computation. Theory is provided showing posterior concentration rates in high-dimensional settings, and the methods are shown to have excellent performance in simulations and several real data applications.
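For concreteness, here is a minimal sketch of the PARAFAC form referenced in the abstract, written for the contingency-table setting; the notation (probability tensor \(\pi\), weights \(\nu_h\), factor vectors \(\lambda^{(j)}_h\)) is introduced here for illustration and is not taken from this page:

\[
\pi_{c_1 \cdots c_p} \;=\; \sum_{h=1}^{k} \nu_h \prod_{j=1}^{p} \lambda^{(j)}_{h c_j},
\qquad \nu_h \ge 0,\quad \sum_{h=1}^{k} \nu_h = 1,\quad \sum_{c_j = 1}^{d_j} \lambda^{(j)}_{h c_j} = 1,
\]

so each cell probability of a \(d_1 \times \cdots \times d_p\) table is a weighted sum of \(k\) rank-one tensors, each the outer product of \(p\) probability vectors. Under this reading, the additional layer of dimension reduction described above would allow the number of distinct factor vectors to differ across the \(p\) dimensions, rather than fixing a common rank \(k\) for every dimension.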