Bayesian Poisson calculus for latent feature modeling via generalized Indian buffet process priors

Lancelot F. James*

*Corresponding author for this work

Research output: Contribution to journal, Journal Article, peer-review

18 Citations (Scopus)

Abstract

Statistical latent feature models, such as latent factor models, associate each observation with a vector of latent features. A general problem is how to select the number and types of features, and related quantities. In Bayesian statistical machine learning, one seeks (nonparametric) models in which such quantities can be learned from observed data. The Indian Buffet Process (IBP), devised by Griffiths and Ghahramani (2005), generates a (sparse) latent binary matrix in which each column represents one of a potentially unbounded number of features and each row corresponds to an individual or object. Its generative scheme is cast in terms of customers sequentially entering an Indian buffet restaurant and selecting previously sampled dishes as well as new dishes. Dishes correspond to latent features shared by individuals. The IBP has been applied to a wide range of statistical problems, and recent works have demonstrated the utility of generalizations to nonbinary matrices. The purpose of this work is to describe a unified mechanism for the construction, Bayesian analysis, and practical sampling of broad generalizations of the IBP that generate (sparse) matrices with general entries. An adaptation of the Poisson partition calculus is employed to handle the complexities of these models, including their combinatorial aspects. Our work reveals a spike and slab characterization and presents a general framework for multivariate extensions. We close by highlighting a multivariate IBP with condiments, and the role of a stable-Beta Dirichlet multivariate prior.
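
The restaurant metaphor can be made concrete with a minimal sketch of the classical one-parameter binary IBP. This is standard background, not the generalized construction developed in the paper. It assumes the usual rules: customer i takes each previously sampled dish k with probability m_k / i, where m_k is the number of earlier customers who took it, and then tries a Poisson(alpha / i) number of new dishes. The names `sample_ibp`, `sample_poisson`, and `alpha` are illustrative choices, and the Poisson sampler is a simple stand-in suited only to small rates.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw from Poisson(rate) with Knuth's multiplication method (small rates only)."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def sample_ibp(num_customers, alpha, seed=None):
    """Return a binary matrix (list of rows) drawn from the one-parameter IBP."""
    rng = random.Random(seed)
    dish_counts = []  # dish_counts[k] = number of customers who have taken dish k
    choices = []      # choices[i] = set of dish indices taken by customer i
    for i in range(1, num_customers + 1):
        taken = set()
        # Previously sampled dishes: take dish k with probability m_k / i.
        for k, m_k in enumerate(dish_counts):
            if rng.random() < m_k / i:
                taken.add(k)
        # New dishes: a Poisson(alpha / i) number, appended as new columns.
        for _ in range(sample_poisson(alpha / i, rng)):
            dish_counts.append(0)
            taken.add(len(dish_counts) - 1)
        for k in taken:
            dish_counts[k] += 1
        choices.append(taken)
    num_dishes = len(dish_counts)
    # Rows are customers (individuals), columns are dishes (latent features).
    return [[1 if k in row else 0 for k in range(num_dishes)] for row in choices]


if __name__ == "__main__":
    for row in sample_ibp(num_customers=6, alpha=2.0, seed=1):
        print("".join(str(entry) for entry in row))
```

Each printed row is one customer and each column one dish, so the output is the sparse binary feature matrix the abstract describes. The generalizations treated in the paper replace the binary entries with general entries; that step is not attempted in this sketch.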

Original language: English
Pages (from-to): 2016-2045
Number of pages: 30
Journal: Annals of Statistics
Volume: 45
Issue number: 5
Publication status: Published - Oct 2017

Bibliographical note

Publisher Copyright:
© Institute of Mathematical Statistics, 2017.

Keywords

  • Bayesian statistical machine learning
  • Indian buffet process
  • Nonparametric latent feature models
  • Poisson process calculus
  • Spike and slab priors
