Classification via minimum incremental coding length (MICL)

John Wright*, Yi Ma, Yangyu Tao, Zhouchen Lin, Heung Yeung Shum

*Corresponding author for this work

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

12 Citations (Scopus)

Abstract

We present a simple new criterion for classification, based on principles from lossy data compression. The criterion assigns a test sample to the class that uses the minimum number of additional bits to code the test sample, subject to an allowable distortion. We prove asymptotic optimality of this criterion for Gaussian data and analyze its relationships to classical classifiers. Theoretical results provide new insights into relationships among popular classifiers such as MAP and RDA, as well as unsupervised clustering methods based on lossy compression [13]. Minimizing the lossy coding length induces a regularization effect which stabilizes the (implicit) density estimate in a small-sample setting. Compression also provides a uniform means of handling classes of varying dimension. This simple classification criterion and its kernel and local versions perform competitively against existing classifiers on both synthetic examples and real imagery data such as handwritten digits and human faces, without requiring domain-specific information.

Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 20 - Proceedings of the 2007 Conference
PublisherNeural Information Processing Systems
ISBN (Print)160560352X, 9781605603520
Publication statusPublished - 2008
Externally publishedYes
Event21st Annual Conference on Neural Information Processing Systems, NIPS 2007 - Vancouver, BC, Canada
Duration: 3 Dec 20076 Dec 2007

Publication series

NameAdvances in Neural Information Processing Systems 20 - Proceedings of the 2007 Conference

Conference

Conference21st Annual Conference on Neural Information Processing Systems, NIPS 2007
Country/TerritoryCanada
CityVancouver, BC
Period3/12/076/12/07

Fingerprint

Dive into the research topics of 'Classification via minimum incremental coding length (MICL)'. Together they form a unique fingerprint.

Cite this