Semisupervised generalized discriminant analysis

Yu Zhang*, Dit Yan Yeung

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

Abstract

Generalized discriminant analysis (GDA) is a commonly used method for dimensionality reduction. In its general form, it seeks a nonlinear projection that simultaneously maximizes the between-class dissimilarity and minimizes the within-class dissimilarity to increase class separability. In real-world applications where labeled data are scarce, GDA may not work very well. However, unlabeled data are often available in large quantities at very low cost. In this paper, we propose a novel GDA algorithm which is abbreviated as semisupervised generalized discriminant analysis (SSGDA). We utilize unlabeled data to maximize an optimality criterion of GDA and formulate the problem as an optimization problem that is solved using the constrained concave-convex procedure. The optimization procedure leads to estimation of the class labels for the unlabeled data. We propose a novel confidence measure and a method for selecting those unlabeled data points whose labels are estimated with high confidence. The selected unlabeled data can then be used to augment the original labeled dataset for performing GDA. We also propose a variant of SSGDA, called M-SSGDA, which adopts the manifold assumption to utilize the unlabeled data. Extensive experiments on many benchmark datasets demonstrate the effectiveness of our proposed methods.

Original languageEnglish
Pages (from-to)1207-1217
Number of pages11
JournalIEEE Transactions on Neural Networks
Volume22
Issue number8
DOIs
Publication statusPublished - 1 Aug 2011

Keywords

  • Constrained concave-convex procedure
  • dimensionality reduction
  • generalized discriminant analysis
  • semisupervised learning

Fingerprint

Dive into the research topics of 'Semisupervised generalized discriminant analysis'. Together they form a unique fingerprint.

Cite this