Discovering classification from data of multiple sources

Charles X. Ling*, Qiang Yang

*Corresponding author for this work

Research output: Contribution to journal, Journal Article, peer-review

10 Citations (Scopus)

Abstract

In many large e-commerce organizations, multiple data sources often describe the same customers, so it is important to consolidate data from these sources for intelligent business decision making. In this paper, we propose a novel method that predicts the classification of data from multiple sources when no source provides class labels. We test our method on artificial and real-world datasets and show that it classifies the data accurately. From the machine learning perspective, our method removes the fundamental assumption in supervised learning that class labels are provided, and bridges the gap between supervised and unsupervised learning.

Original language: English
Pages (from-to): 181-201
Number of pages: 21
Journal: Data Mining and Knowledge Discovery
Volume: 12
Issue number: 2-3
Publication status: Published - May 2006

Keywords

  • Learning classifications from unlabeled data of multiple sources
  • Learning from multiple sources of data
  • New solutions for multiple data source mining
