Abstract
In many large e-commerce organizations, multiple data sources are often used to describe the same customers, so it is important to consolidate data from these sources for intelligent business decision making. In this paper, we propose a novel method that predicts the classification of data from multiple sources without class labels in each source. We test our method on artificial and real-world datasets, and show that it can classify the data accurately. From the machine learning perspective, our method removes the fundamental assumption in supervised learning that class labels are provided, and bridges the gap between supervised and unsupervised learning.
| Original language | English |
|---|---|
| Pages (from-to) | 181-201 |
| Number of pages | 21 |
| Journal | Data Mining and Knowledge Discovery |
| Volume | 12 |
| Issue number | 2-3 |
| Publication status | Published - May 2006 |
Keywords
- Learning classifications from unlabeled data of multiple sources
- Learning from multiple sources of data
- New solutions for multiple data source mining