Abstract
Crowdsourcing has been widely established as a means to enable human computation at large-scale, in particular for tasks that require manual labelling of large sets of data items. Answers obtained from heterogeneous crowd workers are aggregated to obtain a robust result. In this paper, we consider partial-Agreement tasks that are common in many applications such as image tagging and document annotation, where items are assigned sets of labels. Going beyond the state-of-The-Art, we propose a novel Bayesian nonparametric model to aggregate the partial-Agreement answers in a generic way. This model enables us to compute the consensus of partially-sound and partially-complete worker answers, while taking into account mutual relations in labels and different answer sets. An evaluation of our method using real-world datasets reveals that it consistently outperforms the state-of-The-Art in terms of precision, recall, and scalability.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 1749-1750 |
| Number of pages | 2 |
| ISBN (Electronic) | 9781538655207 |
| DOIs | |
| Publication status | Published - 24 Oct 2018 |
| Externally published | Yes |
| Event | 34th IEEE International Conference on Data Engineering, ICDE 2018 - Paris, France Duration: 16 Apr 2018 → 19 Apr 2018 |
Publication series
| Name | Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018 |
|---|
Conference
| Conference | 34th IEEE International Conference on Data Engineering, ICDE 2018 |
|---|---|
| Country/Territory | France |
| City | Paris |
| Period | 16/04/18 → 19/04/18 |
Bibliographical note
Publisher Copyright:© 2018 IEEE.
Keywords
- Answer Aggregation
- Bayesian Models
- Crowdsourcing
- Nonparametric Models