Abstract
Text2text question classification (TQC) is a foundational task in the question classification (QC) field, with a wide range of applications in both industry and academia, such as intelligent customer service systems. Conventional QC tasks typically rely on one or more user-provided keywords to classify questions. In contrast, TQC problems involve categorizing semantically similar standard questions, which are then represented in short text format. However, due to the limited availability of TQC datasets, the process of manual labeling often results in noisy labels that do not accurately reflect the true class of a question, introducing bias into the training data. Noisy labels can lead to unreliable and uncertain supervised signals, which have a significant negative impact on the performance of models. To tackle these challenges, we propose the Evidential Robust Deep Learning (ERDL) framework, which integrates TQC Contrastive Loss (TCL) and TQC Evidential Learning Loss (TEL) to achieve accurate semantic similarity and handle noisy data in the TQC dataset. Notably, TEL is a novel loss function based on evidential learning that models the output as a Dirichlet distribution to capture the uncertainty resulting from noisy data. We evaluated our framework using four noisy TQC datasets and found that it outperformed relevant baselines, as indicated by the experimental results.
| Original language | English |
|---|---|
| Title of host publication | Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings |
| Editors | Lazaros Iliadis, Antonios Papaleonidas, Plamen Angelov, Chrisina Jayne |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 211-222 |
| Number of pages | 12 |
| ISBN (Print) | 9783031442032 |
| DOIs | |
| Publication status | Published - 2023 |
| Externally published | Yes |
| Event | 32nd International Conference on Artificial Neural Networks, ICANN 2023 - Heraklion, Greece Duration: 26 Sept 2023 → 29 Sept 2023 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 14263 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 32nd International Conference on Artificial Neural Networks, ICANN 2023 |
|---|---|
| Country/Territory | Greece |
| City | Heraklion |
| Period | 26/09/23 → 29/09/23 |
Bibliographical note
Publisher Copyright:© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Keywords
- Evidential learning
- Noisy label
- Text2text question classification
Fingerprint
Dive into the research topics of 'Evidential Robust Deep Learning for Noisy Text2text Question Classification'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver