Evidential Robust Deep Learning for Noisy Text2text Question Classification

Haoran Wang, Jiyao Wang, Yuqiu Chen, Zehua Peng, Zuping Zhang*

*Corresponding author for this work

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

Abstract

Text2text question classification (TQC) is a foundational task in the question classification (QC) field, with a wide range of applications in both industry and academia, such as intelligent customer service systems. Conventional QC tasks typically rely on one or more user-provided keywords to classify questions. In contrast, TQC problems involve categorizing semantically similar standard questions, which are then represented in short text format. However, due to the limited availability of TQC datasets, the process of manual labeling often results in noisy labels that do not accurately reflect the true class of a question, introducing bias into the training data. Noisy labels can lead to unreliable and uncertain supervised signals, which have a significant negative impact on the performance of models. To tackle these challenges, we propose the Evidential Robust Deep Learning (ERDL) framework, which integrates TQC Contrastive Loss (TCL) and TQC Evidential Learning Loss (TEL) to achieve accurate semantic similarity and handle noisy data in the TQC dataset. Notably, TEL is a novel loss function based on evidential learning that models the output as a Dirichlet distribution to capture the uncertainty resulting from noisy data. We evaluated our framework using four noisy TQC datasets and found that it outperformed relevant baselines, as indicated by the experimental results.

Original languageEnglish
Title of host publicationArtificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings
EditorsLazaros Iliadis, Antonios Papaleonidas, Plamen Angelov, Chrisina Jayne
PublisherSpringer Science and Business Media Deutschland GmbH
Pages211-222
Number of pages12
ISBN (Print)9783031442032
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event32nd International Conference on Artificial Neural Networks, ICANN 2023 - Heraklion, Greece
Duration: 26 Sept 202329 Sept 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14263 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference32nd International Conference on Artificial Neural Networks, ICANN 2023
Country/TerritoryGreece
CityHeraklion
Period26/09/2329/09/23

Bibliographical note

Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Keywords

  • Evidential learning
  • Noisy label
  • Text2text question classification

Fingerprint

Dive into the research topics of 'Evidential Robust Deep Learning for Noisy Text2text Question Classification'. Together they form a unique fingerprint.

Cite this