Skip to main navigation Skip to search Skip to main content

SENA: Leveraging set-level consistency adversarial learning for robust pre-trained language model adaptation

  • Jianqi Gao
  • , Jian Cao*
  • , Hang Yu
  • , Yonggang Zhang
  • , Zhen Fang
  • *Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

Abstract

Using pre-trained language models (PLM) to generate embeddings for downstream tasks has achieved great success in recent years. The pre-trained embeddings can be adapted to downstream tasks by encouraging the embedding similarity among samples within the same class through auxiliary tasks with contrastive learning (CL) objectives. However, existing methods face two issues: (i) class imbalance and over-representation caused by instance sampling bias in CL, and (ii) gradient conflicts between auxiliary and downstream tasks. To deal with these issues, we propose a novel approach called set-level consistency adversarial learning (SENA). Specifically, SENA leverages on two techniques, i.e., instance-to-set function and consistency adversarial learning, to yield task-specific embeddings. To mitigate the issue of instance sampling bias in CL, SENA incorporates set-level discriminative features into individual instance embeddings by employing an instance-to-set function, which are then employed as prototypes for each category in contrastive learning. Additionally, to tackle gradient conflicts between CL and downstream tasks, SENA first identifies the most inconsistent cases and then eliminates the inconsistency in an adversarial learning manner. SENA is validated on GLUE benchmark and three intent classification datasets. Comprehensive experiments demonstrate the effectiveness of SENA on various tasks.

Original languageEnglish
Article number113831
JournalKnowledge-Based Systems
Volume324
Early online date9 Jun 2025
DOIs
Publication statusPublished - 3 Aug 2025
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2025 Elsevier B.V.

Keywords

  • Pre-trained language models
  • Contrastive learning
  • Gradient conflicts
  • Set-level consistency adversarial learning
  • Instance-to-set function

Fingerprint

Dive into the research topics of 'SENA: Leveraging set-level consistency adversarial learning for robust pre-trained language model adaptation'. Together they form a unique fingerprint.

Cite this