Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Rui Yang*, Xiaoman Pan, Feng Luo, Shuang Qiu, Han Zhong, Dong Yu, Jianshu Chen*

*Corresponding author for this work

Research output: Contribution to journal › Conference article published in journal › peer-review

Fingerprint

Dive into the research topics of 'Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment'. Together they form a unique fingerprint.

Computer Science

Psychology