Abstract
Traditional Chinese medicine (TCM) has relied on specific combinations of herbs in prescriptions to treat various symptoms and signs for thousands of years. Predicting TCM prescriptions poses a fascinating technical challenge with significant practical implications. However, this task faces limitations due to the scarcity of high-quality clinical datasets and the complex relationship between symptoms and herbs. To address these issues, we introduce DigestDS, a novel dataset comprising practical medical records from experienced experts in digestive system diseases. We also propose a method, TCM-FTP (TCM Fine-Tuning Pre-trained), to leverage pre-trained large language models (LLMs) via supervised fine-tuning on DigestDS. Additionally, we enhance computational efficiency using a low-rank adaptation technique. Moreover, TCM-FTP incorporates data augmentation by permuting herbs within prescriptions, exploiting their order-agnostic nature. Impressively, TCM-FTP achieves an F1-score of 0.8031, significantly outperforming previous methods. Furthermore, it demonstrates remarkable accuracy in dosage prediction, achieving a normalized mean square error of 0.0604. In contrast, LLMs without fine-tuning exhibit poor performance. Although LLMs have demonstrated wide-ranging capabilities, our work underscores the necessity of fine-tuning for TCM prescription prediction and presents an effective way to accomplish this.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024 |
| Editors | Mario Cannataro, Huiru Zheng, Lin Gao, Jianlin Cheng, Joao Luis de Miranda, Ester Zumpano, Xiaohua Hu, Young-Rae Cho, Taesung Park |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 4092-4097 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798350386226 |
| DOIs | |
| Publication status | Published - 2024 |
| Externally published | Yes |
| Event | 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024 - Lisbon, Portugal Duration: 3 Dec 2024 → 6 Dec 2024 |
Publication series
| Name | Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024 |
|---|
Conference
| Conference | 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024 |
|---|---|
| Country/Territory | Portugal |
| City | Lisbon |
| Period | 3/12/24 → 6/12/24 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- Fine-tuning
- Herb dosage prediction
- Large language models
- Prescription prediction
- Traditional Chinese medicine
Fingerprint
Dive into the research topics of 'TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription Prediction'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver