Discovering Sparsity Allocation for Layer-wise Pruning of Large Language Models

Lujun LI, Peijie DONG, Zhenheng TANG, Xiang LIU, Qiang WANG, Wenhan LUO, Wei XUE, Qifeng LIU, Xiaowen CHU, Yike GUO*

*Corresponding author for this work

Research output: Contribution to conference > Conference Paper > peer-review

Original language: English
Publication status: Published - Dec 2024
Event: The Thirty-eighth Annual Conference on Neural Information Processing Systems
Duration: 1 Dec 2024 to 1 Dec 2024

Conference

Conference: The Thirty-eighth Annual Conference on Neural Information Processing Systems
Period: 1/12/24 to 1/12/24
