Skip to main navigation Skip to search Skip to main content

LLM.265: Video Codecs are Secretly Tensor Codecs

  • Ceyu Xu
  • , Yongji Wu
  • , Xinyu Yang
  • , Beidi Chen
  • , Matthew Lentz
  • , Danyang Zhuo
  • , Lisa Wu Wills

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

Abstract

As the parameter size of large language models (LLMs) continues to expand, the need for a large memory footprint and high communication bandwidth have become significant bottlenecks for the training and inference of LLMs. To mitigate these bottlenecks, various tensor compression techniques have been proposed to reduce the data size, thereby alleviating memory requirements and communication pressure. Our research found that video codecs, despite being originally designed for compressing videos, show excellent efficiency when compressing various types of tensors. We demonstrate that video codecs can be versatile and general-purpose tensor codecs while achieving the state-of-the-art compression efficiency in various tasks. We further make use of the hardware video encoding and decoding module available on GPUs to create a framework capable of both inference and training with video codecs repurposed as tensor codecs. Building on insights gained from video codecs, we further show that the hardware of the video codecs can be customized and enhanced to significantly improve tensor encoding/decoding throughput without incurring substantial costs, making it a highly effective solution for large-scale model deployment without requiring significant modifications to the existing GPU architecture.

Original languageEnglish
Title of host publicationMICRO 2025 - 58th IEEE/ACM International Symposium on Microarchitecture
PublisherIEEE Computer Society
Pages445-460
Number of pages16
ISBN (Electronic)9798400715730
DOIs
Publication statusPublished - 17 Oct 2025
Externally publishedYes
Event58th IEEE/ACM International Symposium on Microarchitecture , MICRO 2025 - Seoul, Korea, Republic of
Duration: 18 Oct 202522 Oct 2025

Publication series

NameProceedings of the Annual International Symposium on Microarchitecture, MICRO
VolumePart of 213862
ISSN (Print)1072-4451

Conference

Conference58th IEEE/ACM International Symposium on Microarchitecture , MICRO 2025
Country/TerritoryKorea, Republic of
CitySeoul
Period18/10/2522/10/25

Bibliographical note

Publisher Copyright:
© 2025 Copyright held by the owner/author(s).

Keywords

  • Large Language Models
  • Video Codecs
  • Model Compression
  • Distributed Training

Fingerprint

Dive into the research topics of 'LLM.265: Video Codecs are Secretly Tensor Codecs'. Together they form a unique fingerprint.

Cite this