Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching

Shitong Shao, Zeyuan Yin, Muxin Zhou, Xindong Zhang, Zhiqiang Shen*

*Corresponding author for this work

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

Abstract

The lightweight 'local-match-global' matching introduced by SRe2L successfully creates a distilled dataset with comprehensive information on the full 224×224 ImageNetlk. However, this one-sided approach is limited to a particular backbone, layer, and statistics, which limits the improvement of the generalization of a distilled dataset. We suggest that sufficient and various 'local-match-global' matching are more precise and effective than a single one and have the ability to create a distilled dataset with richer information and better generalization ability. We call this perspective 'generalized matching' and propose Generalized Various Backbone and Statistical Matching (G-VBSM) in this work, which aims to create a synthetic dataset with densities, ensuring consistency with the complete dataset across various backbones, layers, and statistics. As experimentally demonstrated, G-VBSM is the first algorithm to obtain strong performance across both small-scale and large-scale datasets. Specifically, G-VBSM achieves performances of 38.7% on CIFAR-I00, 47.6% on Tiny-ImageNet, and 31.4% on the full 224×224 ImageNet1 k, respectivelySettings: CIFAR-I00 with 128-width ConvNet under 10 images per class (lPC), Tiny-ImageNet with ResNet18 under 50 IPC, and ImageNetlk with ResNet18 under 10 IPC.. These results surpass all SOTA methods by margins of 3.9%, 6.5%, and 10.1%, respectively.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024
PublisherIEEE Computer Society
Pages16709-16718
Number of pages10
ISBN (Electronic)9798350353006
DOIs
Publication statusPublished - 2024
Externally publishedYes
Event2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024 - Seattle, United States
Duration: 16 Jun 202422 Jun 2024

Publication series

NameProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN (Print)1063-6919

Conference

Conference2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024
Country/TerritoryUnited States
CitySeattle
Period16/06/2422/06/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • Dataset Condensation
  • Generalized Matching
  • Large-scale Dataset

Fingerprint

Dive into the research topics of 'Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching'. Together they form a unique fingerprint.

Cite this