Skip to main navigation Skip to search Skip to main content

Towards Semantically Faithful Diffusion Representation for Generalizable Retinal Image Segmentation

  • Yingpeng Xie
  • , Hao Chen
  • , Jing Qin
  • , Yongtao Zhang
  • , Lei Dong
  • , Jie Du
  • , Tianfu Wang*
  • , Baiying Lei*
  • *Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

Abstract

Retinal image segmentation is essential for analyzing retinal structures like vessels and diagnosing retinopathy. However, the inherent intricacy of the retina, along with annotation scarcity and data heterogeneity, presents prevalent challenges in creating accurate and generalizable deep learning models. Diffusion models, while initially developed for image generation, have recently shown great promise for visual perception by leveraging the learned internal representations. However, these diffusion representations, which spread across network blocks (space) and diffusion timesteps (time), potentially suffer from issues like stochastic semantic distortion and cumulative structural blurring, compromising their semantic fidelity to the source image. In this paper, by delving into the generalization property of diffusion models, we propose a novel anchoring inversion strategy to derive diffusion representations that are semantically faithful to the source image from the deterministic trajectory. Furthermore, we introduce a time-space frequency-Aware aggregation interpreter (T&S-FreqAgg) to aggregate the multi-scale and multi-Timestep diffusion representations in a frequency-Aware way for Domain Generalizable Semantic Segmentation (DGSS). Extensive experiments on nine public retinal image datasets demonstrate the superiority of our proposed framework, DiffDGSSv2, over state-of-The-Art methods. Our code will be available at: https://github.com/Xyporz/DiffDGSSv2.

Original languageEnglish
Article number11146911
Pages (from-to)668 - 680
JournalIEEE Transactions on Medical Imaging
Volume45
Issue number2
Early online date2 Sept 2025
DOIs
Publication statusPublished - Feb 2026

Bibliographical note

Publisher Copyright:
© IEEE. 1982-2012 IEEE.

Keywords

  • Retinal image segmentation
  • Diffusion models
  • Semantically faithful representations

Fingerprint

Dive into the research topics of 'Towards Semantically Faithful Diffusion Representation for Generalizable Retinal Image Segmentation'. Together they form a unique fingerprint.

Cite this