Abstract
Retinal image segmentation is essential for analyzing retinal structures like vessels and diagnosing retinopathy. However, the inherent intricacy of the retina, along with annotation scarcity and data heterogeneity, presents prevalent challenges in creating accurate and generalizable deep learning models. Diffusion models, while initially developed for image generation, have recently shown great promise for visual perception by leveraging the learned internal representations. However, these diffusion representations, which spread across network blocks (space) and diffusion timesteps (time), potentially suffer from issues like stochastic semantic distortion and cumulative structural blurring, compromising their semantic fidelity to the source image. In this paper, by delving into the generalization property of diffusion models, we propose a novel anchoring inversion strategy to derive diffusion representations that are semantically faithful to the source image from the deterministic trajectory. Furthermore, we introduce a time-space frequency-Aware aggregation interpreter (T&S-FreqAgg) to aggregate the multi-scale and multi-Timestep diffusion representations in a frequency-Aware way for Domain Generalizable Semantic Segmentation (DGSS). Extensive experiments on nine public retinal image datasets demonstrate the superiority of our proposed framework, DiffDGSSv2, over state-of-The-Art methods. Our code will be available at: https://github.com/Xyporz/DiffDGSSv2.
| Original language | English |
|---|---|
| Article number | 11146911 |
| Pages (from-to) | 668 - 680 |
| Journal | IEEE Transactions on Medical Imaging |
| Volume | 45 |
| Issue number | 2 |
| Early online date | 2 Sept 2025 |
| DOIs | |
| Publication status | Published - Feb 2026 |
Bibliographical note
Publisher Copyright:© IEEE. 1982-2012 IEEE.
Keywords
- Retinal image segmentation
- Diffusion models
- Semantically faithful representations
Fingerprint
Dive into the research topics of 'Towards Semantically Faithful Diffusion Representation for Generalizable Retinal Image Segmentation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver