S2P-Matching: Self-Supervised Patch-Based Matching Using Transformer for Capsule Endoscopic Images Stitching

Feng Lu, Dao Zhou, Haoyang Chen, Shuai Liu, Xianliang Ling, Lei Zhu, Tingting Gong, Bin Sheng*, Xiaofei Liao, Hai Jin, Ping Li, David Dagan Feng

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

3 Citations (Scopus)

Abstract

The Magnetically Controlled Capsule Endoscopy (MCCE) has a limited shooting range, resulting in capturing numerous fragmented images and an inability to precisely locate and examine the region of interest (ROI) as traditional endoscopy can. Addressing this issue, image stitching around the ROI can be employed to aid in the diagnosis of gastrointestinal (GI) tract conditions. However, MCCE images possess unique characteristics, such as weak texture, close-up shooting, and large angle rotation, presenting challenges to current image-matching methods. In this context, a method named S2P-Matching is proposed for self-supervised patch-based matching in MCCE image stitching. The method involves augmenting the raw data by simulating the capsule endoscopic camera's behavior around the GI tract's ROI. Subsequently, an improved contrast learning encoder is utilized to extract local features, represented as deep feature descriptors. This encoder comprises two branches that extract distinct scale features, which are combined over the channel without manual labeling. The data-driven descriptors are then input into a Transformer model to obtain patch-level matches by learning the globally consented matching priors in the pseudo-ground-truth match pairs. Finally, the patch-level matching is refined and filtered to the pixel-level. The experimental results on real-world MCCE images demonstrate that S2P-Matching provides enhanced accuracy in addressing challenging issues in the GI tract environment with image parallax. The performance improvement can reach up to 203 and 55.8% in terms of NCM (Number of Correct Matches) and SR (Success Rate), respectively. This approach is expected to facilitate the wide adoption of MCCE-based gastrointestinal screening.

Original languageEnglish
Pages (from-to)540-551
Number of pages12
JournalIEEE Transactions on Biomedical Engineering
Volume72
Issue number2
DOIs
Publication statusPublished - 2025

Bibliographical note

Publisher Copyright:
© 1964-2012 IEEE.

Keywords

  • Capsule endoscopy
  • image stitching
  • multi-view simulation
  • patch-level matching
  • self-supervised contrastive learning
  • transformer

Fingerprint

Dive into the research topics of 'S2P-Matching: Self-Supervised Patch-Based Matching Using Transformer for Capsule Endoscopic Images Stitching'. Together they form a unique fingerprint.

Cite this