Poster Poster Program Diagnostic and Interventional Radiology Physics

Multi-Modal Self-Supervised Contrastive Learning for a Foundational Model for Brachytherapy

Abstract
Purpose

This project uses a multi-modal method to train a foundational model for medical images of cervical cancer patients. Multiple imaging modalities are used over the course of treatment: multiparametric MRI, CT, cone-beam CT, and PET when indicated. This motivates a paired approach for deep learning to leverage the shared information across modalities and aim for image features relevant to downstream clinical tasks.

Methods

Contrastive learning (CL) is a method of self-supervised learning (requiring no labeled data) that pushes together ‘positive’ pairs and separates ‘negative’ pairs in the model’s deep features. This project utilizes multi-modal CL, using scans of different modalities for the same patient as positive pairs. CL typically treats all negative pairs equally, but different patients may have similar anatomy and disease progression. To address this, a soft contrastive loss was developed using clinical information to create a similarity metric for negative pairs. As a preliminary experiment, a vision transformer (ViT) pre-trained on 3D medical images was fine-tuned on an in-house MRI and CT dataset.

Results

The training progress of the ViT was evaluated using attention rollout to visualize the attention flow for a given input, highlighting regions most crucial for extracting deep features. Initial epochs show an emphasis on structures with high voxel intensity, while subsequent iterations resulted in a more balanced attention map incorporating the full pelvic region. At 50 epochs, the attention began leaking into the image background, which is indicative of overfitting.

Conclusion

This work shows potential in using multi-modal CL with clinical information to train foundational ViTs. The attention maps demonstrated that relevant anatomical regions were learned by the model in extracting deep features, but overfitting would eventually begin. The model will be evaluated on downstream tasks to examine if these learned features can result in performance improvements.

People

Related

Similar sessions

Poster Poster Program
Jul 19 · 07:00
B-Trac – Breast Tissue Rotation and Compression Apparatus for Calibration

Mammography (compressed 2D) and MRI (uncompressed 3D) capture breast tissue under different conditions, complicating tumor localization across modalities. To bridge this gap, we developed a customizable physical platform to simul...

Dayadna Hernandez Perez
Diagnostic and Interventional Radiology Physics 0 people interested
Poster Poster Program
Jul 19 · 07:00
Comprehensive Medical Physics Assessment of Digital Mammography Equipment: A Three-Year Multi-Site Evaluation of Technical Performance and Radiation Safety at 24 Saudi Arabian Healthcare Institutions (2022–2024)

To conduct a comprehensive multi-center audit evaluating the technical performance, image quality, and radiation safety of digital mammography systems across 24 unique healthcare facilities in Saudi Arabia. This study aims to est...

Sami Alshaikh, PhD
Diagnostic and Interventional Radiology Physics 0 people interested
Poster Poster Program
Jul 19 · 07:00
Starting Small: Implementing a CT Protocol Optimization Program

This talk describes our organization’s CT optimization program, and how we implemented it to make efficient use of limited physicist time.

Robert J. Cropp, PhD
Diagnostic and Interventional Radiology Physics 0 people interested