Poster Poster Program Therapy Physics

A Smart Calibrated Probabilistic U-Net for Aleatoric and Epistemic Uncertainty Quantification In Medical Image Segmentation

Abstract
Purpose

Image segmentation is a critical yet challenging step in the radiotherapy workflow. Although many AI-based segmentation models report high accuracy, substantial epistemic and aleatoric uncertainties persist. This work proposes a probabilistic U-Net framework to explicitly quantify and calibrate segmentation uncertainty, with the goal of improving reliability for clinical decision-making.

Methods

We developed a Smart Calibrated Probabilistic U-Net (SCP-UNet) that departs from conventional probabilistic U-Net designs that employ separate prior and posterior encoders. The proposed model employs a single encoder with a fixed standard normal prior N(0, 1), eliminating mask-dependent encoding. Bottleneck features are mapped to latent variables through adaptive pooling and linear projections, followed by reparameterized sampling to generate diverse predictions. The model was optimized using a VAE-style ELBO objective combining binary cross-entropy, Dice, KL divergence, and calibration losses. SCP-UNet was trained using AdamW optimizer with mixed-precision optimization and evaluated separately on prostate (4 structures, 42 cases) and head-and-neck (5 structures, 35 cases) datasets using segmentation accuracy, uncertainty metrics, and uncertainty-error correlation.

Results

On the test (validation) sets, mean Dice/HD95 were 0.8556 ± 0.1932/8.77 ± 1.43mm (0.7959 ± 0.1127/8.56 ± 1.12mm) for prostate and 0.7034 ± 0.2483/1.69 ± 0.84mm (0.7246 ± 0.1037/1.78 ±0.56mm) for head-and-neck, respectively. Mean predictive entropy was substantially higher for prostate than for head-and-neck (0.2315 vs 0.00046), corresponding to overall error rates of 0.96% and 29.09%, respectively. Moderate segmentation error-entropy correlations were observed for both sites (0.4484 prostate; 0.3855 head-and-neck). Epistemic uncertainty ratios were 0.0 for prostate and 0.086 for head-and-neck.

Conclusion

The proposed SCP-UNet achieved well-calibrated aleatoric uncertainty for prostate segmentation, whereas head-and-neck segmentation exhibited overconfidence and elevated error rates, indicating suboptimal uncertainty calibration. These findings highlight the dataset- and task-dependent nature of uncertainty quantification. This probabilistic segmentation framework enables explainable uncertainty estimation to support informed segmentation review and quality assurance.

People

Related

Similar sessions

Poster Poster Program
Jul 19 · 07:00
Python-Based Automation Framework for Annual Machine QA Data Archiving In Qatrack+

Annual water-tank measurements help ensure beam characteristics remain consistent with commissioning baselines. However, the lack of a standardized processing workflow and decentralized data storage makes it difficult to analyze...

Syed Bilal Ahmad, PhD
Therapy Physics 0 people interested
Poster Poster Program
Jul 19 · 07:00
User Expectations and Current Availability of HDR Brachytherapy Audits In Europe

The aim of this work was to evaluate the need to implement more dosimetric audits in high‐dose‐rate brachytherapy (HDR-BT) in Europe and to identify which characteristics such audits should meet according to users.

Javier Vijande, PhD Laura Oliver Cañamás
Therapy Physics 0 people interested