Final Year Project · Medical Imaging · 2026

Balancing Fidelity,
Utility & Privacy in
Synthetic Cardiac MRI

A comparative study benchmarking DDPM, Latent Diffusion Models, and Flow Matching for synthetic cardiac MRI generation - quantifying trade-offs between image quality, segmentation utility, and patient privacy.

DDPM Latent Diffusion Flow Matching Cardiac MRI Synthetic Data Privacy Preservation Segmentation Utility

Research Abstract

Deep learning in cardiac MRI (CMR) is fundamentally constrained by both data scarcity and privacy regulations. This study systematically benchmarks three generative architectures: Denoising Diffusion Probabilistic Models (DDPM), Latent Diffusion Models (LDM), and Flow Matching (FM) for synthetic CMR generation. Utilizing a two-stage pipeline where anatomical masks condition image synthesis, we evaluate generated data across three critical axes: fidelity, utility, and privacy.


Our results show that diffusion-based models, particularly DDPM, provide the most effective balance between downstream segmentation utility, image fidelity, and privacy preservation under limited-data conditions, while FM demonstrates promising privacy characteristics with slightly lower task-level performance. These findings quantify the trade-offs between cross-domain generalization and patient confidentiality, establishing a framework for safe and effective synthetic data augmentation in medical imaging.

3 Generative Architectures Benchmarked
3 Evaluation Axes: Fidelity · Utility · Privacy
2 Stage Synthesis Pipeline
1 Unified Evaluation Framework

System Pipeline

A two-phase pipeline: generative models produce synthetic CMR images from noise conditioned on anatomical masks, then outputs are rigorously evaluated across fidelity, utility, and privacy. Click each model tab to explore its architecture.

Phase 1 Generation
Real Mask
Real Mask
OR
Gaussian Noise
Gaussian Noise
Denoising Diffusion Probabilistic Model · Mask-Conditioned UNet
xT
Noise
↓ DS
UNet
Mid
Block
↑ US
UNet
0
×T steps
Mask Conditioning
cross-attention injected at each diffusion timestep
Latent Diffusion Model · VAE Encoder–Decoder + Latent UNet
zT
Noise
VAE
Encoder ↓
Latent
UNet
VAE
Decoder ↑
0
pixel space
Mask Conditioning
cross-attention in compressed latent space · reduced compute
Flow Matching · ODE Solver + Velocity UNet with ControlNet
x0
Noise
ODE
Solver
Velocity
UNet
x1
data space
ControlNet
Mask Embeddings
MultiScale Residuals
Generated CMR
Synthetic CMR
Phase 2 Evaluation
Distribution
Based Evaluation
FIDKID

Statistical divergence between real and synthetic image distributions at the feature level

Fidelity
Perceptual & Structural Quality
PSNRSSIM MS-SSIMLPIPS FIDKID

Full-reference & no-reference metrics for pixel, structural, and perceptual quality

Utility
Downstream Segmentation Task
Real
Images+Masks
Seg
Model
Synthetic
+Masks
↓ evaluated on real test dataset
Privacy
Patient Confidentiality
Nearest-Neighbour Analysis
L2SSIM LPIPSNNDR
Membership Inference Attack
FCREBinary Clf AUC

Synthetic CMR Images

Generated Results

Generative Models Benchmarked

Efficient
LDM
Latent Diffusion Model

Diffusion process operating in a compressed latent space via a pre-trained encoder-decoder, enabling computationally efficient synthesis with competitive image quality at reduced resource cost.

FidelityGood
UtilityGood
PrivacyModerate
Novel Approach
FM
Flow Matching

Continuous normalizing flow trained with simulation-free objectives. Demonstrates strong privacy characteristics with promising generalization, slightly below DDPM in task utility.

FidelityGood
UtilityModerate
PrivacyStrong

Results & Comparative Analysis

Fidelity Evaluation

Evaluation Metric Diffusion-DDPM Diffusion-LDM Flow Match
SSIM ↑ 0.22 0.18 0.22
MS-SSIM ↑ 0.36 0.33 0.40
PSNR ↑ 10.67 9.95 11.44
FID ↓ 72.52 95.17 108.32
KID ↓ 0.04 0.08 0.098
LPIPS ↓ 0.49 0.51 0.48

↓ lower is better · ↑ higher is better · Teal = best per metric

Evaluation of Segmentation Model

Training Setup M&M Testing ACDC Testing
DiceIoUHD95ASD DiceIoUHD95ASD
M&M (Real) 0.900.842.991.04 0.910.832.890.94
ACDC (Real) 0.910.833.881.23 0.950.872.650.75
DDPM Full-Syn 0.870.805.771.79 0.870.806.281.78
DDPM ACDC-Syn 0.860.795.981.88 0.880.814.721.45
DDPM M&M-Syn 0.890.834.341.41 0.900.844.611.34
LDM Full-Syn 0.870.795.281.74 0.870.804.901.52
LDM ACDC-Syn 0.870.807.942.48 0.880.799.382.43
LDM M&M-Syn 0.890.824.171.40 0.880.822.261.53
FM Full-Syn 0.870.807.802.08 0.880.816.291.81
FM ACDC-Syn 0.820.738.752.90 0.850.767.742.19
FM M&M-Syn 0.850.825.041.64 0.890.825.731.67

Full-Syn: conditioned synthetic masks only  ·  ACDC-Syn: conditioned on ACDC training masks  ·  M&M-Syn: conditioned on M&M training masks  ·  Teal = best among synthetic setups

Privacy Evaluation

Evaluation Metric Diffusion-DDPM Diffusion-LDM Flow Match
Nearest Neighbor L2 ↑ 12.0 10.5 19.0
LPIPS ↑ 0.36 0.37 0.41
NNDR ↑ 0.83 0.85 0.87
ROC_AUC (MIA) ↓ 0.6029 0.580 0.6038

↑ higher is better for L2, LPIPS, NNDR · ↓ ROC-AUC closer to 0.5 indicates stronger privacy

FINDING 01 - Fidelity

DDPM achieves the best distribution-level fidelity with the lowest FID (72.52) and KID (0.04), while Flow Matching leads on pixel-level metrics PSNR (11.44) and MS-SSIM (0.40).

FINDING 02 - Utility

M&M-conditioned synthesis consistently outperforms other synthetic setups. DDPM M&M-Syn achieves the best ACDC Dice (0.90), while LDM M&M-Syn leads on M&M Dice (0.89) and HD95 (4.17).

FINDING 03 - Privacy

Flow Matching provides the strongest privacy with the highest L2 (19.0), LPIPS (0.41), and NNDR (0.87). LDM achieves the lowest MIA ROC-AUC (0.580), closest to the ideal 0.5.

FINDING 04 - Overall

All three models yield MIA AUC scores of 0.58-0.60, confirming robust resistance to re-identification attacks and validating the pipeline for safe synthetic data augmentation in medical imaging.

Team & Supervisors

Team

K.W.I.T. Kumarasinghe
K.W.I.T. Kumarasinghe
E/20/211
A.H.D. Kavya
A.H.D. Kavya
E/20/197
D.M.B. Edirisooriya
D.M.B. Edirisooriya
E/20/093

Supervisors

Dr. Isuru Nawinne
Dr. Isuru Nawinne
Academic Supervisor
Prof. Roshan G. Ragel
Prof. Roshan G. Ragel
Academic Supervisor
Dr. Vajira Thambawita
Dr. Vajira Thambawita
External Supervisor