Skip to main content

Pathological Fluid Segmentation in OCT with U-Net

Ablation study on retinal fluid segmentation in OCT images comparing six U-Net configurations. Best configuration (ResNet34 + BCE+Dice): Dice 0.824, IoU 0.716, AUC-ROC 0.993 on 5-fold cross-validation.

Visit website
  • Medical Imaging
  • Semantic Segmentation
  • Ablation Study
  • PyTorch
OCT retinal image with segmented pathological fluid overlay

Research Context

Optical Coherence Tomography (OCT) is the gold standard for retinal diagnosis, but manual segmentation of pathological fluid (DME, SRF, PED) is time-consuming and observer-dependent. Automated pixel-level segmentation enables faster, reproducible clinical decisions.

Ablation Study — Results

ConfigurationDiceIoUAUC-ROC
Scratch + BCE0.7410.6090.981
Scratch + Dice0.7630.6350.984
ResNet34 + BCE0.7980.6720.989
ResNet34 + Dice0.8110.6950.991
ResNet34 + BCE+Dice ★0.8240.7160.993
ResNet34 + BCE+Dice + Aug0.8190.7100.992

Experimental Design

Three factors varied across 6 configurations: encoder initialisation (scratch vs. ImageNet pre-trained ResNet34), loss function (BCE / Dice / combined), and data augmentation. All evaluated with 5-fold stratified cross-validation on 50 annotated OCT volume slices.

Key Findings

  • ImageNet pre-training provides substantial gains even for medical imaging domains — ResNet34 encoder consistently outperforms scratch initialisation
  • Combined BCE+Dice loss outperforms either alone, especially on thin fluid structures with high class imbalance
  • Data augmentation helps generalisation but requires careful tuning on small datasets (50 pairs)
  • Stack: Python · PyTorch · segmentation-models-pytorch · U-Net · ResNet34 · scikit-learn