Automated Medical Image Annotation for Dataset Building

Table of content

  1. Abstract
  2. Related works
  3. Pipeline
  4. Results and Analysis
  5. Conclusion
  6. Publications
  7. Links

Abstract

Medical image annotation to build datasets leverages in many clinical applications such as diagnosis and treatment planning. Automated medical image annotation shows an efficient solution over manual annotation in dataset building. In this work, we focus on automated user-interactive oral image annotation that could perform automated annotations with assistance of user prompts such as text,points and bounding boxes. Meta AI’s Segment Anything Model (SAM) , a vision foundation model trained on the largest segmentation dataset for interactive promptable segmentation with impressive zero-shot performance has increased the potential for medical image segmentation. However, SAM shows limited performance with the images that differ from the trained dataset or images with challenging conditions like irregular regions and boundaries and text-to-mask task seems exploratory.

In this work, we explore a comprehensive study on automating oral image annotation and related work using the foundation models such as SAM, Dino, Grounding Dino,Grounded SAM addressing the above limitations. At the end, we discuss the potential research gaps in automating medical image annotation and propose our methodology to address the identified gaps.

Object Detection Foundation model

Grounding DINO

image

Object Segmentation Foundation model

1) SAM

image

2) MedSAM

Models used for Detection and Segmentation

1) Grounded-SAM

2) TongueSAM

image

Few-shot paradigm

Few-shot keypoint detection

Few-shot Segmentation

1) UniverSeg : Universal Medical Image Segmentation

Pipeline

image

Results and Analysis

MedSAM Results

image image

MedSAM Finetune (Flare 22 CT dataset)

image image

MedSAM Finetune (Tufts Teeth dataset with training)

image

MedSAM Finetune (Tufts Teeth dataset with validation)

image image

Few-shot keypoint detection Results

Episodic Attention Maps

image

Support Images

image

Query

image

Query Prediction

image

UniverSeg Results

image

Increasing the support set size improves the Dice score evaluation of the prediction.

Visualizations

1) WBC dataset Support set samples

image

Test Predictions for varying Support Set Size N

image

3) OASIS dataset Support set samples

image

5) Tufts dataset Support set samples

image

Test Predictions for varying Support Set Size N

image

6) ISIC2018 dataset Support set samples

image

Test Predictions for varying Support Set Size N

image

Conclusion

Publications

Team

Supervisors