Computer Vision Analysis of Surgical Footage

Automated real-time monitoring and assessment of cataract surgical videos.

By Li Ge in Machine Learning Computer Vision

December 6, 2019

Rotation advisor: Yin Li, PhD.

In collaboration with: Stephen K. Sauer, MD.

Motivation

  • Cataracts are the leading cause of blindness in the world, according to the World Health Organization.

  • Cataracts Surgeries are the most common surgical interventions performed in the world (~19M interventions / year).

  • Cataract cases are estimated to increase 78% by 2050.

  • Ophthalmology residents spend a large portion of their training in learning cataract surgery.

  • A key challenge in the training is to develop a systematic and objective assessment of the surgical competency of the residents.

Results

  • The trained Mask R-CNN model achieves outstanding performance detecting pupils and twelve different surgical tools.
  • We are able to monitor important metrics such as pupil deviation and pupil magnification in real-time.
Pupil mAP (IoU=0.50:0.95) AP50 AP75
Segmentation 81.13 96.98 86.19

Full Video Metrics

Surgery Clip Real-Time Monitoring

Dataset

  • CaDIS
  • 4738 images extracted from 25 videos with corresponding semantic annotation.
  • Training: 3582, Validation: 542, Testing: 614.

Method

Posted on:
December 6, 2019
Length:
1 minute read, 155 words
Categories:
Machine Learning Computer Vision
See Also: