Optimization of Deep Learning Models for inference in low resource environments

  • Siddhesh Thakur
  • , Sarthak Pati
  • , Junwen Wu
  • , Ravi Panchumarthy
  • , Deepthi Karkada
  • , Alexander Kozlov
  • , Vasily Shamporov
  • , Alexander Suslov
  • , Daniil Lyakhov
  • , Maksim Proshin
  • , Prashant Shah
  • , Dimitrios Makris
  • , Spyridon Bakas

Research output: Contribution to journalArticlepeer-review

Abstract

Artificial Intelligence (AI), and particularly deep learning (DL), has shown great promise to revolutionize healthcare. However, clinical translation is often hindered by demanding hardware requirements.

In this study, we assess the effectiveness of optimization techniques for DL models in healthcare applications, targeting varying AI workloads across the domains of radiology, histopathology, and medical RGB imaging, while evaluating across hardware configurations. The assessed AI workloads focus on both segmentation and classification workloads, by virtue of brain extraction in Magnetic Resonance Imaging (MRI), colorectal cancer delineation in Hematoxylin & Eosin (H&E) stained digitized tissue sections, and diabetic foot ulcer classification in RGB images. We quantitatively evaluate model performance in terms of model runtime during inference (including speedup, latency, and memory usage) and model utility on unseen data.

Our results demonstrate that optimization techniques can substantially improve model runtime, without compromising model utility. These findings suggest that optimization techniques can facilitate the clinical translation of AI models in
low-resource environments, making them more practical for real-world healthcare applications even in underserved regions.
Original languageEnglish
Article number110615
Number of pages13
JournalComputers in Biology and Medicine
Volume196
Issue numberB
Early online date26 Jul 2025
DOIs
Publication statusPublished - Sept 2025

Keywords

  • Artificial Intelligence
  • Deep learning
  • Low-resource
  • Medical imaging
  • Optimization

Fingerprint

Dive into the research topics of 'Optimization of Deep Learning Models for inference in low resource environments'. Together they form a unique fingerprint.

Cite this