Kevin Wu

Wednesday, November 8th, 2023

2:00 pm – 3:00 pm PST 

CCSR 0235

Zoom Link: https://stanford.zoom.us/j/95984592133?pwd=MDRXcyt4eTlCVlZhdHZYZ0prLzBGZz09

Title: Medical AI After Deployment: Data-driven analyses and methods for clinically viable AI

Abstract: Medical AI algorithms have undergone significant development and regulatory approval, with over 600 FDA-approved medical AI devices currently. However, their actual clinical safety and impact remain unclear. First, we analyze FDA submission documents and find that the majority of FDA approvals do not report multi-site evaluation, and nearly none have prospective analyses. Second, we track the occurrences of newly released AI billing codes in a nationwide insurance claims database and find that only a handful of products have meaningful clinical adoption. Finally, we systematically track device updating in FDA submissions and find that the majority of devices have not had updates to model weights since initial approval. Given these limitations, we propose several methods to address common issues with algorithmic deployment. First, we present a framework for understanding the marginal contribution of distribution shifts to overall model degradation. Second, we present a method for efficient missing data collection in the context of fixed models. Finally, we present ways to improve the robustness of evaluating medical LLMs.