Generative AI Workshops

July 24, 2024

DBDS, TDS, and CPHS sponsor first Epic Cosmos Hackathon

NIgam Shah addresses attendees of the Epic Cosmos Hackathon

Professor Nigam Shah addresses the attendees of the first Epic Cosmos Hackathon.

On July 24th, Stanford Technology & Digital Solutions (TDS), in conjunction with the Department of Biomedical Data Science (DBDS) and the Center for Population Health Sciences (CPHS), sponsored the first Epic Cosmos Hackathon for early adopters at Stanford. The Epic Cosmos dataset, with over 250M patient records, provides an exciting new opportunity for researchers at Stanford to explore data on a nationwide scale. A multitude of potential early use cases currently exist, such as developing rare disease classifiers, exploring health equity, training foundation models on the data, and analyzing associations between medications and particular health conditions. Epic Cosmos has recently been made available to select Stanford researchers and future plans includes a roll out to a broader set of Stanford researchers in the coming year.

Around 60 MD, PhD and other researchers from the Stanford Health Care, Stanford Medicine Children’s Health, Stanford School of Medicine and School of Engineering gathered in the Li Ka Shing Building for this kick-off event, along with Epic Cosmos technical support team members. There was broad representation from clinical departments, affiliates of both the adult and children’s hospitals, and researchers from DBDS, PHS, and the Civil & Environmental Engineering department. The intent was to share best practices, enhance everyone’s skills and familiarity with Cosmos, and foster collaboration in Stanford’s growing Cosmos community.

Opening the event, DBDS Professor and Stanford Health Care Chief Data Scientist Nigam Shah, also a member of the Epic Cosmos Advisory Council, addressed the opportunities for Epic Cosmos. DBDS Professor Roxana Daneshjou also discussed use cases and her experience as one of the first faculty members on campus to get the Data Architect certification which enables her to dive more deeply into the Epic Cosmos data with SQL queries. DBDS Executive Director Karen Matthys shared a short overview of the department and our mission to enable AI research across the School of Medicine.

At this stage, Epic Cosmos is open to faculty in DBDS after going through the onboarding process. Please contact Roxana Daneshjou or Karen Ebert Matthys if you’d like to be involved.

Acknowledgements to Stanford TDS and the Office of the Chief Medical Information Officer, and in particular to Todd Ferris, Albert Chiou, Pronoy Saha, and Anthea Buchin. We also appreciate the strong collaboration with the Epic Cosmos team and look forward to building on the momentum with Epic Cosmos in the next phase of rollout.

 

May 17, 2024

Generative AI in Healthcare VC Panel

Panelists at the Venture Capitalist Forum, May, 2024.

On May 17th, the conference room was filled to capacity for the Generative AI in Healthcare VC Panel, organized by the Department of Biomedical Data Science (DBDS) and the BIODS 295 course. The event, featuring five prominent venture capitalists, attracted not only DBDS members but also graduate students, postdocs, and faculty from various other Stanford departments, underscoring the widespread interest in AI’s impact on healthcare.

The panelists, including Jay Rughani of a16z, Fern Mandelbaum of Emerson Collective, Cheryl Cheng of Vive Collective, Rafic Makki of Mubadala Capital, and Eric Chen of OVO Fund, discussed emerging trends, challenges, and critical success factors for AI-driven healthcare startups. They had sage advice for students, encouraging them while at Stanford go on “dates” to find co-founders and to use their Stanford student status to reach out to anyone anywhere. Moderated by Karen Matthys, Executive Director of DBDS, the lively discussion was followed by a reception that provided further networking opportunities for attendees.

 

 

May 30, 2023

Generative AI Workshop Overview

The Department of Biomedical Data Science held a special Generative AI in Healthcare and Medicine Innovation Workshop on Tuesday May 30th for a packed room of graduate students and faculty across the School of Medicine, Graduate School of Business, Engineering and H&S, along with outside partners from industry. The goals were to bring together interdisciplinary teams and perspectives to ideate on how generative AI can potentially help solve significant real-world healthcare challenges, and to broaden understanding of responsible AI and potential unintended consequences. The d.School design-thinking framework was introduced to the participants – many who had never experienced such a brainstorming session previously.

The workshop had over 180 registrants and started with a standing-room only panel discussion from senior leaders in industry and academia. The panel was moderated by Professor Sylvia Plevritis, Chair of the Dept. of Biomedical Data Science (DBDS) at Stanford, and notable panelists included:

  • Eric Horvitz (alumnus of the Stanford Biomedical Data Science program class of 1990), Chief Scientific Officer of Microsoft and member of President Biden’s Council on Science and Technology
  • Jia Li, Co-Teacher of Stanford Generative AI in Medicine course spring quarter, and Co-Founder of HealthUnity
  • Lori Sherer, Partner at Bain & Company
  • Professor Nigam Shah, Chief Data Scientist at Stanford Healthcare, and faculty in the Department of Biomedical Data Science

The panel focused on solutions and challenges with generative AI in healthcare, and a few key themes emerged:

Promising Opportunities: There are exciting opportunities for generative AI to address critical areas of healthcare and medicine, such as to hasten drug discovery, reduce hospital operational workloads, and to provide more accessible health services to those who often don’t have coverage. Horvitz shared that he is most excited about the applications with protein structures & interactions, such as for new drug discovery. “There are new methods in generative AI, including methods that employ ‘diffusion modeling’ to help design new proteins, such as new vaccines and therapeutics,” he commented. “There are also applications of large-scale language models in the biosciences, that could do synthesis across the wide bioscience literature to do such tasks as help to identify candidates for drug repurposing by reasoning about pathways that link medications, protein expression, and illness.”

ly help solve significant real-world healthcare challenges, and to broaden understanding of responsible AI and potential unintended consequences. The d.School design-thinking framework was introduced to the participants – many who had never experienced such a brainstorming session previously.

With regards to Generative AI in clinical applications, Professor Nigam Shah commented that we tend to get excited about the generative capabilities of language models. He posited that the real value comes not from ability to generate; but rather from the internal representations that these models learn. How does a patient become a vector of 256 numbers, and what else can you do with the vector?  Professor Shah’s research with Microsoft on generative AI in clinical settings is testing the bounds of ChatGPT capabilities: https://hai.stanford.edu/news/how-well-do-large-language-models-support-clinician-information-needsLori Sherer from Bain gave a few statistics: “Overall rate of misdiagnosis in the US is about 20%, leading to roughly 10% of patient deaths. And 30% of patients don’t follow recommended treatment plans.”  Generative AI solutions could potentially reduce error rates and encourage positive healthy behaviors among patients.

There are already some early impressive success cases in applying generative AI to healthcare and medicine, such as DBDS Professor James Zou’s lab that is involved in Generative AI for novel antibiotics, including the first one that’s experimentally validated. (paper in review). However, the panelists agreed that most of the early wins with generative AI in the field will likely come from lower risk applications. Sherer explained that back-office applications are the target with her healthcare and life sciences clients, and Horvitz added that it’s already happening today with large corporations in other industries, including finance.

Unintended Consequences:

Jia Li stressed that responsible AI and inclusiveness are very serious concerns. For example: “The accessibility is quite important for people from rural areas of India, Africa and other countries. The use of generative AI may not be effective unless the knowledge is understandable for these users.” Lori Sherer shared her perspective in consulting, saying “all of our clients very concerned about biases. People are moving ahead with rigor and caution, but they are not waiting for government oversight.” Shah added that “it’s the inappropriate sharing of data that may lead to biases later on.” He also highlighted the confusion about privacy versus security, saying “HIPAA is not about privacy; it is a misunderstood law. It requires you to send patient info to another provider if needed for care delivery.” With regards to security, Nigam posed: “No one wants your data accessible to 3rd parties. What does it mean to have security? There isn’t a framework right now. What is the risk of de-identification?”

Attendees work together at the Generative Artificial Intelligence event

Horvitz shared the analogy of the pilot/co-pilot situation, saying: “we seek designs and mechanisms for human-AI interaction that put the human in the ‘pilot’s seat’ and where AI plays the role of ‘co-pilot’ versus the other way around.” Regarding end-to end-security, Horvitz mentioned that “enterprise-grade cloud computing solutions” now provide organizations with private instances of the foundation models “so that data is not shared—and also enable the use of use of cryptographic methods that allow for HIPAA-compliant communication to and from the AI models, so that sensitive medical data is protected.”

Future Perspective: Many improvements to generative AI are coming and new opportunities lie ahead. On the technology side, Horvitz shared that “very shortly – we’ll see multimodal capabilities of ChatGPT4 – with vision and language together.” Nigam spoke about digital twin technology that could allow specific drugs recommendations for each patient and that have the potential to accelerate biological discovery. With generative AI creating synthetic data feeding a digital twin model, this could be a significant step forward in precision health. Li commented: “We have a lot of silo’d data in healthcare. How can we leverage such data so that gen AI models can gain meaning out of it, and we can personalize the outcome for patients?” She added that Gen AI can leverage multimodal data; but can also generate synthetic data so that privacy can be protected in the studies. “The field is moving so fast and the potential of gen AI in healthcare is still yet to be seen,” she said.

Advice for Students:  Lori Sherer stressed that there are massive opportunities for students with generative AI at every layer of technology development and implementation – from models to applications to change management in healthcare. Horvitz commented: “If I were in school, I’d be going back to neurobiology.” He shared his continued fascination with the brain and advised student to stay flexible and gain skills collaborating with others on projects, reflecting the interdisciplinary nature of this field. Shah added that students need to “understand incentives on a project. Who pays who, how often and for what? If you really want to understand healthcare, you need to know that and not just basic biology.” And Jia Li advised students to “pursue what you’re most excited about, and keep on learning about the latest advancements.  Interdisciplinary knowledge will help you go a long way.”

Following the panel, the audience broke into small table groups to ideate around particular problem statements in healthcare and medicine that could potentially be improved with generative AI. These ranged from challenges in life sciences and drug discovery to individual care delivery to healthcare systems.

The table brainstorming session was led by senior leaders internal and external to Stanford, including Sylvia Plevritis, Faculty and Chair of the Department of Biomedical Data Science, Scott Penberthy, Managing Director of Applied AI at Google, Matt Lungren, Chief Medical Information Officer at Microsoft, Lori Sherer, Partner at Bain & Company, Jia Li, Co-Teacher of Stanford Generative AI in Medicine course spring quarter and Co-Founder of HealthUnity, George Savage, Managing Director of Spring Ridge Ventures, and Peter Ro, Medical Director of Advances in Primary Care Palo Alto Medical Foundation.

The energy in the room was extremely high throughout the entire session and many groups were standing huddled around their flip charts, adding ideas on post-it notes and then clustering the brainstorming.

Here are several examples of the ideation session brainstorming outcome:

I. Problem Statement: Predicting recurrence of breast cancer: how likely is a woman to have a recurrence of estrogen receptor (ER) positive breast cancer? Can we use generative AI to explore other warning signs by continuously monitoring the medical record EHR?

Concrete Example: Sally just got diagnosed with ER positive and HER2 negative breast cancer, the most common type (70% of breast cancer). She has a risk of recurrence that is constant in the next 15 years. What can she do to reduce her risk and reduce her stress level?

Key Insights: The discussion started with focus on biomarkers and clinical trials; however, the main focus of the brainstorming ended up around solutions to reduce stress and anxiety for women in this situation. For the woman who knows she has the same risk for 15 years, the table brainstormed on how to manage the stress level and provide some peace of mind. If there is real-time monitoring in a background system, then the woman may not have to worry as much. Perhaps the stress could be lowered if the woman knows that the burden is on someone else’s shoulders to monitor the situation for them. This system could generate the risk indications and alert their doctors as needed.

2.     Problem Statement: How do we audit large language models for bias and safety? And how do we make LLMs more trustworthy in medical and healthcare applications?

Key Insights: The main key takeaways come from data quality, verification, and transparency, which are three important pillars to understand and evaluate safety and bias concerns in large language models. In particular:

1. We should create better model cards. Model cards should provide organized information about how the model was trained, which data was used, and how it was evaluated. This can help both users and practitioners understand the model’s background and any potential biases involved.

2. Then, we need to evaluate datasets themselves more effectively and develop a quality index to gain a better understanding of the data we are using. This, of course, involves considering factors like how well the data represents different perspectives and whether any biases are present.

3. Finally, we should carry out continuous testing and verification even after the model is deployed. Many times, issues and biases are discovered along the way, and by regularly checking the models, we can identify and address these issues.

3. Problem Statement: Hallucinations and more efficient clinical paperwork/communications: How do we combine genAI, vector databases, information retrieval to limit hallucination in patient-facing systems, e.g., explaining care & benefits, understanding a diagnosis, preparing a clinical note?

Concrete example: The busy oncologist needs to summarize the 60 patients she saw today, writing “authorization approvals” for insurance companies after dinner.

Brainstorming Outcome: The team felt that putting the clinician in charge, and focusing the work on a patient-by-patient basis, was best. The AI would draft, and clinician would check the work. Techniques like rag and react can help reduce dreaming. One key idea to support doctors was an Intuit-style interface like TurboTax. This would still need to be approved by the hospitals for HIPAA etc. The clinician-first, simple & intuitive interface, and analogy to the IRS and banks was really compelling.

4. Problem Statement: Clinical Workflow: How can we successfully implement generative AI solutions into the clinical workflow, overcoming issues with past expert systems?

Concrete Example: Dr. Jones is an experienced internal medicine physician. He has seen many health IT products come and go. Most of all he has faith in his training and experience having managed thousands of patients over 30 years. Why should he trust a tool that tells him what to do clinically?

Key Insights: The brainstorming identified two top potential application areas: (a) Medical notes/voice to text, and (b) Managing at the population/cohort level. The biggest “aha’s” in the discussion were:

  1. Need to automate data capture in the workflow
  2. Change the doctors’ perceptions that the models are a black box: Involve doctors in the process (if it’s a homegrown solution). Let them see the models, how they were produced and validated, etc.
  3. Pushing insights into workflow at right time is important – when will the insights be most valued?

5. Problem Statement: Healthier Diets: How can we leverage generative AI to help people with chronic diseases to have healthier diets?

Concrete Example: Sophie is a patient with Diabetes Type II, who is struggling with her daily diet. She tried to search online and got an overwhelming amount of do’s and don’ts. Some of the recommended food seems very healthy but she doesn’t like the taste. Also there are significant amount of medical terminology that she couldn’t understand.

Key Insights: The goal is to enable “DietGPT – Your Personal Dietitian” to make healthy and enjoyable diet recommendations. It is critical to solve several components/challenges below in order to build trustworthy application: (1) not achieving the goal (healthy but not enjoyable, or vice versa), (2) Bias/equity, (3) Edgecase/risk, (4) Evaluation, (5) Liability and (6) Behavior change/adherence. The team discussed strategies to mitigate the problems such as having humans in the loop, positioning it as education, assistant role etc.

6. Problem Statement: Healthcare Resource Optimization: How can Generative AI be employed to optimize healthcare resource allocation, including staff scheduling, patient prioritization, and supply chain management?

Concrete Example: Dr. Susan, a family physician, runs a small clinic in a suburban neighborhood. She has a dedicated team of medical coders and billers who handle the administrative side of her practice. The medical coders in Susan’s clinic are responsible for translating patient diagnoses, treatments, and procedures into universally recognized medical codes. However, due to the complexity and variability of medical notes, as well as the vast number of codes in systems like the International Classification of Diseases (ICD) and the Current Procedural Terminology (CPT), errors often occur. These errors can lead to claims being denied by insurance companies, resulting in lost revenue for the clinic.

Key Insights:

1.     Start Small and Scale Up: Begin by identifying repetitive tasks or frequent codes as the initial target for piloting the use of generative AI. This approach can help identify any issues early on, limiting their potential impact and making it easier to correct them.

2.     Ensure Seamless Integration with Existing Systems: Engage with EHR vendors or specialized IT consultants to ensure proper integration after pilot testing.

3.     Provide Adequate Training: Both the technical and medical staff need adequate training to understand how to use the AI, troubleshoot problems, and interpret the AI’s outputs. The aim should be to develop a collaborative human-AI workforce, where each plays to their strengths.

4.     Ensure Data Security and Compliance: An AI system’s data security measures should be rigorously tested before it’s implemented.

5.     Establish Clear Lines of Responsibility: Ensure there are clear lines of responsibility among staff for managing the AI system, responding to any discrepancies or errors, and maintaining a high standard of care.

6.     Engage Stakeholders Early: From physicians to coders and billers, everyone who will be affected by the AI system should be involved in the process early on. Their input can help identify potential issues before they become problems, and their buy-in can ensure smoother implementation.

7. Problem Statement: Payer/Insurer Challenge: How could generative AI help to improve communications and relationships with patients? For example, how could it help to decipher explanation-of-benefits notices and laboratory test results for readers unfamiliar with the language used in these communications?

Concrete Example: John is a patient who went to the wrong MRI imaging center (referred by his doctor) and the reimbursement was denied because it was not in-network.  He now has a $2500 bill for a test that the insurance company is refusing to pay. John has a terrible impression of the payer and there have been many calls to sort it out. The provider is also concerned that they will not get paid.

Key Insights: The upshot on the accepting vs. the resisting was the following:

  • Providers and patients would be accepting of solutions to solve this problem; but payers benefit from the confusion and as much as they want better patient/customer relationships, fixing this would result in more claims being paid and that would hurt their bottom line.
  • As for the solution itself we thought there could be passive listening (GenAi in Nuance or something) that listens to the dialog between patient and physician during the appointment and if it seems like the physician is going to order an MRI or even if the dialog makes it seem like that would be required the system automatically looks up the patient’s coverages and then recommends the top 3-4 places the patient should be referred to and send a text message to the patient’s phone with the descriptions of the options and google maps and scheduling functionality so the appointment with an “approved” MRI provider can be made while sitting there in the office
  • We talked about the provider databases being very bad (ever Payer’s website and provider look ups are terrible) so we also brainstormed that the information for the Payer: Provider coverage could actually be an advertising supported database – The providers would pay to be listed and they would be required to include which plans they support so that the info returned to the patient would be accurate and so the providers have a way to market to patients directly and at the point of care
  • We talked about GenAI upstream from the provider visit if a symptom checker could predict the need for an MRI and order that automatically, and get it approved before the appointment so once the patient actually got into the providers’ office the MRI output would already be available to the provider to read and then recommend course of treatment … seemed like a waste of provider time to have them do a 15 minute appointment to order the MRI
  • We also talked about downstream if there were mistakes in the provider database and the patient still went to the wrong provider and who would pay?  We didn’t have a great answer for that but wanted to recognize that the look up might not always be accurate.

Other table group write-ups TBD (do not include at this point):

Problem Statement: Genome Sequencing: How can we use generative AI to synthesize new genomic sequences that either have similar properties to real human sequences, but because they are not real, can be shared without compromising patient privacy, or that have specific properties and could be used to insert new traits into, for example crops?

Key Insights: A process emerges: Data generation and inference; Representations; Synthetic Data; Privacy; Security. [need to elaborate]

—-

Problem Statement: To prevent cognitive decline in people with markers for Alzheimers, how can we come up w/ a healthcare system to support healthy behavior, using generative AI?
Persona Vignette: Patty’s mother lives in a memory-oriented assisted living facility and requires daily care. All of Patty’s aunts have experienced similar cognitive issues, beginning in their 60s. What should Patty do to assess her risk and delay the onset of cognitive decline?

Key Insights: Grouping around 3 areas: (a) Characterize cognitive decline (b) Manage data properly and (c) Make it acceptable to patients & families. Any ideas particularly interesting?

——

Problem Statement: There is a growing population of older patients with complex medical conditions – how can generative AI help care for patients outside of the traditional visit model?
Persona Vignette: Maria is a 72-year-old woman with history of hypertension, arthritis, and diabetes who lives by herself. She has difficulty with ambulation and does not have many friends or family in the area who can help drive her to appointments.
Key Insights: Non-intrusive monitoring system.

—Karen Matthys, DBDS Executive Director