Show
Lab Member
Position: Deciphering the Functions of DNAs, RNAs, and Proteins Should Consider Multi-Modal Large Language Models
Pengtao Xie, Victor Nizet, Lei Wang, Ahmed Alaa, Daniel C. Zielinski, Trey Ideker, Bernhard Palsson
ICML 2026 · Spotlight
ReasonEdit: Editing Vision-Language Models using Human Reasoning
Jiaxing Qiu, Kaihua Hou, Roxana Daneshjou, Ahmed Alaa, Thomas Hartvigsen
ICML · 2026
ER-Reason: A Benchmark Dataset for LLM Clinical Reasoning in the Emergency Room
Nikita Mehandru, Niloufar Golchini, Namrata Garg, Kathy LeSaint, Christopher Nash, Anu Ramachandran, Travis Zack, Liam McCoy, Adam Rodman, David Bamman, Melanie Molina†, Ahmed Alaa(†Co-senior authors)
arXiv preprint · 2026
CheXthought: A Global Multimodal Dataset of Clinical Chain-of-thought Reasoning and Visual Attention for Chest X-ray Interpretation
Sonali Sharma, Jin Long, George Shih, Sarah Eid, Christian Bluethgen, Francine L. Jacobson, Emily B. Tsai, Ahmed M. Alaa, Curtis P. Langlotz, Global Radiology Consortium
arXiv preprint · 2026
Test-Time Hinting for Black-Box Vision-Language Models
Kaihua Hou, Abhijith Varma Mudunuri, Jiaxing Qiu, Roxana Daneshjou, Thomas Hartvigsen, Ahmed Alaa
arXiv preprint · 2026
Causal Effect Estimation with Learned Instrument Representations
Frances Dean*, Jenna Fields*, R. Bhalerao, Marie-Laure Charpignon, Ahmed Alaa (*Co-first authors)
arXiv preprint · 2026
State-Space Modeling in Natural Language
Nikita Mehandru, Marie-Laure Charpignon, Kaihua Hou, David Bamman, Ahmed Alaa
ICLR Workshop on Time Series in the Age of Large Models · 2026
Strategic Feature Selection
Jivat Kaur, Pratik Patil, Divya Shanmugam, Emma Pierson, Michael I. Jordan, Nika Haghtalab, Meena Jagadeesan†, Ahmed Alaa†, Serena Wang† (†Co-senior authors)
arXiv preprint · 2026
Aligning Language Model Benchmarks with Pairwise Preferences
M. Gutierrez, X. Leng, H. Cyberey, J.R. Schwarz, Ahmed Alaa, Thomas Hartvigsen
arXiv preprint · 2026
Advances in LLM Reasoning Enable Flexibility in Clinical Problem-Solving
K. Shidara, P. Prem, J. Kim, A. Podlasek, F. Liu, Ahmed Alaa, D. Bernardo
arXiv preprint · 2026
Artificial Intelligence Surrogate Models to Predict Long-Term Cardiovascular Effects of Immune Checkpoint Inhibitor Therapies Using Electrocardiograms
Frances Dean, J. Barrios, G. Tison, J.J. Moslehi, Ahmed Alaa
Journal of Clinical Oncology · 2026 · Abstract
MedEvalArena: A Self-Generated, Peer-Judged Benchmark for Medical Reasoning
P. Prem, K. Shidara, V. Kuppa, E. Wheeler, F. Liu, Ahmed Alaa, D. Bernardo
medRxiv · 2026
Machine Learning Cross-Platform Proteomic Imputation Enables Protein Quality Scoring and Replication of Epidemiological Associations
Linke Li*, Ahmed Alaa*, Y. Tan, Ilker Demirel, S. Friedman, Q. Zha, R.P. Trac, K.D. Taylor, et al. (*Co-first authors)
bioRxiv · 2026
A Deep Learning Approach to Quantitative PCR that Learns from Ground Truth
Ziad Obermeyer, H. Vu, Alexander Schubert, S. Selvan, P. Giannikopoulos, Ahmed Alaa
Preprint · 2026
Position: Medical Large Language Model Benchmarks Should Prioritize Construct Validity
Ahmed Alaa, Thomas Hartvigsen, Niloufar Golchini, Shiladitya Dutta, Frances Dean, Inioluwa Deborah Raji, Travis Zack
ICML 2025 · Oral Presentation
Generalized Venn and Venn-Abers Calibration with Applications in Conformal Prediction
Lars van der Laan, Ahmed Alaa
ICML · 2025
Model Editing with Graph-Based External Memory
Y.K. Atri, Ahmed Alaa, Thomas Hartvigsen
ACL · 2025
Lifelong Model Editing with Graph-Based External Memory
Y.K. Atri, Ahmed Alaa, Thomas Hartvigsen
ACL Findings · 2025
Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes
L. Zhang, B. Jindal, Ahmed Alaa, R. Weinreb, D. Wilson, E. Segal, J. Zou, Pengtao Xie
Nature Communications · 2025
A Longitudinal Analysis of Declining Medical Safety Messaging in Generative AI Models
Sonali Sharma, Ahmed M. Alaa, Roxana Daneshjou
npj Digital Medicine · 2025
Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning
J. Kim, A. Podlasek, K. Shidara, F. Liu, Ahmed Alaa, D. Bernardo
Scientific Reports · 2025
BioAgents: Bridging the Gap in Bioinformatics Analysis with Multi-Agent Systems
Nikita Mehandru, A.K. Hall, O. Melnichenko, Y. Dubinina, D. Tsirulnikov, Ahmed Alaa, et al.
Scientific Reports · 2025
Viability of Machine Translation for Healthcare in Low-Resourced Languages
H.H. Nigatu, Nikita Mehandru, N.H. Abadi, B. Gebremeskel, Ahmed Alaa, et al.
EMNLP · 2025
Hybrid Meta-learners for Estimating Heterogeneous Treatment Effects
Zhongyuan Liang, Lars van der Laan, Ahmed Alaa
arXiv preprint · 2025
Conformal Prediction Sets with Improved Conditional Coverage using Trust Scores
Jivat Kaur, Michael I. Jordan, Ahmed Alaa
arXiv preprint · 2025
Lifelong Knowledge Editing Requires Better Regularization
A. Gupta, P. Prateepamornkul, M. Lu, Ahmed Alaa, Thomas Hartvigsen, et al.
arXiv preprint · 2025
Data Reuse Enables Cost-Efficient Randomized Trials of Medical AI Models
M. Nercessian, W. Zhang, Alexander Schubert, D. Yang, M. Chung, Ahmed Alaa†, Adam Yala† (†Co-senior authors)
arXiv preprint · 2025
Reliable Agent Engineering Should Integrate Machine-Compatible Organizational Principles
R.P. Xian, G.A. Gabison, Ahmed Alaa, C. Riedl, G.G. Chrysos
arXiv preprint · 2025
Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing
A. Gupta, C. Fang, A. Ozdemir, M. Lu, Ahmed Alaa, Thomas Hartvigsen, et al.
AAAI Workshop on Knowledgeable Foundation Models · 2025
Physician-versus Large Language Model-Generated Summaries in the Emergency Department
Niloufar Golchini, Nikita Mehandru, Ahmed Alaa†, Melanie Molina† (†Co-senior authors)
medRxiv · 2025
Extracting TNFi Switching Reasons and Trajectories From Real-World Data Using Large Language Models
Brenda Y. Miao, M. Binvignat, A. Garcia-Agundez, M. Bravo, C.Y.K. Williams, Ahmed Alaa, et al.
medRxiv · 2025
Self-Calibrating Conformal Prediction
Lars van der Laan, Ahmed Alaa
NeurIPS · 2024
Med-Real2Sim: Non-Invasive Medical Digital Twins using Physics-Informed Self-Supervised Learning
Keying Kuang, Frances Dean, Jack B. Jedlicki, David Ouyang, Anthony Philippakis†, David Sontag†, Ahmed Alaa(†Co-senior authors)
NeurIPS · 2024
Prediction-Powered Generalization of Causal Inferences
Ilker Demirel, Ahmed Alaa, Anthony Philippakis, David Sontag
ICML · 2024
Mean-Field Chaos Diffusion Models
Sungwoo Park, Dongjun Kim, Ahmed Alaa
ICML 2024 · Oral Presentation
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists
Yulu Gan, Sungwoo Park, Alexander Schubert, Anthony Philippakis, Ahmed M. Alaa
ICLR · 2024
Evaluating Large Language Models as Agents in the Clinic
Nikita Mehandru, Brenda Miao, Eduardo Rodriguez Almaraz, Madhumita Sushil, Atul Butte, Ahmed Alaa
NPJ Digital Medicine · 2024
Artificial Intelligence–Based Copilots to Generate Causal Evidence
M. Petersen, Ahmed Alaa, E. Kıcıman, C. Holmes, M. van der Laan
NEJM AI · 2024
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care
IEEE Journal of Biomedical and Health Informatics · 2024
A Machine Learning Approach for Predicting Textbook Outcome After Cytoreductive Surgery and Hyperthermic Intraperitoneal Chemotherapy
A. Ashraf Ganjouei, F. Romero-Hernandez, J.J. Wang, A. Hamed, Ahmed Alaa, et al.
World Journal of Surgery · 2024
Generating New Drug Repurposing Hypotheses Using Disease-Specific Hypergraphs
Ayush Jain, Marie-Laure Charpignon, Irene Y. Chen, Anthony Philippakis, Ahmed Alaa
Pacific Symposium on Biocomputing · 2024
Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding
Shenghuan Sun, Greg M. Goldgof, Alexander Schubert, Z. Sun, Thomas Hartvigsen, Atul J. Butte, Ahmed Alaa
NeurIPS Workshop on Multimodal Algorithmic Reasoning · 2024
Veridical Data Science for Medical Foundation Models
Ahmed Alaa, Bin Yu
arXiv preprint · 2024
Large Language Models as Co-Pilots for Causal Inference in Medical Studies
Ahmed Alaa, R.V. Phillips, E. Kıcıman, L.B. Balzer, M. van der Laan, M. Petersen
arXiv preprint · 2024
Seq-to-Final: A Benchmark for Tuning from Sequential Distributions to a Final Time Point
C.X. Ji, Ahmed M. Alaa, David Sontag
arXiv preprint · 2024
EEG-GPT: Exploring Capabilities of Large Language Models for EEG Classification and Interpretation
J.W. Kim, Ahmed Alaa, D. Bernardo
arXiv preprint · 2024
Generation of Guideline-Based Clinical Decision Trees in Oncology Using Large Language Models
Brenda Y. Miao, Eduardo Rodriguez Almaraz, A. Ashraf Ganjouei, A. Suresh, Travis Zack, Ahmed Alaa, et al.
medRxiv · 2024
Deep Learning-Derived Splenic Radiomics, Genomics, and Coronary Artery Disease
M. Kamineni, V.K. Raghu, B. Truong, Ahmed Alaa, A. Schuermans, S.F. Friedman, et al.
medRxiv · 2024
Conformal Meta-Learners for Predictive Inference of Individual Treatment Effects
Ahmed Alaa, Zaid Ahmad, Mark van der Laan
NeurIPS 2023 · Oral Presentation
DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology
Marco Aversa, Gabriel Nobis, Miriam Hägele, Kai Standvoss, Mihaela Chirica, Roderick Murray-Smith, Ahmed M. Alaa, Lukas Ruff, Daniela Ivanova, Wojciech Samek, Frederick Klauschen, Bruno Sanguinetti, Luis Oala
NeurIPS 2023 · Spotlight
Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback
Shenghuan Sun, Greg Goldgof, Atul Butte, Ahmed M. Alaa
NeurIPS 2023 · Spotlight
Conformalized Unconditional Quantile Regression
Ahmed M. Alaa, Z. Hussain, David Sontag
AISTATS · 2023
PREDICT Underestimates Survival of Patients with HER2-Positive Early-Stage Breast Cancer
Ahmed M. Alaa, A.L. Harris, Mihaela van der Schaar
npj Breast Cancer · 2023
External Validity of Machine Learning-Based Prognostic Scores for Cystic Fibrosis: A Retrospective Study Using the UK and Canadian Registries
Y. Qin, Ahmed Alaa, A. Floto, Mihaela van der Schaar
PLOS Digital Health · 2023
Large-Scale Study of Temporal Shift in Health Insurance Claims
C.X. Ji, Ahmed M. Alaa, David Sontag
Conference on Health, Inference, and Learning (CHIL) · 2023
Generating Drug Repurposing Hypotheses through the Combination of Disease-Specific Hypergraphs
Ayush Jain, Marie-Laure Charpignon, Irene Y. Chen, Anthony Philippakis, Ahmed Alaa
Machine Learning for Health (ML4H) · 2023
Estimating Uncertainty in Multimodal Foundation Models using Public Internet Data
Shiladitya Dutta, H. Wei, Lars van der Laan, Ahmed M. Alaa
NeurIPS Workshop on Robustness of Few-shot and Zero-shot Models · 2023
Identifying Splenic Radiomics Features Associated with Risk of Coronary Artery Disease
M. Kamineni, Z. Yu, V.K. Raghu, Ahmed M. Alaa, A. Schuermans, S.F. Friedman, et al.
Circulation · 2023 · Abstract
How Faithful is your Synthetic Data? Sample-Level Metrics for Evaluating and Auditing Generative Models
Ahmed M. Alaa, Boris van Breugel, Evgeny Saveliev, Mihaela van der Schaar
ICML · 2022
ETAB: A Benchmark Suite for Visual Representation Learning in Echocardiography
Ahmed Alaa, Anthony Philippakis, David Sontag
NeurIPS · 2022
Mining for Informative Signals in Biological Sequences
Nature Machine Intelligence · 2022