Federated benchmarking of medical artificial intelligence with MedPerf

Alexandros Karargyris; Renato Umeton; Micah J. Sheller; Alejandro Aristizabal; Johnu George; Anna Wuest; Sarthak Pati; Hasan Kassem; Maximilian Zenk; Ujjwal Baid; Prakash Narayana Moorthy; Alexander Chowdhury; Junyi Guo; Sahil Nalawade; Jacob Rosenthal; David Kanter; Maria Xenochristou; Daniel J. Beutel; Verena Chung; Timothy Bergquist; James Eddy; Abubakar Abid; Lewis Tunstall; Omar Sanseviero; Dimitrios Dimitriadis; Yiming Qian; Xinxing Xu; Yong Liu; Rick Siow Mong Goh; Srini Bala; Victor Bittorf; Sreekar Reddy Puchala; Biagio Ricciuti; Soujanya Samineni; Eshna Sengupta; Akshay Chaudhari; Cody Coleman; Bala Desinghu; Gregory Diamos; Debo Dutta; Diane Feddema; Grigori Fursin; Xinyuan Huang; Satyananda Kashyap; Nicholas Lane; Indranil Mallick; Pietro Mascagni; Virendra Mehta; Cassiano Ferro Moraes; Vivek Natarajan; Nikola Nikolov; Nicolas Padoy; Gennady Pekhimenko; Vijay Janapa Reddi; G. Anthony Reina; Pablo Ribalta; Abhishek Singh; Jayaraman J. Thiagarajan; Jacob Albrecht; Thomas Wolf; Geralyn Miller; Huazhu Fu; Prashant Shah; Daguang Xu; Poonam Yadav; David Talby; Mark M. Awad; Jeremy P. Howard; Michael Rosenthal; Luigi Marchionni; Massimo Loda; Jason M. Johnson; Spyridon Bakas; Peter Mattson

doi:10.1038/s42256-023-00652-2

Nature Machine Intelligence

Paper

17 Jul 2023

Federated benchmarking of medical artificial intelligence with MedPerf

Download paper

Abstract

Medical artificial intelligence (AI) has tremendous potential to advance healthcare by supporting and contributing to the evidence-based practice of medicine, personalizing patient treatment, reducing costs, and improving both healthcare provider and patient experience. Unlocking this potential requires systematic, quantitative evaluation of the performance of medical AI models on large-scale, heterogeneous data capturing diverse patient populations. Here, to meet this need, we introduce MedPerf, an open platform for benchmarking AI models in the medical domain. MedPerf focuses on enabling federated evaluation of AI models, by securely distributing them to different facilities, such as healthcare organizations. This process of bringing the model to the data empowers each facility to assess and verify the performance of AI models in an efficient and human-supervised process, while prioritizing privacy. We describe the current challenges healthcare and AI communities face, the need for an open platform, the design philosophy of MedPerf, its current implementation status and real-world deployment, our roadmap and, importantly, the use of MedPerf with multiple international institutions within cloud-based technology and on-premises scenarios. Finally, we welcome new contributions by researchers and organizations to further strengthen MedPerf as an open benchmarking platform.

Conference paper