Understanding metric-related pitfalls in image analysis validation.

Abstract

Validation metrics are key for tracking scientific progress and bridging the current chasm between artificial intelligence research and its translation into practice. However, increasing evidence shows that, particularly in image analysis, metrics are often chosen inadequately. Although taking into account the individual strengths, weaknesses and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multistage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides a reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Although focused on biomedical image analysis, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. The work serves to enhance global comprehension of a key topic in image analysis validation.

Department

Description

Provenance

Subjects

Artificial Intelligence

Citation

Published Version (Please cite this version)

10.1038/s41592-023-02150-0

Publication Info

Reinke, Annika, Minu D Tizabi, Michael Baumgartner, Matthias Eisenmann, Doreen Heckmann-Nötzel, A Emre Kavur, Tim Rädsch, Carole H Sudre, et al. (2024). Understanding metric-related pitfalls in image analysis validation. Nature methods, 21(2). pp. 182–194. 10.1038/s41592-023-02150-0 Retrieved from https://hdl.handle.net/10161/33084.

This is constructed from limited available data and may be imprecise. To cite this article, please review & use the official citation provided by the journal.

Scholars@Duke

Benis

Arriel Benis

Adjunct Associate Professor in the Department of Biomedical Engineering

Dr. Arriel Benis is a researcher and educator working at the intersection of medical informatics, digital health, and artificial intelligence, advancing health systems and biomedical engineering innovation. His work leverages AI, data science, and knowledge management to improve health-related decision-making at the individual, population, and public health levels.

His research focuses on developing data-driven healthcare solutions that enhance patient care, optimize clinical processes, and promote sustainable systems. Dr. Benis has engineered (a) clinical decision support systems with direct patient and healthcare partitioners impact such as ADHD, PTSD, and diabetes patient management and health communication, (b) MIMO -the Medical Informatics and Digital Health Multilingual Ontology- integrating more than 3500 terms and concepts across 30+ languages, actively deployed in healthcare organizations for AI-powered training and international projects support, (c) smart home and smart city health monitoring approach from a One Health viewpoint. Dr. Benis is a pioneer of the One Digital Health framework, which strategically links digital health innovation with environmental monitoring.

His past academic positions include serving as a department head and track director in biomedical and health informatics. He holds various leadership roles in the international medical informatics community, is a fellow of the International Academy for Health Sciences Informatics, and is the Editor-in-Chief of JMIR Medical Informatics. Dr. Benis is committed to training the next generation of innovators in digital health and medical informatics.


Unless otherwise indicated, scholarly articles published by Duke faculty members are made available here with a CC-BY-NC (Creative Commons Attribution Non-Commercial) license, as enabled by the Duke Open Access Policy. If you wish to use the materials in ways not already permitted under CC-BY-NC, please consult the copyright owner. Other materials are made available here through the author’s grant of a non-exclusive license to make their work openly accessible.