Which explanations do clinicians prefer? A comparative evaluation of XAI understandability and actionability in predicting the need for hospitalization.

Publication date: Jul 16, 2025

This study aims to address the gap in understanding clinicians’ attitudes toward explainable AI (XAI) methods applied to machine learning models using tabular data, commonly found in clinical settings. It specifically explores clinicians’ perceptions of different XAI methods from the ALFABETO project, which predicts COVID-19 patient hospitalization based on clinical, laboratory, and chest X-ray at time of presentation to the Emergency Department. The focus is on two cognitive dimensions: understandability and actionability of the explanations provided by explainable-by-design and post-hoc methods. A questionnaire-based experiment was conducted with 10 clinicians from the IRCCS Policlinico San Matteo Foundation in Pavia, Italy. Each clinician evaluated 10 real-world cases, rating predictions and explanations from three XAI tools: Bayesian networks, SHapley Additive exPlanations (SHAP), and AraucanaXAI. Two cognitive statements for each method were rated on a Likert scale, as well as the agreement with the prediction. Two clinicians answered the survey during think-aloud interviews. Clinicians demonstrated generally positive attitudes toward AI, but high compliance rates (86% on average) indicate a risk of automation bias. Understandability and actionability are positively correlated, with SHAP being the preferred method due to its simplicity. However, the perception of methods varies according to specialty and expertise. The findings suggest that SHAP and AraucanaXAI are promising candidates for improving the use of XAI in clinical decision support systems (DSSs), highlighting the importance of clinicians’ expertise, specialty, and setting on the selection and development of supportive XAI advice. Finally, the study provides valuable insights into the design of future XAI DSSs.

Open Access PDF

Concepts Keywords
Araucanaxai Adult
Bias Clinical decision-making
Clinicians COVID-19
Future Female
Italy Hospitalization
Human-AI collaboration
Humans
Interpretability
Italy
Machine Learning
Male
Middle Aged
Questionnaire
SARS-CoV-2
Surveys and Questionnaires
Think-aloud protocol
User study

Semantics

Type Source Name
disease MESH COVID-19
disease MESH Emergency
pathway REACTOME Reproduction
drug DRUGBANK Nonoxynol-9
disease IDO algorithm
disease IDO process
disease MESH chronic obstructive pulmonary disease
disease MESH respiratory failure
disease IDO blood
disease IDO history
disease MESH hypertension
disease MESH cardiovascular disease
disease MESH chronic renal failure
disease MESH stroke
disease MESH ischemic heart disease
disease MESH atrial fibrillation
disease MESH dementia
disease MESH Edema
drug DRUGBANK Flunarizine
drug DRUGBANK L-Valine
drug DRUGBANK Esomeprazole
disease MESH infectious diseases
drug DRUGBANK Coenzyme M
drug DRUGBANK Pentaerythritol tetranitrate
drug DRUGBANK Aspartame

Original Article

(Visited 4 times, 1 visits today)