A novel machine learning architecture to improve classification of intermediate cases in health: workflow and case study for public health.

Publication date: Jul 16, 2025

The practice of medicine has evolved significantly during the past decade, with the emergence of Machine Learning (ML) that offers the opportunity of personalized patient-tailored care. However, ML models still face some challenges when classifying patients where clear-cut boundaries between classes are hard to identify. In this work, we propose an ML architecture to improve the sensitivity of detecting patients in intermediate “hard-to-classify” classes. The proposed architecture replaces a single classifier with a group of cascaded increasingly specialized classifiers: the ‘Human-like’, the ‘Segregating’, and the ‘Deep’ classifiers. Its effectiveness is tested, using 8 ML algorithms (Logistic Regression, Support Vector Machine, K-Nearest Neighbor, Decision Tree, Random Forest, XGBoost, CatBoost, and Artificial Neural Network) to predict the feeling of protection among healthcare workers during the COVID-19 pandemic, based on a global online survey, then validated on two other outputs. The results show, for most algorithms, an enhanced detection of data points belonging to intermediate classes (up to 14% absolute increase in accuracy), as well as an overall improvement in the models’ accuracies (up to 5. 8% absolute increase). The validation experiments yielded similar results with improved accuracies for most algorithms when compared to the single classifier architecture. This novel architecture is proving to be a very promising tool for improving accuracy of the models when classifying patients in intermediate classes, regardless of the algorithm used. Accuracy-improvement for likert-type scale measures offers an opportunity for rapidly identifying “risk-profiles” during emergencies and beyond. This applies equally to patients and healthcare providers, with potential for improving quality of care and strengthening patient-centered healthcare systems that prioritize healthcare providers’ wellbeing.

Open Access PDF

Concepts Keywords
Catboost Algorithms
Decade Artificial intelligence
Forest COVID-19
Healthcare Health Personnel
Pandemic Humans
Machine Learning
Machine learning
Maternity
Neural Networks, Computer
Pandemics
Public Health
Public health
SARS-CoV-2
Support Vector Machine
Workflow

Semantics

Type Source Name
disease MESH COVID-19 pandemic
disease IDO algorithm
disease MESH emergencies
disease IDO quality
pathway REACTOME Reproduction
disease MESH confusion
disease MESH dementia
disease MESH myocardial infarction
disease MESH cardiovascular risk factors
disease MESH frontotemporal dementia
drug DRUGBANK Esomeprazole
disease MESH abnormalities
drug DRUGBANK Cysteamine
disease IDO process
drug DRUGBANK Aspartame
disease MESH infection

Original Article

(Visited 2 times, 1 visits today)