Evaluating predictive performance, validity, and applicability of machine learning models for predicting HIV treatment interruption: a systematic review
Loading...
Date
Journal Title
Journal ISSN
Volume Title
Publisher
BMC Global and Public Health
Abstract
Background HIV treatment interruption remains a significant barrier to achieving global HIV/AIDS control goals.
Machine learning (ML) models offer potential for predicting treatment interruption by leveraging large clinical data.
Understanding how these models were developed, validated, and applied remains essential for advancing research.
Methods We searched databases including the PubMed, BMC, Cochrane Library, Scopus, ScienceDirect, Lancet,
and Google Scholar, for studies published in English from 1990 to September 2024. Search terms covered HIV,
machine learning, treatment interruption, and loss to follow‑up. Articles were screened and reviewed independently,
and data were extracted using the CHecklist for critical Appraisal and data extraction for systematic Reviews of pre
diction Modelling Studies (CHARMS) tool. Risk of bias was assessed with Prediction model Risk Of Bias Assessment
Tool (PROBAST). The Preferred Reporting Items for Systematic reviews and Meta‑analysis (PRISMA) guidelines were
followed throughout.
Results Out of 116,672 records, 9 studies met the inclusion criteria and reported 12 ML models. Random For
est, XGBoost, and AdaBoost were predominant models (91.7%). Internal validation was performed in all models,
but only two models included external validation. Performance varied, with a mean area under the receiver operat
ing characteristic curve (AUC‑ROC) of 0.668 (standard deviation (SD) = 0.066), indicating moderate discrimination.
About 75% of models showed a high risk of bias due to inadequate handling of missing data, lack of calibration,
and the absence of decision curve analysis (DCA).
Conclusions ML models show promise for predicting HIV treatment interruption, particularly in resource‑limited set
tings. Future research should prioritize external validation, robust missing data handling, and decision curve analysis
and include sociocultural predictors to improve model robustness.
Description
Research Article
