Machine-learning models to predict iron recovery after blood donation: a model development and external validation study.

Pubmed ID: 40447352

Journal: The Lancet. Haematology

Publication Date: June 1, 2025

MeSH Terms: Humans, Male, Adult, Female, Adolescent, Middle Aged, Young Adult, Blood Donors, Retrospective Studies, Hemoglobins, Iron, Ferritins, Biomarkers, Machine Learning, Blood Donation

Authors: Custer B, Li W, Russell WA, Su CY, Meulenbeld A, Jagirdar H, Janssen MP, Swanevelder R, Bruhn R, Kaidarova Z, Bravo MD, Cao S, van den Berg K

Cite As: Li W, Su CY, Meulenbeld A, Jagirdar H, Janssen MP, Swanevelder R, Bruhn R, Kaidarova Z, Bravo MD, Cao S, Custer B, van den Berg K, Russell WA. Machine-learning models to predict iron recovery after blood donation: a model development and external validation study. Lancet Haematol 2025 Jun;12(6):e431-e441.

Studies:

Abstract

BACKGROUND: Machine-learning models directly predicting iron biomarkers after blood donation could help to manage donation-associated iron deficiency and avoid low haemoglobin deferrals. No such models have been externally validated internationally. Our aim was to develop and externally validate machine-learning models predicting returning blood donors' haemoglobin and ferritin. METHODS: We developed machine-learning models using retrospective blood donation data. The training cohort included 2425 repeat donors (2007-09 US-based RISE study); external validation used 2014-23 cohorts from the USA, South Africa, and the Netherlands. Models predicted donors' ferritin and haemoglobin at return donations by use of variables that are commonly measured by blood collectors (time until donors return, donation history, demographics, and baseline iron biomarkers). Models were selected via cross-validation and externally validated in donors aged at least 15 years in two contexts: those with baseline ferritin and haemoglobin measured (haemoglobin and ferritin) and those with only baseline haemoglobin measured (haemoglobin only). Model performance was assessed by use of root-mean-square percentage error (RMSPE). FINDINGS: When predicting return haemoglobin in the RISE cohort, model performance was similar in the haemoglobin and ferritin dataset (n=2625 donation visits, RMSPE=6·78) and haemoglobin only dataset (n=3488 donation visits, RMSPE=6·78). In the external datasets, containing 11 000 to 514 000 donations, RMSPE never increased more than 8%. When predicting return ferritin in RISE, performance was better in the haemoglobin and ferritin dataset (RMSPE=14·9) than in the haemoglobin only dataset (RMSPE=27·4). In external validation, RMSPE never increased more than 0·4% and 28% in the haemoglobin-only datasets and the haemoglobin and ferritin datasets, respectively. INTERPRETATION: Machine-learning models predicting haemoglobin and ferritin at return donations generalised well across diverse settings and could enable individualised approaches to manage iron deficiency while maintaining a sufficient blood supply. FUNDING: The Association for the Advancement of Blood and Biotherapies. TRANSLATION: For the Dutch translation of the abstract see Supplementary Materials section.