TY - JOUR
T1 - Using machine learning to predict anticoagulation control in atrial fibrillation: A UK Clinical Practice Research Datalink study
AU - Gordon, Jason
AU - Mason, Thomas
AU - Norman, Max
AU - Hurst, Michael
AU - Dickerson, Carissa
AU - Sandler, Belinda
AU - Pollock, Kevin
AU - Farooqui, Usman
AU - Groves, Lara
AU - Tsang, Carmen
AU - Clifton, David
AU - Bakhai, Ameet
AU - Hill, Nathan
N1 - Funding Information:
This study was sponsored by Bristol Myers Squibb Pharmaceutical Ltd & Pfizer Inc. The funding agreement ensured the authors? independence in designing the study, interpreting the data, and preparing the manuscript for publication.
Funding Information:
This study was sponsored by Bristol Myers Squibb Pharmaceutical Ltd & Pfizer Inc. . The funding agreement ensured the authors’ independence in designing the study, interpreting the data, and preparing the manuscript for publication.
Funding Information:
BS, KP, UF, and NH are employed by Bristol Myers Squibb (BMS) UK Ltd. JG, MN, CD, LG, and CT are current employees of Health Economics and Outcomes Research (HEOR) Ltd. who received funding from BMS UK Ltd. for this study. MH and TM were employees of HEOR Ltd. at the time of the study; MH is now employed by BMS UK Ltd. DC undertakes consultancy work with Biobeats Ltd. and Sensyne Health plc. AB does not hold any stock but is involved in research sponsored by and is a member of advisory panels and speakers’ bureau for Daiichi Sankyo, Pfizer, BMS, Bayer and Boehringer Ingelheim.
Publisher Copyright:
© 2021 The Authors
PY - 2021/8/1
Y1 - 2021/8/1
N2 - Objective: To investigate the predictive performance of machine learning (ML) algorithms for estimating anticoagulation control in patients with atrial fibrillation (AF) who are treated with warfarin. Methods: This was a retrospective cohort study of adult patients (≥18 years) between 2007 and 2016 using linked primary and secondary care data (Clinical Practice Research Datalink GOLD and Hospital Episode Statistics). Various ML techniques were explored to predict suboptimal anticoagulation control, defined as time in therapeutic range (TTR) < 70% based on International Normalised Ratio (INR) 2.0–3.0. Baseline (linear and non-linear support vector machines; random forests; stochastic gradient boosting [XGBoost]; neural networks [NN]) and time-varying data (6-week intervals up to 30 weeks (long-short term memory [LSTM] NN)) were applied. Patient records depicting unique lines of warfarin therapy (LOT) were separated into training (70%) and holdout sets (30%) for model training and testing, respectively. Results: 35,479 patients were eligible for inclusion, of whom 24,684 and 10,795 were assigned to the training (32,683 unique LOTs) and holdout sets (14,218 unique LOTs). Across all models, depression (diagnosis and/or prescription of antidepressant medication) was a significant driver in predicting anticoagulation control. At baseline, XGBoost was the best-performing model (area under the curve [AUC]: 0.624) due to its ability to identify non-linear associations such as age and weight (greater probability of suboptimal control: <65 and >80 years and <70 kg, respectively). Addition of time-varying data to the LSTM NN improved predictive performance, plateauing at AUC of 0.830 at 30 weeks. Conclusion: ML algorithms displayed clinically useful ability to predict patients who are at greater risk of suboptimal control. The addition of time-varying data to the algorithm, especially prior INR measurements, improved predictive performance. These algorithms provide improved predictive tools for identifying patients who may benefit from more frequent INR monitoring or switching to alternative therapies.
AB - Objective: To investigate the predictive performance of machine learning (ML) algorithms for estimating anticoagulation control in patients with atrial fibrillation (AF) who are treated with warfarin. Methods: This was a retrospective cohort study of adult patients (≥18 years) between 2007 and 2016 using linked primary and secondary care data (Clinical Practice Research Datalink GOLD and Hospital Episode Statistics). Various ML techniques were explored to predict suboptimal anticoagulation control, defined as time in therapeutic range (TTR) < 70% based on International Normalised Ratio (INR) 2.0–3.0. Baseline (linear and non-linear support vector machines; random forests; stochastic gradient boosting [XGBoost]; neural networks [NN]) and time-varying data (6-week intervals up to 30 weeks (long-short term memory [LSTM] NN)) were applied. Patient records depicting unique lines of warfarin therapy (LOT) were separated into training (70%) and holdout sets (30%) for model training and testing, respectively. Results: 35,479 patients were eligible for inclusion, of whom 24,684 and 10,795 were assigned to the training (32,683 unique LOTs) and holdout sets (14,218 unique LOTs). Across all models, depression (diagnosis and/or prescription of antidepressant medication) was a significant driver in predicting anticoagulation control. At baseline, XGBoost was the best-performing model (area under the curve [AUC]: 0.624) due to its ability to identify non-linear associations such as age and weight (greater probability of suboptimal control: <65 and >80 years and <70 kg, respectively). Addition of time-varying data to the LSTM NN improved predictive performance, plateauing at AUC of 0.830 at 30 weeks. Conclusion: ML algorithms displayed clinically useful ability to predict patients who are at greater risk of suboptimal control. The addition of time-varying data to the algorithm, especially prior INR measurements, improved predictive performance. These algorithms provide improved predictive tools for identifying patients who may benefit from more frequent INR monitoring or switching to alternative therapies.
KW - Anticoagulation control
KW - Atrial fibrillation
KW - International normalised ratio
KW - Machine learning
KW - Unsupervised learning
KW - Warfarin
U2 - 10.1016/j.imu.2021.100688
DO - 10.1016/j.imu.2021.100688
M3 - Article
SN - 2352-9148
VL - 25
JO - Informatics in Medicine Unlocked
JF - Informatics in Medicine Unlocked
M1 - 100688
ER -