Sample sizes of prediction model studies in prostate cancer were rarely justified and often insufficient

Shane D Collins, Niels Peek, Richard D Riley, Glen P Martin

Research output: Contribution to journalArticlepeer-review

Abstract

OBJECTIVE: Developing clinical prediction models (CPMs) on data of sufficient sample size is critical to help minimize overfitting. Using prostate cancer as a clinical exemplar, we aimed to investigate to what extent existing CPMs adhere to recent formal sample size criteria, or historic rules of thumb of events per predictor parameter (EPP)≥10.

STUDY DESIGN AND SETTING: A systematic review to identify CPMs related to prostate cancer, which provided enough information to calculate minimum sample size. We compared the reported sample size of each CPM against the traditional 10 EPP rule of thumb and formal sample size criteria.

RESULTS: About 211 CPMs were included. Three of the studies justified the sample size used, mostly using EPP rules of thumb. Overall, 69% of the CPMs were derived on sample sizes that surpassed the traditional EPP≥10 rule of thumb, but only 48% surpassed recent formal sample size criteria. For most CPMs, the required sample size based on formal criteria was higher than the sample sizes to surpass 10 EPP.

CONCLUSION: Few of the CPMs included in this study justified their sample size, with most justifications being based on EPP. This study shows that, in real-world data sets, adhering to the classic EPP rules of thumb is insufficient to adhere to recent formal sample size criteria.

Original languageEnglish
Pages (from-to)53-60
Number of pages8
JournalJournal of Clinical Epidemiology
Volume133
Early online date28 Dec 2020
DOIs
Publication statusPublished - 1 May 2021

Fingerprint

Dive into the research topics of 'Sample sizes of prediction model studies in prostate cancer were rarely justified and often insufficient'. Together they form a unique fingerprint.

Cite this