A method to combine target volume data from 3D and 4D planned thoracic radiotherapy patient cohorts for machine learning applications

Corinne Johnson, Gareth Price, Jonathan Khalifa, Corinne Faivre-Finn, Andre Dekker, Christopher Moore, Marcel Van Herk

Research output: Contribution to journalArticlepeer-review

322 Downloads (Pure)

Abstract

Background and Purpose: The gross tumour volume (GTV) is predictive of clinical outcome and consequently features in many machine-learned models. 4D-planning, however, has prompted substitution of the GTV with the internal gross target volume (iGTV). We present and validate a method to synthesise GTV data from the iGTV, allowing the combination of 3D and 4D planned patient cohorts for modelling.

Material and Methods: Expert delineations in 40 non-small cell lung cancer patients were used to develop linear fit and erosion methods to synthesise the GTV volume and shape. Quality was assessed using Dice Similarity Coefficients (DSC) and closest point measurements; by calculating dosimetric features; and by assessing the quality of random forest models built on patient populations with and without synthetic GTVs.

Results: Volume estimates were within the magnitudes of inter-observer delineation variability. Shape comparisons produced mean DSCs of 0.8817 and 0.8584 for upper and lower lobe cases, respectively. A model trained on combined true and synthetic data performed significantly better than models trained on GTV alone, or combined GTV and iGTV data.

Conclusions: Accurate synthesis of GTV size from the iGTV permits the combination of lung cancer patient cohorts, facilitating machine learning applications in thoracic radiotherapy.
Original languageEnglish
JournalRadiotherapy and Oncology
Early online date6 Dec 2017
DOIs
Publication statusPublished - Feb 2018

Research Beacons, Institutes and Platforms

  • Manchester Cancer Research Centre

Fingerprint

Dive into the research topics of 'A method to combine target volume data from 3D and 4D planned thoracic radiotherapy patient cohorts for machine learning applications'. Together they form a unique fingerprint.

Cite this