MRI-based automatic segmentation of rectal cancer using 2D U-Net on two independent cohorts
Knuth, Franziska Hanna; Adde, Ingvild Askim; Huynh, Bao Ngoc; Grøndahl, Aurora Rosvoll; Winter, René; Negård, Anne; Holmedal, Stein Harald; Meltzer, Sebastian; Ree, Anne Hansen; Flatmark, Kjersti; Dueland, Svein; Hole, Knut Håkon; Seierstad, Therese; Redalen, Kathrine; Futsæther, Cecilia Marie
Peer reviewed, Journal article
Accepted version
View/ Open
Date
2021Metadata
Show full item recordCollections
- Institutt for fysikk [2770]
- Publikasjoner fra CRIStin - NTNU [39093]
Abstract
Background
Tumor delineation is time- and labor-intensive and prone to inter- and intraobserver variations. Magnetic resonance imaging (MRI) provides good soft tissue contrast, and functional MRI captures tissue properties that may be valuable for tumor delineation. We explored MRI-based automatic segmentation of rectal cancer using a deep learning (DL) approach. We first investigated potential improvements when including both anatomical T2-weighted (T2w) MRI and diffusion-weighted MR images (DWI). Secondly, we investigated generalizability by including a second, independent cohort.
Material and methods
Two cohorts of rectal cancer patients (C1 and C2) from different hospitals with 109 and 83 patients, respectively, were subject to 1.5 T MRI at baseline. T2w images were acquired for both cohorts and DWI (b-value of 500 s/mm2) for patients in C1. Tumors were manually delineated by three radiologists (two in C1, one in C2). A 2D U-Net was trained on T2w and T2w + DWI. Optimal parameters for image pre-processing and training were identified on C1 using five-fold cross-validation and patient Dice similarity coefficient (DSCp) as performance measure. The optimized models were evaluated on a C1 hold-out test set and the generalizability was investigated using C2.
Results
For cohort C1, the T2w model resulted in a median DSCp of 0.77 on the test set. Inclusion of DWI did not further improve the performance (DSCp 0.76). The T2w-based model trained on C1 and applied to C2 achieved a DSCp of 0.59.
Conclusion
T2w MR-based DL models demonstrated high performance for automatic tumor segmentation, at the same level as published data on interobserver variation. DWI did not improve results further. Using DL models on unseen cohorts requires caution, and one cannot expect the same performance.