TY - JOUR
T1 - WORD
T2 - A large scale dataset, benchmark and clinical applicable study for abdominal organ segmentation from CT image
AU - Luo, Xiangde
AU - Liao, Wenjun
AU - Xiao, Jianghong
AU - Chen, Jieneng
AU - Song, Tao
AU - Zhang, Xiaofan
AU - Li, Kang
AU - Metaxas, Dimitris N.
AU - Wang, Guotai
AU - Zhang, Shaoting
N1 - Funding Information:
This work was supported by the National Natural Science Foundation of China [ 81771921 , 61901084 ], the National Key Research and Development Program, China [ 2020YFB1711503 ] and also by key research and development project of Sichuan province, China [ 20ZDYF2817 ]. We would like to thank Mr. Zhiqiang Hu and Guofeng Lv from the SenseTime Research for constructive discussions and suggestions and also thank M.D. J. Xiao and W. Liao and their team members for data collection, annotation, checking and user study. We also would like to thank the Shanghai AI Lab and Shanghai SenseTime Research for their high-performance computation support.
Funding Information:
This work was supported by the National Natural Science Foundation of China [81771921, 61901084], the National Key Research and Development Program, China [2020YFB1711503] and also by key research and development project of Sichuan province, China [20ZDYF2817]. We would like to thank Mr. Zhiqiang Hu and Guofeng Lv from the SenseTime Research for constructive discussions and suggestions and also thank M.D. J. Xiao and W. Liao and their team members for data collection, annotation, checking and user study. We also would like to thank the Shanghai AI Lab and Shanghai SenseTime Research for their high-performance computation support.
Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2022/11
Y1 - 2022/11
N2 - Whole abdominal organ segmentation is important in diagnosing abdomen lesions, radiotherapy, and follow-up. However, oncologists’ delineating all abdominal organs from 3D volumes is time-consuming and very expensive. Deep learning-based medical image segmentation has shown the potential to reduce manual delineation efforts, but it still requires a large-scale fine annotated dataset for training, and there is a lack of large-scale datasets covering the whole abdomen region with accurate and detailed annotations for the whole abdominal organ segmentation. In this work, we establish a new large-scale Whole abdominal ORgan Dataset (WORD) for algorithm research and clinical application development. This dataset contains 150 abdominal CT volumes (30495 slices). Each volume has 16 organs with fine pixel-level annotations and scribble-based sparse annotations, which may be the largest dataset with whole abdominal organ annotation. Several state-of-the-art segmentation methods are evaluated on this dataset. And we also invited three experienced oncologists to revise the model predictions to measure the gap between the deep learning method and oncologists. Afterwards, we investigate the inference-efficient learning on the WORD, as the high-resolution image requires large GPU memory and a long inference time in the test stage. We further evaluate the scribble-based annotation-efficient learning on this dataset, as the pixel-wise manual annotation is time-consuming and expensive. The work provided a new benchmark for the abdominal multi-organ segmentation task, and these experiments can serve as the baseline for future research and clinical application development.
AB - Whole abdominal organ segmentation is important in diagnosing abdomen lesions, radiotherapy, and follow-up. However, oncologists’ delineating all abdominal organs from 3D volumes is time-consuming and very expensive. Deep learning-based medical image segmentation has shown the potential to reduce manual delineation efforts, but it still requires a large-scale fine annotated dataset for training, and there is a lack of large-scale datasets covering the whole abdomen region with accurate and detailed annotations for the whole abdominal organ segmentation. In this work, we establish a new large-scale Whole abdominal ORgan Dataset (WORD) for algorithm research and clinical application development. This dataset contains 150 abdominal CT volumes (30495 slices). Each volume has 16 organs with fine pixel-level annotations and scribble-based sparse annotations, which may be the largest dataset with whole abdominal organ annotation. Several state-of-the-art segmentation methods are evaluated on this dataset. And we also invited three experienced oncologists to revise the model predictions to measure the gap between the deep learning method and oncologists. Afterwards, we investigate the inference-efficient learning on the WORD, as the high-resolution image requires large GPU memory and a long inference time in the test stage. We further evaluate the scribble-based annotation-efficient learning on this dataset, as the pixel-wise manual annotation is time-consuming and expensive. The work provided a new benchmark for the abdominal multi-organ segmentation task, and these experiments can serve as the baseline for future research and clinical application development.
KW - Abdominal organ segmentation
KW - Benchmark
KW - Clinical applicable study
KW - Dataset
UR - http://www.scopus.com/inward/record.url?scp=85140416624&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140416624&partnerID=8YFLogxK
U2 - 10.1016/j.media.2022.102642
DO - 10.1016/j.media.2022.102642
M3 - Article
C2 - 36223682
AN - SCOPUS:85140416624
SN - 1361-8415
VL - 82
JO - Medical Image Analysis
JF - Medical Image Analysis
M1 - 102642
ER -