Research PaperNo Access

Mixture 2D Convolutions for 3D Medical Image Segmentation

Jianyong Wang

Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, Sichuan, P. R. China

E-mail Address: wjy@scu.edu.cn

Search for more papers by this author

Lei Zhang

Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, Sichuan, P. R. China

E-mail Address: leizhang@scu.edu.cn

Search for more papers by this author

, and

Yi Zhang

Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, Sichuan, P. R. China

E-mail Address: zhangyi@scu.edu.cn

Corresponding author.

Search for more papers by this author

https://doi.org/10.1142/S0129065722500599Cited by:21 (Source: Crossref)

Abstract

Three-dimensional (3D) medical image segmentation plays a crucial role in medical care applications. Although various two-dimensional (2D) and 3D neural network models have been applied to 3D medical image segmentation and achieved impressive results, a trade-off remains between efficiency and accuracy. To address this issue, a novel mixture convolutional network (MixConvNet) is proposed, in which traditional 2D/3D convolutional blocks are replaced with novel MixConv blocks. In the MixConv block, 3D convolution is decomposed into a mixture of 2D convolutions from different views. Therefore, the MixConv block fully utilizes the advantages of 2D convolution and maintains the learning ability of 3D convolution. It acts as 3D convolutions and thus can process volumetric input directly and learn intra-slice features, which are absent in the traditional 2D convolutional block. By contrast, the proposed MixConv block only contains 2D convolutions; hence, it has significantly fewer trainable parameters and less computation budget than a block containing 3D convolutions. Furthermore, the proposed MixConvNet is pre-trained with small input patches and fine-tuned with large input patches to improve segmentation performance further. In experiments on the Decathlon Heart dataset and Sliver07 dataset, the proposed MixConvNet outperformed the state-of-the-art methods such as UNet3D, VNet, and nnUnet.

Keywords:

References

1. F. Isensee, P. F. Jaeger, S. A. A. Kohl, J. Petersen and K. H. Maier-Hein , nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation, Nat. Methods 18(2) (2021) 203–211. Crossref, Medline, Web of Science, Google Scholar
2. Q. Dou, L. Yu, H. Chen, Y. Jin, X. Yang, J. Qin and P.-A. Heng , 3D deeply supervised network for automated segmentation of volumetric medical images, Med. Image Anal. 41 (2017) 40–54. Crossref, Medline, Web of Science, Google Scholar
3. Y. Hua, X. Shu, Z. Wang and L. Zhang , Uncertainty-guided voxel-level supervised contrastive learning for semi-supervised medical image segmentation, Int. J. Neural Syst. 32(04) (2022) 2250016. Link, Web of Science, Google Scholar
4. O. Ronneberger, P. Fischer and T. Brox , U-Net: Convolutional networks for biomedical image segmentation, in Medical Image Computing and Computer-Assisted Intervention — MICCAI 2015 (Springer International Publishing, Cham, 2015), pp. 234–241. Crossref, Google Scholar
5. F. Milletari, N. Navab and S.-A. Ahmadi , V-Net: Fully convolutional neural networks for volumetric medical image segmentation, in 2016 4th Int. Conf. on 3D Vision (3DV) (IEEE, 2016), pp. 565–571. Crossref, Google Scholar
6. Q. Yu, Y. Xia, L. Xie, E. K. Fishman and A. L. Yuille, Thickened 2D networks for efficient 3D medical image segmentation, preprint (2019), arXiv:1904.01150. Google Scholar
7. L. Li, S. Lian, Z. Luo, S. Li, B. Wang and S. Li , Learning consistency- and discrepancy-context for 2D organ segmentation, in Medical Image Computing and Computer Assisted Intervention — MICCAI 2021 (Springer International Publishing, 2021), pp. 261–270. Crossref, Google Scholar
8. A. Comelli, N. Dahiya, A. Stefano, F. Vernuccio, M. Portoghese, G. Cutaia, A. Bruno, G. Salvaggio and A. Yezzi , Deep learning-based methods for prostate segmentation in magnetic resonance imaging, Appl. Sci. 11(2) (2021) 782. Crossref, Google Scholar
9. O. Oktay et al., Attention U-Net: Learning where to look for the pancreas, preprint (2018), arXiv:1804.03999. Google Scholar
10. K. Men, J. Dai and Y. Li , Automatic segmentation of the clinical target volume and organs at risk in the planning CT for rectal cancer using deep dilated convolutional neural networks, Med. Phys. 44(12) (2017) 6377–6389. Crossref, Medline, Web of Science, Google Scholar
11. T. He, J. Hu, Y. Song, J. Guo and Z. Yi , Multi-task learning for the segmentation of organs at risk with label dependence, Med. Image Anal. 61 (2020) 101666. Crossref, Medline, Web of Science, Google Scholar
12. J. Hu, Y. Song, L. Zhang, S. Bai and Z. Yi , Multi-scale attention U-net for segmenting clinical target volume in graves’ ophthalmopathy, Neurocomputing 427 (2021) 74–83. Crossref, Web of Science, Google Scholar
13. Ö. Çiçek, A. Abdulkadir, S. S. Lienkamp, T. Brox and O. Ronneberger , 3D U-Net: Learning dense volumetric segmentation from sparse annotation, in Medical Image Computing and Computer-Assisted Intervention — MICCAI 2016 (Springer International Publishing, Cham, 2016), pp. 424–432. Google Scholar
14. H. Chen, Q. Dou, L. Yu, J. Qin and P.-A. Heng , VoxResNet: Deep voxelwise residual networks for brain segmentation from 3D MR images, NeuroImage 170 (2018) 446–455. Crossref, Medline, Web of Science, Google Scholar
15. H. R. Roth, L. Lu, A. Farag, H.-C. Shin, J. Liu, E. B. Turkbey and R. M. Summers , DeepOrgan: Multi-level deep convolutional networks for automated pancreas segmentation, in Medical Image Computing and Computer-Assisted Intervention — MICCAI 2015, Lecture Notes in Computer Science, Vol. 9349 (Springer International Publishing, 2015), pp. 556–564. Crossref, Google Scholar
16. J. Chen, L. Yang, Y. Zhang, M. Alber and D. Z. Chen , Combining fully convolutional and recurrent neural networks for 3D biomedical image segmentation, in Advances in Neural Information Processing Systems, Vol. 29 (Curran Associates, Inc., 2016), pp. 3044–3052. Google Scholar
17. Y. Xia, L. Xie, F. Liu, Z. Zhu, E. K. Fishman and A. L. Yuille , Bridging the gap between 2D and 3D organ segmentation with volumetric fusion net, in Medical Image Computing and Computer Assisted Intervention — MICCAI 2018 (Springer International Publishing, 2018), pp. 445–453. Crossref, Google Scholar
18. X. Li, H. Chen, X. Qi, Q. Dou, C.-W. Fu and P.-A. Heng , H-DenseUNet: Hybrid densely connected UNet for liver and tumor segmentation from CT volumes, IEEE Trans. Med. Imaging 37(12) (2018) 2663–2674. Crossref, Medline, Web of Science, Google Scholar
19. A. L. Simpson et al., A large annotated medical image dataset for the development and evaluation of segmentation algorithms, preprint (2019), arXiv:1902.09063. Google Scholar
20. T. Heimann et al., Comparison and evaluation of methods for liver segmentation from CT datasets, IEEE Trans. Med. Imaging 28(8) (2009) 1251–1265. Crossref, Medline, Web of Science, Google Scholar
21. Y. LeCun, Y. Bengio and G. Hinton , Deep learning, Nature 521(7553) (2015) 436. Crossref, Medline, Web of Science, Google Scholar
22. A. Hassanpour, M. Moradikia, H. Adeli, S. R. Khayami and P. Shamsinejadbabaki , A novel end-to-end deep learning scheme for classifying multi-class motor imagery electroencephalography signals, Expert Syst. 36(6) (2019) e12494. Crossref, Web of Science, Google Scholar
23. G. B. Martins, J. P. Papa and H. Adeli , Deep learning techniques for recommender systems based on collaborative filtering, Expert Syst. 37(6) (2020) e12647. Crossref, Web of Science, Google Scholar
24. J. Wang, R. Ju, Y. Chen, L. Zhang, J. Hu, Y. Wu, W. Dong, J. Zhong and Z. Yi , Automated retinopathy of prematurity screening using deep neural networks, EBioMedicine 35 (2018) 361–368. Crossref, Medline, Web of Science, Google Scholar
25. L.-C. Lin, C.-S. Ouyang, R.-C. Wu, R.-C. Yang and C.-T. Chiang , Alternative diagnosis of epilepsy in children without epileptiform discharges using deep convolutional neural networks, Int. J. Neural Syst. 30(05) (2020) 1850060. Link, Web of Science, Google Scholar
26. H. S. Nogay and H. Adeli , Machine learning (ML) for the diagnosis of autism spectrum disorder (ASD) using brain imaging, Rev. Neurosci. 31(8) (2020) 825–841. Crossref, Web of Science, Google Scholar
27. H. S. Nogay and H. Adeli , Detection of epileptic seizure using pretrained deep convolutional neural network and transfer learning, Eur. Neurol. 83(6) (2020) 602–614. Crossref, Medline, Web of Science, Google Scholar
28. G. Liu, W. Zhou and M. Geng , Automatic seizure detection based on S-transform and deep convolutional neural network, Int. J. Neural Syst. 30(04) (2020) 1950024. Link, Web of Science, Google Scholar
29. A. Lozano, J. S. Suárez, C. Soto-Sánchez, J. Garrigós, J. J. Martínez-Alvarez, J. M. Ferrández and E. Fernández , Neurolight: A deep learning neural interface for cortical visual prostheses, Int. J. Neural Syst. 30(09) (2020) 2050045. Link, Web of Science, Google Scholar
30. G. Mirzaei and H. Adeli , Segmentation and clustering in brain MRI imaging, Rev. Neurosci. 30(1) (2019) 31–44. Crossref, Web of Science, Google Scholar
31. C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens and Z. Wojna , Rethinking the inception architecture for computer vision, in Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2016, Las Vegas, Nevada, USA, pp. 2818–2826. Crossref, Google Scholar
32. K. He, X. Zhang, S. Ren and J. Sun , Deep Residual Learning for Image Recognition, in IEEE Conf. Computer Vision and Pattern Recognition, 2016, Las Vegas, Nevada, USA, pp. 770–778. Crossref, Google Scholar
33. X. Zhang, X. Zhou, M. Lin and J. Sun , ShuffleNet: An extremely efficient convolutional neural network for mobile devices, in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2018, Salt Lake City, UT, USA, pp. 6848–6856. Crossref, Google Scholar
34. M. Sandler, A. Howard, M. Zhu, A. Zhmoginov and L.-C. Chen , MobileNetV2: Inverted residuals and linear bottlenecks, in 2018 IEEE/CVF Conf. Computer Vision and Pattern Recognition (IEEE, 2018), pp. 4510–4520. Crossref, Google Scholar
35. M.-H. Guo, C.-Z. Lu, Z.-N. Liu, M.-M. Cheng and S.-M. Hu, Visual attention network, preprint (2022), arXiv:2202.09741. Google Scholar
36. J. Wang, J. Hu, Y. Song, Q. Wang, X. Zhang, S. Bai and Z. Yi , VMAT dose prediction in radiotherapy by using progressive refinement UNet, Neurocomputing 488 (2021) S0925231221017380. Web of Science, Google Scholar
37. K. Simonyan and A. Zisserman , Very deep convolutional networks for large-scale image recognition, in 3rd Int. Conf. Learning Representations, 2015, San Diego, pp. 1–14. Google Scholar
38. W. R. Crum, O. Camara and D. L. G. Hill , Generalized overlap measures for evaluation and validation in medical image analysis, IEEE Trans. Med. Imaging 25 (2006) 1451–1461. Crossref, Medline, Web of Science, Google Scholar
39. L. Lin, Q. Dou, Y.-M. Jin, G.-Q. Zhou, Y.-Q. Tang, W.-L. Chen, B.-A. Su, F. Liu, C.-J. Tao, N. Jiang, J.-Y. Li, L.-L. Tang, C.-M. Xie, S.-M. Huang, J. Ma, P.-A. Heng, J. T. S. Wee, M. L. K. Chua, H. Chen and Y. Sun , Deep learning for automated contouring of primary tumor volumes by MRI for nasopharyngeal carcinoma, Radiology 291(3) (2019) 677–686. Crossref, Medline, Web of Science, Google Scholar