The figure illustrates the overall architecture of the proposed model. The proposed method enhances feature extraction by incorporating an encoder block by enhancing it, ...
Abstract:
Accurate segmentation of vertebrae in computed tomography (CT) images possess serious challenges due to the irregular vertebral boundaries, low contrast and brightness, a...Show MoreMetadata
Abstract:
Accurate segmentation of vertebrae in computed tomography (CT) images possess serious challenges due to the irregular vertebral boundaries, low contrast and brightness, and noise in CT scans. This study presents a novel channel attention-based EfficientNetB7-UNet (CAEB7-UNet) method to address this complex task effectively. The proposed model introduces an upgraded ReLU-based channel attention module (CAM) in the skip connection which restrains the nonessential attributes by suppressing them and accentuates the relevant features by emphasizing them to boost the overall segmentation performance. In this work, an improved EfficientNetB7 is employed as the encoder for feature extraction, the fusion of local and global features is enhanced through the upgraded CAM in skip connection, and the up-sampling is performed in the decoder. Further, the model is optimized by incorporating hyperparameter optimization, specifically, hybrid learning rate scheduler strategies, along with the AdamW optimizer and custom data augmentation. A total of 34,782 CT images obtained from the RSNA-2022 cervical spine fracture detection challenge is utilized in this study. The proposed model achieves outstanding performance, yielding a dice score index (DSI) of 96.14% and mean intersection over union (mIoU) of 91.46%. Moreover, a comparative performance analysis of CAEB7-UNet with two state-of-the-art models is carried out on the same dataset. Our approach outperforms both the models, with the best one by 8.1%, 6.73%, 12.7%, and 11.98% in terms of DSI, mIoU, precision, and F1-score respectively. Additionally, it requires merely 0.38 seconds to generate the segmentation mask of a single slice of a CT scan.
The figure illustrates the overall architecture of the proposed model. The proposed method enhances feature extraction by incorporating an encoder block by enhancing it, ...
Published in: IEEE Access ( Volume: 13)