Loading [MathJax]/extensions/MathMenu.js
Accurate Multi-Landmark Localization in 3D Ultra-High Resolution CT Images of the Ears via Deep Reinforcement Learning and Transformer | IEEE Journals & Magazine | IEEE Xplore

Accurate Multi-Landmark Localization in 3D Ultra-High Resolution CT Images of the Ears via Deep Reinforcement Learning and Transformer


Abstract:

Automated landmark localization can help radiologists quickly determine the locations of key structures or lesion areas from medical images. However, when facing large-vo...Show More

Abstract:

Automated landmark localization can help radiologists quickly determine the locations of key structures or lesion areas from medical images. However, when facing large-volume 3D medical images, existing methods have very high computational complexity due to the need to encode the global image. That is to say, it is difficult for existing methods to achieve accurate landmark localization in 3D medical images at a faster localization speed. In this paper, an accurate multi-landmark localization method for ear 3D Ultra-High Resolution CT (U-HRCT) images is proposed. This method adopts a novel localization pipeline that combines Deep Reinforcement Learning (DRL) and Transformer. Firstly, the DRL algorithm is used to quickly collect landmark-related local features. Secondly, Transformer is used to extract the spatial position relationship between anatomical structures from these discrete local features to infer the coordinate position of the landmark. Because the complex process of encoding the global image is avoided, the proposed method can achieve fast localization of ear multi-landmark in 3D U-HRCT images. Finally, we proposed a refinement module based on dual-branch hybrid Multi-Layer Perceptron, which can use the fast localization results of multi-landmark to learn the spatial position relationship between landmarks, thereby further improving the accuracy and stability of landmark localization. Experimental results on the self-built ear 3D U-HRCT dataset and the publicly available 2D cephalometric dataset demonstrate that, the proposed method can achieve Successful Detection Rate of 96.71% and 89.97% respectively within the precision range of 2.0 mm, surpassing the state-of-the-art multi-landmark localization methods.
Page(s): 1 - 14
Date of Publication: 11 April 2025

ISSN Information:

PubMed ID: 40215147

Funding Agency:


Contact IEEE to Subscribe