Abstract:
Due to constraints from imaging devices, the most effective method for estimating the pose of space target RGB images is to establish a 2-D–3-D correspondence and then us...Show MoreMetadata
Abstract:
Due to constraints from imaging devices, the most effective method for estimating the pose of space target RGB images is to establish a 2-D–3-D correspondence and then use the perspective-n-point (PnP) algorithm for pose recovery. However, traditional PnP algorithms are not differentiable, which hinders their integration with neural network training. Although recent work attempts to make PnP partially differentiable during the 2-D–3-D matching stage by mathematical methods, this leads to increased inevitably computational costs. To this end, we propose a scale-consistent learnable PnP (SCLP) network that facilitates end-to-end pose estimation for space targets. Our method incorporates a sparse keypoint learnable PnP (SKL-PnP) layer within a multiscale network, enabling PnP to function as a differentiable layer that integrates seamlessly with preceding neural components. Additionally, we also sample the 2-D–3-D correspondences to obtain sparse keypoint pairs, achieving a lightweight single-stage 6-D pose estimation algorithm. To manage the significant scale variations in space target images, we introduce Gaussian perception sampling (GPS) by assigning instances to different pyramid levels based on size. Furthermore, we propose a scale consistency regularization (SCR) module that aligns downsized feature maps with original ones to better address scale differences. Experimental results demonstrate that our approach achieves superior accuracy and efficiency on the SPEED and SwissCube datasets, showing significant improvements over state-of-the-art methods.
Published in: IEEE Transactions on Geoscience and Remote Sensing ( Volume: 63)
Funding Agency:

State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi’an, China
Xi Yang (Senior Member, IEEE) received the B.Eng. degree in electronic information engineering and the Ph.D. degree in pattern recognition and intelligence system from Xidian University, Xi’an, China, in 2010 and 2015, respectively.
From 2013 to 2014, she was a Visiting Ph.D. Student with the Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX, USA. In 2015, she joined the State Key Labo...Show More
Xi Yang (Senior Member, IEEE) received the B.Eng. degree in electronic information engineering and the Ph.D. degree in pattern recognition and intelligence system from Xidian University, Xi’an, China, in 2010 and 2015, respectively.
From 2013 to 2014, she was a Visiting Ph.D. Student with the Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX, USA. In 2015, she joined the State Key Labo...View more

State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi’an, China
Jingyuan Wang received the B.Eng. degree in telecommunications engineering from Xidian University, Xi’an, China, in 2023, where he is currently pursuing the M.S. degree in information and communication engineering.
His current research interests include deep learning and computer vision.
Jingyuan Wang received the B.Eng. degree in telecommunications engineering from Xidian University, Xi’an, China, in 2023, where he is currently pursuing the M.S. degree in information and communication engineering.
His current research interests include deep learning and computer vision.View more

State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi’an, China
Songsong Duan received the M.S. degree from Anhui University of Science and Technology, Huainan, China, in 2023. He is currently pursuing the Ph.D. degree with the School of Telecommunications Engineering, Xidian University, Xi’an, China.
His research interests include computer vision, weakly supervised learning, and open-vocabulary learning.
Songsong Duan received the M.S. degree from Anhui University of Science and Technology, Huainan, China, in 2023. He is currently pursuing the Ph.D. degree with the School of Telecommunications Engineering, Xidian University, Xi’an, China.
His research interests include computer vision, weakly supervised learning, and open-vocabulary learning.View more

State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi’an, China
Xi Yang (Senior Member, IEEE) received the B.Eng. degree in electronic information engineering and the Ph.D. degree in pattern recognition and intelligence system from Xidian University, Xi’an, China, in 2010 and 2015, respectively.
From 2013 to 2014, she was a Visiting Ph.D. Student with the Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX, USA. In 2015, she joined the State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, where she is currently a Professor in communications and information systems. She has published over 60 articles in refereed journals and proceedings, including IEEE T-TIP, T-NNLS, T-CYB, T-GRS, CVPR, ICCV, and ACM MM. Her current research interests include image/video processing, computer vision, and machine learning.
Xi Yang (Senior Member, IEEE) received the B.Eng. degree in electronic information engineering and the Ph.D. degree in pattern recognition and intelligence system from Xidian University, Xi’an, China, in 2010 and 2015, respectively.
From 2013 to 2014, she was a Visiting Ph.D. Student with the Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX, USA. In 2015, she joined the State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, where she is currently a Professor in communications and information systems. She has published over 60 articles in refereed journals and proceedings, including IEEE T-TIP, T-NNLS, T-CYB, T-GRS, CVPR, ICCV, and ACM MM. Her current research interests include image/video processing, computer vision, and machine learning.View more

State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi’an, China
Jingyuan Wang received the B.Eng. degree in telecommunications engineering from Xidian University, Xi’an, China, in 2023, where he is currently pursuing the M.S. degree in information and communication engineering.
His current research interests include deep learning and computer vision.
Jingyuan Wang received the B.Eng. degree in telecommunications engineering from Xidian University, Xi’an, China, in 2023, where he is currently pursuing the M.S. degree in information and communication engineering.
His current research interests include deep learning and computer vision.View more

State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi’an, China
Songsong Duan received the M.S. degree from Anhui University of Science and Technology, Huainan, China, in 2023. He is currently pursuing the Ph.D. degree with the School of Telecommunications Engineering, Xidian University, Xi’an, China.
His research interests include computer vision, weakly supervised learning, and open-vocabulary learning.
Songsong Duan received the M.S. degree from Anhui University of Science and Technology, Huainan, China, in 2023. He is currently pursuing the Ph.D. degree with the School of Telecommunications Engineering, Xidian University, Xi’an, China.
His research interests include computer vision, weakly supervised learning, and open-vocabulary learning.View more