Unsupervised Global and Local Homography Estimation With Coplanarity-Aware GAN | IEEE Journals & Magazine | IEEE Xplore

Unsupervised Global and Local Homography Estimation With Coplanarity-Aware GAN


Abstract:

Unsupervised methods have received increasing attention in homography learning due to their promising performance and label-free training. However, existing methods do no...Show More

Abstract:

Unsupervised methods have received increasing attention in homography learning due to their promising performance and label-free training. However, existing methods do not explicitly consider the plane-induced parallax, making the prediction compromised on multiple planes. In this work, we propose a novel method HomoGAN to guide unsupervised homography estimation to focus on the dominant plane. First, a multi-scale transformer is designed to predict homography from the feature pyramids of input images in a coarse-to-fine fashion. Moreover, we propose an unsupervised GAN to impose coplanarity constraint on the predicted homography, which is realized by using a generator to predict a mask of aligned regions, and then a discriminator to check if two masked feature maps are induced by a single homography. Based on the global homography framework, we extend it to the local mesh-grid homography estimation, namely, MeshHomoGAN, where plane constraints can be enforced on each mesh cell to go beyond a single dominant plane, such that scenes with multiple depth planes can be better aligned. To validate the effectiveness of our method and its components, we conduct extensive experiments on large-scale datasets. Results show that our matching error is 22% lower than previous SOTA methods.
Page(s): 1863 - 1876
Date of Publication: 02 December 2024

ISSN Information:

PubMed ID: 40030560

Funding Agency:

Author image of Shuaicheng Liu
School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China
Shuaicheng Liu (Senior Member, IEEE) received the BE degree from Sichuan University, Chengdu, China, in 2008, and the MSc and PhD degrees from the National University of Singapore, Singapore, in 2010 and 2014, respectively. In 2015, he joined the University of Electronic Science and Technology of China (UESTC) and is currently a professor with the Institute of Image Processing, School of Information and Communication Engi...Show More
Shuaicheng Liu (Senior Member, IEEE) received the BE degree from Sichuan University, Chengdu, China, in 2008, and the MSc and PhD degrees from the National University of Singapore, Singapore, in 2010 and 2014, respectively. In 2015, he joined the University of Electronic Science and Technology of China (UESTC) and is currently a professor with the Institute of Image Processing, School of Information and Communication Engi...View more
Author image of Mingbo Hong
Megvii Research Chengdu, Chengdu, China
Mingbo Hong received the BEng degree from Sichuan Agricultural University, China, in 2019, and the MSc degree from Sichuan University, China, in 2022. He is currently a researcher with Megvii Technology, Chengdu. His research interests include computer vision and deep learning.
Mingbo Hong received the BEng degree from Sichuan Agricultural University, China, in 2019, and the MSc degree from Sichuan University, China, in 2022. He is currently a researcher with Megvii Technology, Chengdu. His research interests include computer vision and deep learning.View more
Author image of Yuhang Lu
XPeng Motors US, San Diego, CA, USA
Yuhang Lu received the PhD degree in computer science and engineering from the University of South Carolina, in 2022. Prior to that, he interned with Megvii Technology, in 2021. He is currently a senior software engineer with XPeng Motors. His research interests mainly include computer vision, deep learning and autonomous driving.
Yuhang Lu received the PhD degree in computer science and engineering from the University of South Carolina, in 2022. Prior to that, he interned with Megvii Technology, in 2021. He is currently a senior software engineer with XPeng Motors. His research interests mainly include computer vision, deep learning and autonomous driving.View more
Author image of Nianjin Ye
GreatWall Motor, Chengdu, China
Nianjin Ye received the BEng degree from the University of Electronic Science and Technology of China, Chengdu, China, in 2017, and the MS degree from the University of Electronic Science and Technology of China, Chengdu, China, in 2020. He is a researcher with GreatWall Motor, Chengdu. His research interests include computer vision and deep learning.
Nianjin Ye received the BEng degree from the University of Electronic Science and Technology of China, Chengdu, China, in 2017, and the MS degree from the University of Electronic Science and Technology of China, Chengdu, China, in 2020. He is a researcher with GreatWall Motor, Chengdu. His research interests include computer vision and deep learning.View more
Author image of Chunyu Lin
Institute of Information Science, Beijing Jiaotong University, Beijing, China
Chunyu Lin (Member, IEEE) received the doctor degree from Beijing Jiaotong University (BJTU), Beijing, China, in 2011. He is a professor with Beijing Jiaotong University. From 2009 to 2010, he was a visiting researcher with ICT Group, Delft University of Technology, Netherlands. From 2011 to 2012, he was a post-doctoral researcher with Multimedia Laboratory, Gent University, Belgium. His research interests include multi-v...Show More
Chunyu Lin (Member, IEEE) received the doctor degree from Beijing Jiaotong University (BJTU), Beijing, China, in 2011. He is a professor with Beijing Jiaotong University. From 2009 to 2010, he was a visiting researcher with ICT Group, Delft University of Technology, Netherlands. From 2011 to 2012, he was a post-doctoral researcher with Multimedia Laboratory, Gent University, Belgium. His research interests include multi-v...View more
Author image of Bing Zeng
School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China
Bing Zeng (Fellow, IEEE) received the BEng and MEng degrees in electronic engineering from the University of Electronic Science and Technology of China (UESTC), Chengdu, China, in 1983 and 1986, respectively, and the PhD degree in electrical engineering from the Tampere University of Technology, Tampere, Finland, in 1991. He worked as a postdoctoral fellow with the University of Toronto from September 1991 to July 1992 an...Show More
Bing Zeng (Fellow, IEEE) received the BEng and MEng degrees in electronic engineering from the University of Electronic Science and Technology of China (UESTC), Chengdu, China, in 1983 and 1986, respectively, and the PhD degree in electrical engineering from the Tampere University of Technology, Tampere, Finland, in 1991. He worked as a postdoctoral fellow with the University of Toronto from September 1991 to July 1992 an...View more

Author image of Shuaicheng Liu
School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China
Shuaicheng Liu (Senior Member, IEEE) received the BE degree from Sichuan University, Chengdu, China, in 2008, and the MSc and PhD degrees from the National University of Singapore, Singapore, in 2010 and 2014, respectively. In 2015, he joined the University of Electronic Science and Technology of China (UESTC) and is currently a professor with the Institute of Image Processing, School of Information and Communication Engineering, Chengdu. He works on computer vision, computer graphics and computational imaging related problems, with applications in mobile photography and videography.
Shuaicheng Liu (Senior Member, IEEE) received the BE degree from Sichuan University, Chengdu, China, in 2008, and the MSc and PhD degrees from the National University of Singapore, Singapore, in 2010 and 2014, respectively. In 2015, he joined the University of Electronic Science and Technology of China (UESTC) and is currently a professor with the Institute of Image Processing, School of Information and Communication Engineering, Chengdu. He works on computer vision, computer graphics and computational imaging related problems, with applications in mobile photography and videography.View more
Author image of Mingbo Hong
Megvii Research Chengdu, Chengdu, China
Mingbo Hong received the BEng degree from Sichuan Agricultural University, China, in 2019, and the MSc degree from Sichuan University, China, in 2022. He is currently a researcher with Megvii Technology, Chengdu. His research interests include computer vision and deep learning.
Mingbo Hong received the BEng degree from Sichuan Agricultural University, China, in 2019, and the MSc degree from Sichuan University, China, in 2022. He is currently a researcher with Megvii Technology, Chengdu. His research interests include computer vision and deep learning.View more
Author image of Yuhang Lu
XPeng Motors US, San Diego, CA, USA
Yuhang Lu received the PhD degree in computer science and engineering from the University of South Carolina, in 2022. Prior to that, he interned with Megvii Technology, in 2021. He is currently a senior software engineer with XPeng Motors. His research interests mainly include computer vision, deep learning and autonomous driving.
Yuhang Lu received the PhD degree in computer science and engineering from the University of South Carolina, in 2022. Prior to that, he interned with Megvii Technology, in 2021. He is currently a senior software engineer with XPeng Motors. His research interests mainly include computer vision, deep learning and autonomous driving.View more
Author image of Nianjin Ye
GreatWall Motor, Chengdu, China
Nianjin Ye received the BEng degree from the University of Electronic Science and Technology of China, Chengdu, China, in 2017, and the MS degree from the University of Electronic Science and Technology of China, Chengdu, China, in 2020. He is a researcher with GreatWall Motor, Chengdu. His research interests include computer vision and deep learning.
Nianjin Ye received the BEng degree from the University of Electronic Science and Technology of China, Chengdu, China, in 2017, and the MS degree from the University of Electronic Science and Technology of China, Chengdu, China, in 2020. He is a researcher with GreatWall Motor, Chengdu. His research interests include computer vision and deep learning.View more
Author image of Chunyu Lin
Institute of Information Science, Beijing Jiaotong University, Beijing, China
Chunyu Lin (Member, IEEE) received the doctor degree from Beijing Jiaotong University (BJTU), Beijing, China, in 2011. He is a professor with Beijing Jiaotong University. From 2009 to 2010, he was a visiting researcher with ICT Group, Delft University of Technology, Netherlands. From 2011 to 2012, he was a post-doctoral researcher with Multimedia Laboratory, Gent University, Belgium. His research interests include multi-view geometry, camera calibration, and virtual reality video processing.
Chunyu Lin (Member, IEEE) received the doctor degree from Beijing Jiaotong University (BJTU), Beijing, China, in 2011. He is a professor with Beijing Jiaotong University. From 2009 to 2010, he was a visiting researcher with ICT Group, Delft University of Technology, Netherlands. From 2011 to 2012, he was a post-doctoral researcher with Multimedia Laboratory, Gent University, Belgium. His research interests include multi-view geometry, camera calibration, and virtual reality video processing.View more
Author image of Bing Zeng
School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China
Bing Zeng (Fellow, IEEE) received the BEng and MEng degrees in electronic engineering from the University of Electronic Science and Technology of China (UESTC), Chengdu, China, in 1983 and 1986, respectively, and the PhD degree in electrical engineering from the Tampere University of Technology, Tampere, Finland, in 1991. He worked as a postdoctoral fellow with the University of Toronto from September 1991 to July 1992 and as a researcher with Concordia University from August 1992 to January 1993. He then joined the Hong Kong University of Science and Technology (HKUST). After 20 years of service with HKUST, he returned to UESTC in the summer of 2013. At UESTC, he leads the Institute of Image Processing to work on image and video processing, multimedia communication, computer vision, and AI technology.
Bing Zeng (Fellow, IEEE) received the BEng and MEng degrees in electronic engineering from the University of Electronic Science and Technology of China (UESTC), Chengdu, China, in 1983 and 1986, respectively, and the PhD degree in electrical engineering from the Tampere University of Technology, Tampere, Finland, in 1991. He worked as a postdoctoral fellow with the University of Toronto from September 1991 to July 1992 and as a researcher with Concordia University from August 1992 to January 1993. He then joined the Hong Kong University of Science and Technology (HKUST). After 20 years of service with HKUST, he returned to UESTC in the summer of 2013. At UESTC, he leads the Institute of Image Processing to work on image and video processing, multimedia communication, computer vision, and AI technology.View more

Contact IEEE to Subscribe

References

References is not available for this document.