Loading web-font TeX/Main/Regular
PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection | IEEE Journals & Magazine | IEEE Xplore

PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection


Abstract:

Human-Object Interaction (HOI) detection aims to understand human activities by detecting interaction triplets. Previous HOI detection methods adopt a two-stage instance-...Show More

Abstract:

Human-Object Interaction (HOI) detection aims to understand human activities by detecting interaction triplets. Previous HOI detection methods adopt a two-stage instance-driven paradigm. Unfortunately, many non-interactive human-object pairs generated by the first stage are the main obstacle impeding HOI detectors from high efficiency and promising performance. To remedy this, we propose a novel top-down interaction-driven paradigm, detecting interactions first and bridging interactive human-object pairs through interactions. We formulate HOI as a point triplet < human point, interaction point, object point> and design a Parallel Point Detection and Matching (PPDM) framework. We further take advantage of two-stage methods and propose a novel framework, PPDM++, that detects the interactive human-object pairs by PPDM, then extracts region features for each pair to predict actions. The core of PPDM/PPDM++ is to convert the instance-driven bottom-up paradigm to an interaction-driven top-down paradigm, thus avoiding additional computation costs from traversing a tremendous number of non-interactive pairs. Benefiting from the advanced paradigm, PPDM/PPDM++ has achieved significant performance gains with high efficiency. PPDM-DLA-34 has achieved 19.94 mAP with 42 FPS as the first real-time HOI detector, and PPDM++-SwinB achieves 30.1 mAP with 17 FPS on HICO-DET dataset. We also built an application-oriented database named HOI-A, a supplement to the existing datasets.
Page(s): 6826 - 6841
Date of Publication: 10 April 2024

ISSN Information:

PubMed ID: 38598380

Funding Agency:

Author image of Yue Liao
Institute of Artificial Intelligence, Beihang University, Beijing, China
Yue Liao received the master's degree from the Institute of Information Engineering, Chinese Academy of Sciences. He is currently working toward the PhD degree with the School of Computer Science and Engineering, Beihang University. His research interests include human-object interaction detection, visual grounding, and object detection. He has published more than 10 papers at top journals and conferences, including IEEE ...Show More
Yue Liao received the master's degree from the Institute of Information Engineering, Chinese Academy of Sciences. He is currently working toward the PhD degree with the School of Computer Science and Engineering, Beihang University. His research interests include human-object interaction detection, visual grounding, and object detection. He has published more than 10 papers at top journals and conferences, including IEEE ...View more
Author image of Si Liu
Institute of Artificial Intelligence, Beihang University, Beijing, China
Si Liu received the PhD degree from the Institute of Automation, Chinese Academy of Sciences. She is currently a professor with Beihang University. She has been a research assistant and postdoc with the National University of Singapore. Her research interests include computer vision and multimedia analysis. She has published more than 70 cutting-edge papers on human-related analysis and vision-language understating. She w...Show More
Si Liu received the PhD degree from the Institute of Automation, Chinese Academy of Sciences. She is currently a professor with Beihang University. She has been a research assistant and postdoc with the National University of Singapore. Her research interests include computer vision and multimedia analysis. She has published more than 70 cutting-edge papers on human-related analysis and vision-language understating. She w...View more
Author image of Yulu Gao
Institute of Artificial Intelligence, Beihang University, Beijing, China
Yulu Gao received the bachelor's degree from Beihang University. He is currently working toward the PhD degree with the Department of Computer Science and Engineering, Beihang University. His research interests include object detection and visual tracking.
Yulu Gao received the bachelor's degree from Beihang University. He is currently working toward the PhD degree with the Department of Computer Science and Engineering, Beihang University. His research interests include object detection and visual tracking.View more
Author image of Aixi Zhang
Institute of Artificial Intelligence, Beihang University, Beijing, China
Aixi Zhang received the Mphil degree from the Hong Kong University of Science and Technology, Hong Kong, China, in 2014. He is now a senior researcher with Taobao (China) Software Co. Ltd., Alibaba Group. His research interests include computer vision, HOI detection, and multi-modal video understanding. He has published four papers at top conferences and journals including NIPS, CVPR, ACM MM and the IEEE Transactions on I...Show More
Aixi Zhang received the Mphil degree from the Hong Kong University of Science and Technology, Hong Kong, China, in 2014. He is now a senior researcher with Taobao (China) Software Co. Ltd., Alibaba Group. His research interests include computer vision, HOI detection, and multi-modal video understanding. He has published four papers at top conferences and journals including NIPS, CVPR, ACM MM and the IEEE Transactions on I...View more
Author image of Zhimin Li
School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, China
Zhimin Li received the BS degree from the Wuhan University of Technology, Wuhan, China, in 2019, and the MS degree from the Huazhong University of Science and Technology, Wuhan, in 2022. His research interests include video analysis and scene understanding. To date, he has published in these fields with some peer-reviewed technical papers at premium conferences including AAAI, CVPR, ACM, MM, etc. He currently serves as th...Show More
Zhimin Li received the BS degree from the Wuhan University of Technology, Wuhan, China, in 2019, and the MS degree from the Huazhong University of Science and Technology, Wuhan, in 2022. His research interests include video analysis and scene understanding. To date, he has published in these fields with some peer-reviewed technical papers at premium conferences including AAAI, CVPR, ACM, MM, etc. He currently serves as th...View more
Author image of Fei Wang
School of Information Science and Technology, University of Science and Technology of China, Hefei, China
Fei Wang received the bachelor's and master's degrees from the Beijing University of Posts and Telecommunications. He is currently working toward the PhD degree with the University of Science and Technology of China. He is the director of SenseTime Intelligent Automotive Group. He is the head of SenseAuto-Parking engineering and SenseAuto-Cabin research. He has published more than 30 papers at CVPR/NIPS/ICCV and gained mo...Show More
Fei Wang received the bachelor's and master's degrees from the Beijing University of Posts and Telecommunications. He is currently working toward the PhD degree with the University of Science and Technology of China. He is the director of SenseTime Intelligent Automotive Group. He is the head of SenseAuto-Parking engineering and SenseAuto-Cabin research. He has published more than 30 papers at CVPR/NIPS/ICCV and gained mo...View more
Author image of Bo Li
Institute of Artificial Intelligence, Beihang University, Beijing, China
Bo Li is currently a Changjiang distinguished professor with the School of Computer Science and Engineering, Beihang University. He is a recipient of the National Science Fund for Distinguished Young Scholars. He is currently the dean of AI Research Institute, Beihang University. He is the chief scientist of National 973 Program and the principal investigator of the National Key Research and Development Program. He has pu...Show More
Bo Li is currently a Changjiang distinguished professor with the School of Computer Science and Engineering, Beihang University. He is a recipient of the National Science Fund for Distinguished Young Scholars. He is currently the dean of AI Research Institute, Beihang University. He is the chief scientist of National 973 Program and the principal investigator of the National Key Research and Development Program. He has pu...View more

Author image of Yue Liao
Institute of Artificial Intelligence, Beihang University, Beijing, China
Yue Liao received the master's degree from the Institute of Information Engineering, Chinese Academy of Sciences. He is currently working toward the PhD degree with the School of Computer Science and Engineering, Beihang University. His research interests include human-object interaction detection, visual grounding, and object detection. He has published more than 10 papers at top journals and conferences, including IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Transactions on Image Processing, NIPS, CVPR, and ECCV, etc. He was the Champion of CVPR 2021 ActivityNet Homage Challenge. He has been serving as a reviewer for numerous academic journals and conferences, such as IEEE Transactions on Circuits and Systems for Video Technology, IEEE Transactions on Multimedia, CVPR, ICCV, ECCV, AAAI, and ACM MM.
Yue Liao received the master's degree from the Institute of Information Engineering, Chinese Academy of Sciences. He is currently working toward the PhD degree with the School of Computer Science and Engineering, Beihang University. His research interests include human-object interaction detection, visual grounding, and object detection. He has published more than 10 papers at top journals and conferences, including IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Transactions on Image Processing, NIPS, CVPR, and ECCV, etc. He was the Champion of CVPR 2021 ActivityNet Homage Challenge. He has been serving as a reviewer for numerous academic journals and conferences, such as IEEE Transactions on Circuits and Systems for Video Technology, IEEE Transactions on Multimedia, CVPR, ICCV, ECCV, AAAI, and ACM MM.View more
Author image of Si Liu
Institute of Artificial Intelligence, Beihang University, Beijing, China
Si Liu received the PhD degree from the Institute of Automation, Chinese Academy of Sciences. She is currently a professor with Beihang University. She has been a research assistant and postdoc with the National University of Singapore. Her research interests include computer vision and multimedia analysis. She has published more than 70 cutting-edge papers on human-related analysis and vision-language understating. She was the recipient of Best Paper Award of ACM MM 2021 and 2013, Best Demo Award of ACM MM 2012. She was the Champion of CVPR 2017 Look Into Person Challenge and the organizer of the ECCV 2018, ICCV 2019, CVPR 2021, CVPR 2022 and ACM MM 2022 Person in Context Challenges. She is the associate editor of IEEE Transactions on Multimedia and IEEE Transactions on Circuits and Systems for Video Technology.
Si Liu received the PhD degree from the Institute of Automation, Chinese Academy of Sciences. She is currently a professor with Beihang University. She has been a research assistant and postdoc with the National University of Singapore. Her research interests include computer vision and multimedia analysis. She has published more than 70 cutting-edge papers on human-related analysis and vision-language understating. She was the recipient of Best Paper Award of ACM MM 2021 and 2013, Best Demo Award of ACM MM 2012. She was the Champion of CVPR 2017 Look Into Person Challenge and the organizer of the ECCV 2018, ICCV 2019, CVPR 2021, CVPR 2022 and ACM MM 2022 Person in Context Challenges. She is the associate editor of IEEE Transactions on Multimedia and IEEE Transactions on Circuits and Systems for Video Technology.View more
Author image of Yulu Gao
Institute of Artificial Intelligence, Beihang University, Beijing, China
Yulu Gao received the bachelor's degree from Beihang University. He is currently working toward the PhD degree with the Department of Computer Science and Engineering, Beihang University. His research interests include object detection and visual tracking.
Yulu Gao received the bachelor's degree from Beihang University. He is currently working toward the PhD degree with the Department of Computer Science and Engineering, Beihang University. His research interests include object detection and visual tracking.View more
Author image of Aixi Zhang
Institute of Artificial Intelligence, Beihang University, Beijing, China
Aixi Zhang received the Mphil degree from the Hong Kong University of Science and Technology, Hong Kong, China, in 2014. He is now a senior researcher with Taobao (China) Software Co. Ltd., Alibaba Group. His research interests include computer vision, HOI detection, and multi-modal video understanding. He has published four papers at top conferences and journals including NIPS, CVPR, ACM MM and the IEEE Transactions on Image Processing. He was the Champion of CVPR 2021 ActivityNet Homage Challenge and CVPR 2020 DeepFashion2 Fashion Retrieval Challenge.
Aixi Zhang received the Mphil degree from the Hong Kong University of Science and Technology, Hong Kong, China, in 2014. He is now a senior researcher with Taobao (China) Software Co. Ltd., Alibaba Group. His research interests include computer vision, HOI detection, and multi-modal video understanding. He has published four papers at top conferences and journals including NIPS, CVPR, ACM MM and the IEEE Transactions on Image Processing. He was the Champion of CVPR 2021 ActivityNet Homage Challenge and CVPR 2020 DeepFashion2 Fashion Retrieval Challenge.View more
Author image of Zhimin Li
School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, China
Zhimin Li received the BS degree from the Wuhan University of Technology, Wuhan, China, in 2019, and the MS degree from the Huazhong University of Science and Technology, Wuhan, in 2022. His research interests include video analysis and scene understanding. To date, he has published in these fields with some peer-reviewed technical papers at premium conferences including AAAI, CVPR, ACM, MM, etc. He currently serves as the reviewer of several conferences, including CVPR, ECCV, etc.
Zhimin Li received the BS degree from the Wuhan University of Technology, Wuhan, China, in 2019, and the MS degree from the Huazhong University of Science and Technology, Wuhan, in 2022. His research interests include video analysis and scene understanding. To date, he has published in these fields with some peer-reviewed technical papers at premium conferences including AAAI, CVPR, ACM, MM, etc. He currently serves as the reviewer of several conferences, including CVPR, ECCV, etc.View more
Author image of Fei Wang
School of Information Science and Technology, University of Science and Technology of China, Hefei, China
Fei Wang received the bachelor's and master's degrees from the Beijing University of Posts and Telecommunications. He is currently working toward the PhD degree with the University of Science and Technology of China. He is the director of SenseTime Intelligent Automotive Group. He is the head of SenseAuto-Parking engineering and SenseAuto-Cabin research. He has published more than 30 papers at CVPR/NIPS/ICCV and gained more than 4,000 Google Scholar Citations during the last few years. His research interests include automotive drive system, AI chip, deep learning, etc.
Fei Wang received the bachelor's and master's degrees from the Beijing University of Posts and Telecommunications. He is currently working toward the PhD degree with the University of Science and Technology of China. He is the director of SenseTime Intelligent Automotive Group. He is the head of SenseAuto-Parking engineering and SenseAuto-Cabin research. He has published more than 30 papers at CVPR/NIPS/ICCV and gained more than 4,000 Google Scholar Citations during the last few years. His research interests include automotive drive system, AI chip, deep learning, etc.View more
Author image of Bo Li
Institute of Artificial Intelligence, Beihang University, Beijing, China
Bo Li is currently a Changjiang distinguished professor with the School of Computer Science and Engineering, Beihang University. He is a recipient of the National Science Fund for Distinguished Young Scholars. He is currently the dean of AI Research Institute, Beihang University. He is the chief scientist of National 973 Program and the principal investigator of the National Key Research and Development Program. He has published more than 100 papers in top journals and conferences and held more than 50 domestic and foreign patents.
Bo Li is currently a Changjiang distinguished professor with the School of Computer Science and Engineering, Beihang University. He is a recipient of the National Science Fund for Distinguished Young Scholars. He is currently the dean of AI Research Institute, Beihang University. He is the chief scientist of National 973 Program and the principal investigator of the National Key Research and Development Program. He has published more than 100 papers in top journals and conferences and held more than 50 domestic and foreign patents.View more

Contact IEEE to Subscribe

References

References is not available for this document.