UniParser: Multi-Human Parsing With Unified Correlation Representation Learning

Abstract:

Multi-human parsing is an image segmentation task that requires both instance-level and fine-grained category-level information. However, prior research has typically processed these two types of information through distinct branch types and output formats, leading to inefficient and redundant frameworks. This paper introduces UniParser, which integrates instance-level and category-level representations in three key aspects: 1) we propose a unified correlation representation learning approach, allowing our network to learn instance and category features within the cosine space; 2) we unify the output form of each module as pixel-level results while supervising instance and category features using a homogeneous label accompanied by an auxiliary loss; and 3) we design a joint optimization procedure to fuse instance and category representations. By unifying instance-level and category-level output, UniParser circumvents manually designed post-processing techniques and surpasses state-of-the-art methods, achieving 49.3% AP on MHPv2.0 and 60.4% AP on CIHP. We have released our source code, pretrained models, and demos to facilitate future studies at https://github.com/cjm-sfw/Uniparser.
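
As a rough illustration of what learning features "within the cosine space" and producing pixel-level results could look like, the sketch below computes cosine correlation maps between per-pixel feature vectors and a set of prototype vectors, then assigns each pixel to its best-matching prototype. This is a minimal sketch under stated assumptions, not the UniParser implementation: the function names, the use of prototype vectors, and the argmax assignment are all hypothetical choices made for illustration, and the abstract does not specify any of them.

```python
import math


def l2_normalize(vec, eps=1e-12):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / max(norm, eps) for x in vec]


def cosine_correlation_maps(pixel_features, prototypes):
    """Return one H x W correlation map per prototype, with values in [-1, 1].

    pixel_features: H x W grid of C-dimensional vectors (nested lists).
    prototypes: list of C-dimensional vectors, one per instance or category.
                These are hypothetical stand-ins for learned representations.
    """
    unit_prototypes = [l2_normalize(p) for p in prototypes]
    maps = []
    for proto in unit_prototypes:
        rows = []
        for row in pixel_features:
            unit_pixels = [l2_normalize(px) for px in row]
            rows.append([sum(a * b for a, b in zip(px, proto)) for px in unit_pixels])
        maps.append(rows)
    return maps


def assign_labels(maps):
    """Pixel-level result: index of the prototype with the highest correlation.

    Argmax is an assumption made for this sketch, not the paper's procedure.
    """
    height, width = len(maps[0]), len(maps[0][0])
    return [
        [max(range(len(maps)), key=lambda k: maps[k][i][j]) for j in range(width)]
        for i in range(height)
    ]


# Toy input: a 1 x 2 feature grid with 2-dimensional features and two prototypes.
features = [[[1.0, 0.0], [0.0, 2.0]]]
prototypes = [[2.0, 0.0], [0.0, 1.0]]
maps = cosine_correlation_maps(features, prototypes)
labels = assign_labels(maps)
```

Because both inputs are normalized to unit length, the maps depend only on the direction of the feature vectors, not their magnitude. In this toy setup, the same routine serves instance prototypes and category prototypes alike, which is one way to picture both kinds of output sharing a single pixel-level form.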

Published in: IEEE Transactions on Image Processing (Volume: 33)

Page(s): 5159 - 5171

Date of Publication: 12 September 2024

PubMed ID: 39264771

DOI: 10.1109/TIP.2024.3456004

Jiaming Chu

School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, China

Jiaming Chu is currently pursuing the Ph.D. degree in electronic science and technology with Beijing University of Posts and Telecommunications. His research interests include deep learning and computer vision, with in-depth research in sub-fields, such as human action recognition, instance segmentation, and human parsing.

Lei Jin

School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, China

Lei Jin received the degree from Beijing University of Posts and Telecommunications (BUPT), Beijing, China. He is currently an Associate Research Fellow with BUPT. His research interests include computer vision, data mining, and pattern recognition, with in-depth research in sub-fields, such as human pose estimation, human action recognition, and human parsing, with related research results published in high-level confere...

Yinglei Teng

School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, China

Yinglei Teng (Senior Member, IEEE) received the B.S. degree from Shandong University, China, in 2005, and the Ph.D. degree in electrical engineering from Beijing University of Posts and Telecommunications (BUPT), in 2011. She is currently a Professor with the School of Electronic Engineering, BUPT. Her current research interests include wireless communications, edge intelligence, and connected AI.

Jianshu Li

Ant Group, Beijing, China

Jianshu Li received the Ph.D. degree from the School of Computing, National University of Singapore, in 2019, advised by Prof. Terence Sim and Prof. Shuicheng Yan. He has been an Algorithm Expert with Ant Group since 2018, mainly working on face analysis algorithms, including face recognition, face liveness detection, face quality analysis, and Deepfake detection. He has published more than 20 papers in journals and conferences. His research interests include computer vision and image understanding, particularly face and human analytics, semantic segmentation, and object detection. He received the Gold Award of PREMIA 2019 Singapore, the Best Student Paper Award of ACMMM 2018, the winner prize in object localization at ILSVRC 2017, and the winner prize in the emotion recognition challenge at ICMI 2016. He has served as an Invited Reviewer for CVPR, ECCV, NIPS, IJCAI, FG, ICMI, IEEE Transactions on Image Processing, IEEE Transactions on Circuits and Systems for Video Technology, and IEEE Transactions on Multimedia.

Yunchao Wei

Beijing Jiaotong University, Beijing, China

Yunchao Wei is currently a Full Professor with Beijing Jiaotong University. He has published more than 100 papers in top-tier conferences and journals, with more than 13000 Google citations. He has broad research interests in computer vision and machine learning. His current research interests include visual recognition with imperfect data, image/video segmentation and object detection, and multi-modal perception. He was selected for MIT TR35 China by MIT Technology Review in 2021 and was named one of the five top early-career researchers in engineering and computer sciences in Australia by The Australian in 2020. He received the Discovery Early Career Researcher Award from the Australian Research Council in 2019 and the First Prize in Science and Technology awarded by the China Society of Image and Graphics (CSIG) in 2019. He has received many competition prizes from CVPR/ICCV/ECCV, such as the winner prizes of ILSVRC 2014, LIP 2018/2019, and YouTube VOS 2021, and the runner-up prizes of ILSVRC 2017 and DAVIS 2020. He has organized many workshops at top-tier conferences, including the Learning from Imperfect Data Workshop series (CVPR 2019, 2020, and 2021) and the Real-world Recognition from Low-Quality Inputs Workshop series (ICCV 2019 and ECCV 2020).

Zheng Wang

Wuhan University, Wuhan, China

Zheng Wang (Senior Member, IEEE) received the Ph.D. degree from the National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, in 2017. He was a JSPS Fellowship Researcher with the National Institute of Informatics, Japan, and a Project Assistant Professor with The University of Tokyo, Japan. He is currently a Professor with the National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University. His research interests include image processing and video content analysis. He is also an Associate Editor of IEEE Transactions on Image Processing.

Junliang Xing

Tsinghua University, Beijing, China

Junliang Xing (Senior Member, IEEE) received the dual B.E. degree in computer science and applied mathematics from Xi’an Jiaotong University in 2007 and the Ph.D. degree in computer science and technology from Tsinghua University in 2012. He is currently a Professor with the Department of Computer Science and Technology, Tsinghua University. He has published over 120 peer-reviewed papers in conferences, such as IJCAI, AAAI, ICCV, and CVPR, and in journals, such as IEEE Transactions on Pattern Analysis and Machine Intelligence, International Journal of Computer Vision, and Artificial Intelligence, which have received more than 19000 citations on Google Scholar. His research interests include computer vision and computer gaming, with a current focus on human-computer interactive learning in complex decision-making scenarios.

Shuicheng Yan

Skywork AI, 2 Science park drive, Singapore

Shuicheng Yan (Fellow, IEEE) is currently the Director of Skywork AI, Singapore. He has authored or co-authored more than 600 papers in top international journals and conferences, with more than 40000 Google Scholar citations and an H-index of 105. His research interests include computer vision, machine learning, and multimedia analysis. He is a fellow of the Academy of Engineering, Singapore, an ACM Fellow, and an IAPR Fellow. He was among the Thomson Reuters Highly Cited Researchers in 2014, 2015, 2016, 2018, and 2019. His team has received winner or honorable-mention prizes ten times in two core competitions, Pascal VOC and ImageNet (ILSVRC), which are regarded as the World Cup of the computer vision community. His team was a recipient of ten best paper or best student paper prizes, including a grand slam at ACM MM, the top conference in multimedia, comprising the Best Paper Award, the Best Student Paper Award, and the Best Demo Award.

Jian Zhao

EVOL Lab, Institute of AI (TeleAI), China Telecom and the School of Artificial Intelligence, Optics and Electronics (iOPEN), Northwestern Polytechnical University (NWPU), Xi’an, China

Jian Zhao (Member, IEEE) received the Ph.D. degree from the National University of Singapore (NUS). He is currently the Leader of the EVOL Laboratory, a Principal Research Scientist with the Institute of AI (TeleAI), China Telecom, China, and a Researcher and Ph.D. Supervisor with the School of Artificial Intelligence, Optics and Electronics (iOPEN), Northwestern Polytechnical University (NWPU), Xi’an, Shaanxi, China. He has published over 60 CCF-A academic articles, including first-author IEEE Transactions on Pattern Analysis and Machine Intelligence ×2 (IF: 20.8) and International Journal of Computer Vision ×3 (IF: 11.6). He has also been granted five national invention patents as the first inventor. The related technical achievements have been applied and verified in seven leading technology enterprises, including China Telecom, Baidu, Ant Financial, and Qihoo 360, and have produced significant benefits. His research interests include vicinagearch security, AI + cultural tourism, and multi-modal AI agent. He served as a member of the board of directors of the Beijing Society of Image and Graphics (BSIG) and as an Editorial Board Member for the internationally renowned journals Pattern Recognition, Electronics and Signal Processing, Artificial Intelligence Advances, and IET Computer Vision. He is a Senior Member of the China Society of Image and Graphics (CSIG) and the Chinese Association for Artificial Intelligence (CAAI). He received the 2020–2022 Young Elite Scientist Sponsorship Program from the China Association for Science and Technology (CAST) and the 2021–2023 Beijing Young Elite Scientist Sponsorship Program from the Beijing Association for Science and Technology (BAST). He is in charge of seven relevant projects supported by the National Natural Science Foundation of China (NSFC).
He has won the WU WEN JUN AI Outstanding Youth Award (2023), the First Prize of the WU WEN JUN AI Natural Science Award (2/5, 2022), the PREMIA Lee Hwee Kuan Award (2019), and the ACM Multimedia Best Student Paper Award (first author, 1/208, CCF-A conference, 2018). He has also won eight winner awards in domestic and foreign technical challenges. He also served as the Senior Area Chair for the Vision and Learning Seminar (VALSE), the Session Chair for ACM Multimedia 2021, the Area Chair for CICAI 2022/2023, and the Workshop Chair for CCBR 2024. He served as the Guest Editor for the Pattern Recognition Letters Special Issue on Recent Advances in Deep Learning Model Security and the Electronics Special Issue on Multimedia Content Analysis, Management and Retrieval: Trends and Challenges.
