Abstract:
The introduction of depth sensors such as Microsoft Kinect have driven research in human action recognition. Human skeletal data collected from depth sensors convey a sig...Show MoreMetadata
Abstract:
The introduction of depth sensors such as Microsoft Kinect have driven research in human action recognition. Human skeletal data collected from depth sensors convey a significant amount of information for action recognition. While there has been considerable progress in action recognition, most existing skeleton-based approaches neglect the fact that not all human body parts move during many actions, and they fail to consider the ordinal positions of body joints. Here, and motivated by the fact that an action's category is determined by local joint movements, we propose a cuboid model for skeleton-based action recognition. Specifically, a cuboid arranging strategy is developed to organize the pairwise displacements between all body joints to obtain a cuboid action representation. Such a representation is well structured and allows deep CNN models to focus analyses on actions. Moreover, an attention mechanism is exploited in the deep model, such that the most relevant features are extracted. Extensive experiments on our new Yunnan University-Chinese Academy of Sciences-Multimodal Human Action Dataset (CAS-YNU MHAD), the NTU RGB+D dataset, the UTD-MHAD dataset, and the UTKinect-Action3D dataset demonstrate the effectiveness of our method compared to the current state-of-the-art.
Published in: IEEE Transactions on Multimedia ( Volume: 22, Issue: 11, November 2020)
Funding Agency:

FIST LAB, School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, P.R.China
Kaijun Zhu received the B.Eng. degree from the School of Physics and Electrical Engineering, Anqing Normal University, Anqing, China, in 2014, and the M.Sc. degree from the School of Information Science and Engineering, Yunnan University, Kunming, China, in 2019. His current research interests include computer vision and machine learning.
Kaijun Zhu received the B.Eng. degree from the School of Physics and Electrical Engineering, Anqing Normal University, Anqing, China, in 2014, and the M.Sc. degree from the School of Information Science and Engineering, Yunnan University, Kunming, China, in 2019. His current research interests include computer vision and machine learning.View more

Union Vision Innovations Technology, Shenzhen, P.R.China
Ruxin Wang received the B.Eng. degree from Xidian University, Xi’an, China, the M.Sc. degree from the Huazhong University of Science and Technology, Wuhan, China, and the Ph.D. degree from the University of Technology Sydney, Ultimo, NSW, Australia. He is currently a Research Scientist with the Union Visual Innovation Technology Company Ltd., Shenzhen, China. His research interests include deep learning, image restoration...Show More
Ruxin Wang received the B.Eng. degree from Xidian University, Xi’an, China, the M.Sc. degree from the Huazhong University of Science and Technology, Wuhan, China, and the Ph.D. degree from the University of Technology Sydney, Ultimo, NSW, Australia. He is currently a Research Scientist with the Union Visual Innovation Technology Company Ltd., Shenzhen, China. His research interests include deep learning, image restoration...View more

Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China
Qingsong Zhao received the B.Eng. degree from the School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo, China, in 2016. He is currently working toward the M.Eng. degree with the Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Beijing, China. His research interests include human action recognition, transfer learning, and meta learning.
Qingsong Zhao received the B.Eng. degree from the School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo, China, in 2016. He is currently working toward the M.Eng. degree with the Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Beijing, China. His research interests include human action recognition, transfer learning, and meta learning.View more

CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Chinese University of Hong Kong, Hong Kong, China
Jun Cheng received the B.E. and M.E. degrees from the University of Science and Technology of China, Hefei, China, in 1999 and 2002, respectively, and the Ph.D. degree from the Chinese University of Hong Kong, Hong Kong, in 2006. He is currently a Professor with the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, and also with the CAS Key Laboratory of Human-Machine Intelligence-S...Show More
Jun Cheng received the B.E. and M.E. degrees from the University of Science and Technology of China, Hefei, China, in 1999 and 2002, respectively, and the Ph.D. degree from the Chinese University of Hong Kong, Hong Kong, in 2006. He is currently a Professor with the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, and also with the CAS Key Laboratory of Human-Machine Intelligence-S...View more

FIST LAB, School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, P.R.China
Dapeng Tao received the B.E. degree from the Northwestern Polytechnical University, Xi’an, China, and the Ph.D. degree from the South China University of Technology, Guangzhou, China. He is currently a Professor with the School of Information Science and Engineering, Yunnan University, Kunming, China. He has authored or coauthored more than 30 scientific articles. His research interests include machine learning, computer ...Show More
Dapeng Tao received the B.E. degree from the Northwestern Polytechnical University, Xi’an, China, and the Ph.D. degree from the South China University of Technology, Guangzhou, China. He is currently a Professor with the School of Information Science and Engineering, Yunnan University, Kunming, China. He has authored or coauthored more than 30 scientific articles. His research interests include machine learning, computer ...View more

FIST LAB, School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, P.R.China
Kaijun Zhu received the B.Eng. degree from the School of Physics and Electrical Engineering, Anqing Normal University, Anqing, China, in 2014, and the M.Sc. degree from the School of Information Science and Engineering, Yunnan University, Kunming, China, in 2019. His current research interests include computer vision and machine learning.
Kaijun Zhu received the B.Eng. degree from the School of Physics and Electrical Engineering, Anqing Normal University, Anqing, China, in 2014, and the M.Sc. degree from the School of Information Science and Engineering, Yunnan University, Kunming, China, in 2019. His current research interests include computer vision and machine learning.View more

Union Vision Innovations Technology, Shenzhen, P.R.China
Ruxin Wang received the B.Eng. degree from Xidian University, Xi’an, China, the M.Sc. degree from the Huazhong University of Science and Technology, Wuhan, China, and the Ph.D. degree from the University of Technology Sydney, Ultimo, NSW, Australia. He is currently a Research Scientist with the Union Visual Innovation Technology Company Ltd., Shenzhen, China. His research interests include deep learning, image restoration, and computer vision. He has authored and coauthored about ten research papers including the IEEE Transactions on Neural Networks and Learning Systems, IEEE Transactions on Image Processing, and IEEE Transactions on Cybernetics.
Ruxin Wang received the B.Eng. degree from Xidian University, Xi’an, China, the M.Sc. degree from the Huazhong University of Science and Technology, Wuhan, China, and the Ph.D. degree from the University of Technology Sydney, Ultimo, NSW, Australia. He is currently a Research Scientist with the Union Visual Innovation Technology Company Ltd., Shenzhen, China. His research interests include deep learning, image restoration, and computer vision. He has authored and coauthored about ten research papers including the IEEE Transactions on Neural Networks and Learning Systems, IEEE Transactions on Image Processing, and IEEE Transactions on Cybernetics.View more

Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China
Qingsong Zhao received the B.Eng. degree from the School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo, China, in 2016. He is currently working toward the M.Eng. degree with the Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Beijing, China. His research interests include human action recognition, transfer learning, and meta learning.
Qingsong Zhao received the B.Eng. degree from the School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo, China, in 2016. He is currently working toward the M.Eng. degree with the Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Beijing, China. His research interests include human action recognition, transfer learning, and meta learning.View more

CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Chinese University of Hong Kong, Hong Kong, China
Jun Cheng received the B.E. and M.E. degrees from the University of Science and Technology of China, Hefei, China, in 1999 and 2002, respectively, and the Ph.D. degree from the Chinese University of Hong Kong, Hong Kong, in 2006. He is currently a Professor with the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, and also with the CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. His current research interests include computer vision, robotics, machine intelligence, and control.
Jun Cheng received the B.E. and M.E. degrees from the University of Science and Technology of China, Hefei, China, in 1999 and 2002, respectively, and the Ph.D. degree from the Chinese University of Hong Kong, Hong Kong, in 2006. He is currently a Professor with the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, and also with the CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. His current research interests include computer vision, robotics, machine intelligence, and control.View more

FIST LAB, School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, P.R.China
Dapeng Tao received the B.E. degree from the Northwestern Polytechnical University, Xi’an, China, and the Ph.D. degree from the South China University of Technology, Guangzhou, China. He is currently a Professor with the School of Information Science and Engineering, Yunnan University, Kunming, China. He has authored or coauthored more than 30 scientific articles. His research interests include machine learning, computer vision, and robotics. He has served more than ten international journals, including the IEEE Transactions on Neural Networks and Learning Systems, IEEE Transactions on Multimedia, IEEE Transactions on Circuits and Systems for Video Technology, IEEE Signal Processing Letters, and Information Sciences.
Dapeng Tao received the B.E. degree from the Northwestern Polytechnical University, Xi’an, China, and the Ph.D. degree from the South China University of Technology, Guangzhou, China. He is currently a Professor with the School of Information Science and Engineering, Yunnan University, Kunming, China. He has authored or coauthored more than 30 scientific articles. His research interests include machine learning, computer vision, and robotics. He has served more than ten international journals, including the IEEE Transactions on Neural Networks and Learning Systems, IEEE Transactions on Multimedia, IEEE Transactions on Circuits and Systems for Video Technology, IEEE Signal Processing Letters, and Information Sciences.View more