
Effective Active Skeleton Representation for Low Latency Human Action Recognition



Abstract:

With the development of depth sensors, low-latency 3D human action recognition has become increasingly important in various interaction systems, where responding with minimal latency is critical. High latency not only significantly degrades the user's interaction experience, but also makes certain interaction systems, e.g., gesture control or electronic gaming, unattractive. In this paper, we propose a novel active skeleton representation for low-latency human action recognition. First, we encode each limb of the human skeleton into a state through a Markov random field. The active skeleton is then represented by aggregating the encoded features of the individual limbs. Finally, we propose multi-channel multiple-instance learning with a maximum-pattern-margin formulation to further boost the performance of the model. Our method is robust in computing features from joint positions and effective in handling unsegmented sequences. Experiments on the MSR Action3D, MSR DailyActivity3D, and Huawei/3DLife-2013 datasets demonstrate the effectiveness of the model with the proposed representation, and its superiority over state-of-the-art low-latency recognition approaches.
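To make the pipeline concrete, below is a minimal sketch of what the limb-level encoding and aggregation could look like. It is an assumption-laden illustration, not the authors' implementation: the joint-to-limb grouping, the nearest-codeword state assignment (standing in for the Markov random field inference in the paper), and the histogram aggregation are all placeholders.

```python
# Hypothetical sketch of the limb-state encoding and aggregation steps
# summarized in the abstract. The joint grouping, the codebooks, and the
# nearest-codeword rule used in place of the paper's MRF inference are
# illustrative assumptions only.
import numpy as np

# Assumed grouping of Kinect-style joint indices into five limbs.
LIMBS = {
    "left_arm":  [4, 5, 6, 7],
    "right_arm": [8, 9, 10, 11],
    "left_leg":  [12, 13, 14, 15],
    "right_leg": [16, 17, 18, 19],
    "torso":     [0, 1, 2, 3],
}

def encode_limb_states(frames, codebooks):
    """Map each limb in each frame to a discrete state id.

    frames: (T, J, 3) array of 3D joint positions over T frames.
    codebooks: {limb_name: (K, d) array} of per-limb pose codewords;
        a simple nearest-codeword rule stands in for the MRF inference
        used in the paper.
    """
    states = {}
    for limb, joints in LIMBS.items():
        # Flatten the limb's joint coordinates into one pose vector per frame.
        poses = frames[:, joints, :].reshape(len(frames), -1)          # (T, d)
        dists = np.linalg.norm(
            poses[:, None, :] - codebooks[limb][None, :, :], axis=2)   # (T, K)
        states[limb] = dists.argmin(axis=1)                            # (T,)
    return states

def active_skeleton_descriptor(states, num_states):
    """Aggregate per-limb state occurrences into one descriptor by
    concatenating normalized state histograms (one plausible aggregation)."""
    hists = []
    for limb in LIMBS:
        h = np.bincount(states[limb], minlength=num_states).astype(float)
        hists.append(h / max(h.sum(), 1.0))
    return np.concatenate(hists)

# Toy usage: 60 frames of 20 joints, 8 hypothetical states per limb.
rng = np.random.default_rng(0)
frames = rng.normal(size=(60, 20, 3))
codebooks = {l: rng.normal(size=(8, len(j) * 3)) for l, j in LIMBS.items()}
desc = active_skeleton_descriptor(encode_limb_states(frames, codebooks), 8)
```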
Published in: IEEE Transactions on Multimedia (Volume: 18, Issue: 2, February 2016)
Page(s): 141 - 154
Date of Publication: 03 December 2015


I. Introduction

Recently, with the development of depth sensors such as the Nintendo Wii, Microsoft Kinect, and PlayStation Move controllers, depth information can be readily obtained. This has facilitated a new trend in research on 3D action recognition. In fact, the early work of Johansson [1] suggested that the motion of the human skeleton is discriminative enough to identify different human gestures. In particular, Shotton et al. [2] proposed a method to estimate joints from depth maps and provide their 3D positions, from which discriminative features can be extracted to describe the motion of the human skeleton. Building on this method, much work has focused on 3D human action recognition using depth maps [3]–[13]. Those methods are driven by high recognition accuracy, and some of them need to access the entire observation data stream for reliable recognition. However, most applications of depth sensors are oriented toward interaction systems, such as human-computer interaction, electronic entertainment, and smart home technologies, which usually require a prompt response to user actions for system control. In other words, we need to build a low-latency system for recognizing human actions. Here, we discuss low latency from two aspects: computational latency and observational latency. Unlike computational latency, which is determined by the performance of the computer, observational latency is caused by the algorithm itself when the recognition system must observe too much of the data stream before making a decision. High latency causes system lag, which not only significantly degrades the user's interaction experience but also makes certain interaction systems unattractive. The success of these technologies therefore requires flexible algorithms that satisfy two fundamental properties: high recognition accuracy and low latency. Only a few systems have paid attention to observational latency and made an effort to identify an action accurately long before it ends [3], [9], [14].
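To make this distinction concrete, the sketch below shows an online recognizer that scores a growing prefix of an unsegmented stream and commits to a label as soon as it is confident. Everything here is illustrative rather than the paper's system: the scorer, the confidence threshold, and the toy data are assumptions; the point is only how the two latencies are measured.

```python
# Illustrative sketch (not the paper's system) of the two latency notions:
# observational latency is how much of the stream must be seen before a
# confident decision; computational latency is the wall-clock compute time.
import time
import numpy as np

def recognize_online(stream, score, threshold=0.8):
    """Classify an unsegmented skeleton stream from a growing prefix.

    stream: (T, J, 3) array of 3D joint positions over T frames.
    score:  assumed callable mapping a frame prefix to class scores.
    Returns (label, observational latency in frames, compute seconds).
    """
    compute_time = 0.0
    for t in range(1, len(stream) + 1):
        start = time.perf_counter()
        probs = score(stream[:t])           # scores for the prefix seen so far
        compute_time += time.perf_counter() - start
        if probs.max() >= threshold:        # confident enough: answer early
            return int(probs.argmax()), t, compute_time
    return int(probs.argmax()), len(stream), compute_time

# Toy usage: a placeholder scorer whose confidence grows with the evidence,
# so the recognizer answers after 60 of the 100 frames (observational
# latency = 60 frames) instead of waiting for the whole sequence.
rng = np.random.default_rng(0)
stream = rng.normal(size=(100, 20, 3))      # 100 frames, 20 joints, 3D
score = lambda p: np.array([0.5 + len(p) / 200.0, 0.5 - len(p) / 200.0])
label, obs_latency, comp_latency = recognize_online(stream, score)
```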

References
1. G. Johansson, "Visual motion perception", Sci. Amer., vol. 232, no. 6, pp. 76-88, 1975.
2. J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio, R. Moore, et al., "Real-time human pose recognition in parts from single depth images", Proc. IEEE Conf. Comput. Vis. Pattern Recog., pp. 1297-1304, Jun. 2011.
3. C. Ellis, S. Z. Masood, M. F. Tappen, J. J. LaViola and R. Sukthankar, "Exploring the trade-off between accuracy and observational latency in action recognition", Int. J. Comput. Vis., vol. 101, no. 3, pp. 420-436, 2013.
4. X. Cai, W. Zhou and H. Li, "An effective representation for action recognition with human skeleton joints", Proc. SPIE 9273, Optoelectron. Imaging Multimedia Technol. III, 2014.
5. L. Xia, C. Chen and J. K. Aggarwal, "View invariant human action recognition using histograms of 3D joints", Proc. IEEE Conf. Comput. Vis. Pattern Recog. Workshop, pp. 20-27, Jun. 2012.
6. Y. Song, J. Tang, F. Liu and S. Yan, "Body surface context: A new robust feature for action recognition from depth videos", IEEE Trans. Circuits Syst. Video Technol., vol. 24, no. 6, pp. 952-964, Jun. 2014.
7. J. Wang, Z. Liu, Y. Wu and J. Yuan, "Mining actionlet ensemble for action recognition with depth cameras", Proc. IEEE Conf. Comput. Vis. Pattern Recog., pp. 1290-1297, Jun. 2012.
8. B. Ni, P. Moulin and S. Yan, "Order-preserving sparse coding for sequence classification", Proc. Eur. Conf. Comput. Vis., pp. 173-187, 2012.
9. M. Zanfir, M. Leordeanu and C. Sminchisescu, "The moving pose: An efficient 3D kinematics descriptor for low-latency action recognition and detection", Proc. IEEE Int. Conf. Comput. Vis., pp. 2752-2759, Dec. 2013.
10. O. Oreifej and Z. Liu, "HON4D: Histogram of oriented 4D normals for activity recognition from depth sequences", Proc. IEEE Conf. Comput. Vis. Pattern Recog., pp. 716-723, Jun. 2013.
11. Y. Pang, S. Wang and Y. Yuan, "Learning regularized LDA by clustering", IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 12, pp. 2191-2201, Dec. 2014.
12. W. Zhou, M. Yang, H. Li, X. Wang, Y. Lin and Q. Tian, "Towards codebook-free: Scalable cascaded hashing for mobile image search", IEEE Trans. Multimedia, vol. 16, no. 3, pp. 601-611, Apr. 2014.
13. C. Wang, Y. Wang and A. L. Yuille, "An approach to pose-based action recognition", Proc. IEEE Conf. Comput. Vis. Pattern Recog., pp. 915-922, Jun. 2013.
14. X. Zhao, X. Li, C. Pang, X. Zhu and Q. Z. Sheng, "Online human gesture recognition from motion data streams", Proc. ACM Int. Conf. Multimedia, pp. 23-32, 2013.
15. W. Li, Z. Zhang and Z. Liu, "Action recognition based on a bag of 3D points", Proc. IEEE Conf. Comput. Vis. Pattern Recog. Workshop, pp. 9-14, Jun. 2010.
16. J. Luo, W. Wang and H. Qi, "Group sparsity and geometry constrained dictionary learning for action recognition from depth maps", Proc. IEEE Int. Conf. Comput. Vis., pp. 1809-1816, Dec. 2013.
17. M. Raptis, D. Kirovski and H. Hoppe, "Real-time classification of dance gestures from skeleton animation", Proc. ACM Int. Conf. Multimedia, pp. 147-156, 2013.
18. X. Zhao, Y. Liu and Y. Fu, "Exploring discriminative pose sub-patterns for effective action classification", Proc. ACM Int. Conf. Multimedia, pp. 273-282, 2013.
19. J. Zhu, B. Wang, X. Yang, W. Zhang and Z. Tu, "Action recognition with actons", Proc. IEEE Int. Conf. Comput. Vis., pp. 3559-3566, Dec. 2013.
20. S. Andrews, I. Tsochantaridis and T. Hofmann, "Support vector machines for multiple-instance learning", Proc. Adv. Neural Inf. Process. Syst., pp. 561-568, 2003.
21. J. K. Aggarwal and M. S. Ryoo, "Human activity analysis: A review", ACM Comput. Surveys, vol. 43, no. 3, pp. 1-43, 2011.
22. M. Blank, L. Gorelick, E. Shechtman, M. Irani and R. Basri, "Actions as space-time shapes", Proc. IEEE Int. Conf. Comput. Vis., vol. 2, pp. 1395-1402, Oct. 2005.
23. W. Zhou, H. Li, R. Hong, Y. Lu and Q. Tian, "BSIFT: Towards data-independent codebook for large-scale image search", IEEE Trans. Image Process., vol. 24, no. 3, pp. 967-979, Mar. 2015.
24. X. Wu, D. Xu, L. Duan and J. Luo, "Action recognition using context and appearance distribution features", Proc. IEEE Conf. Comput. Vis. Pattern Recog., pp. 489-496, Jun. 2011.
25. X. Zhen, L. Shao, D. Tao and X. Li, "Embedding motion and structure features for action recognition", IEEE Trans. Circuits Syst. Video Technol., vol. 23, no. 7, pp. 1182-1190, Jul. 2013.
26. I. N. Junejo, E. Dexter, I. Laptev and P. Pérez, "View-independent action recognition from temporal self-similarities", IEEE Trans. Pattern Anal. Mach. Intell., vol. 33, no. 1, pp. 172-185, Jan. 2011.
27. M. Raptis, I. Kokkinos and S. Soatto, "Discovering discriminative action parts from mid-level video representations", Proc. IEEE Conf. Comput. Vis. Pattern Recog., pp. 1242-1249, Jun. 2012.
28. W. Zhou, M. Yang, X. Wang and H. Li, "Scalable feature matching by dual cascaded scalar quantization for image retrieval", IEEE Trans. Pattern Anal. Mach. Intell., vol. 38, no. 1, pp. 159-171, Jan. 2016.
29. H. Wang, A. Klaser, C. Schmid and C. Liu, "Action recognition by dense trajectories", Proc. IEEE Conf. Comput. Vis. Pattern Recog., pp. 3169-3176, Jun. 2011.
30. Q. V. Le, W. Y. Zou, S. Y. Yeung and A. Y. Ng, "Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis", Proc. IEEE Conf. Comput. Vis. Pattern Recog., pp. 3361-3368, Jun. 2011.
