Loading [MathJax]/extensions/MathMenu.js
Dual Stream Spatio-Temporal Motion Fusion With Self-Attention For Action Recognition | IEEE Conference Publication | IEEE Xplore

Dual Stream Spatio-Temporal Motion Fusion With Self-Attention For Action Recognition


Abstract:

Human action recognition in diverse and realistic environments is a challenging task. Automatic classification of action and gestures has a significant impact on human-ro...Show More

Abstract:

Human action recognition in diverse and realistic environments is a challenging task. Automatic classification of action and gestures has a significant impact on human-robot interaction and human-machine interaction technologies. Due to the prevalence of complex real-world problems, it is non-trivial to produce a rich representation of actions and to produce an effective categorical distribution of large action classes. Deep convolutional neural networks have obtained great success in this area. Many researchers have proposed deep neural architectures for action recognition while considering the spatial and temporal aspects of the action. This research proposes a dual stream spatiotemporal fusion architecture for human action classification. The spatial and temporal data is fused using an attention mechanism. We investigate two fusion techniques and show that the proposed architecture achieves accurate results with much fewer parameters as compared to the traditional deep neural networks. We achieved 99.1 % absolute accuracy on the UCF-101 test set.
Date of Conference: 02-05 July 2019
Date Added to IEEE Xplore: 27 February 2020
ISBN Information:
Conference Location: Ottawa, ON, Canada

Contact IEEE to Subscribe

References

References is not available for this document.