Abstract:
Recently, Two-Stream Convolutional Network has achieved remarkable performance. Especially, by capturing appearance and motion information, spatial-temporal two- stream n...Show MoreMetadata
Abstract:
Recently, Two-Stream Convolutional Network has achieved remarkable performance. Especially, by capturing appearance and motion information, spatial-temporal two- stream networks bring noticeable improvement. On the other hand, dynamic image, which is a powerful representation for videos, has also been confirmed to provide complimentary information to spatial appearance. Inspired by these works, we proposed Triple-Stream Convolutional Networks by fusing a third network stream whose input is dynamic image. In this paper, we implement the proposed Triple-Stream Convolutional Networks and evaluated them in two aspects: (a) how the overall end-to-end classification performance can be benefited by adding the dynamic stream; (b) which way is efficient to use the trained Triple-Stream Convolutional Networks in classification. Our evaluation shows improvements over both single networks (spatial and temporal) and Fused Spatial-temporal Two-Stream Network.
Published in: 2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)
Date of Conference: 29 November 2017 - 01 December 2017
Date Added to IEEE Xplore: 21 December 2017
ISBN Information:
Graduate School of Informatics, Nagoya University Nagoya, Japan
Graduate School of Informatics, Nagoya University Nagoya, Japan
Graduate School of Informatics, Nagoya University Nagoya, Japan
Graduate School of Informatics, Nagoya University Nagoya, Japan
Graduate School of Informatics, Nagoya University Nagoya, Japan
Graduate School of Informatics, Nagoya University Nagoya, Japan