Journal of System Simulation ›› 2019, Vol. 31 ›› Issue (11): 2382-2387.doi: 10.16182/j.issn1004731x.joss.19-FZ0369

Previous Articles     Next Articles

A Temporal Action Detection Algorithm Based on Spatio-Temporal Feature Pyramid Network

Liu Wang, Sun Jinyu, Ma Shiwei*   

  1. School of Mechatronic Engineering and Automation, Shanghai University, Shanghai 200444, China
  • Received:2019-05-21 Revised:2019-07-23 Online:2019-11-10 Published:2019-12-13

Abstract: In view of the discontinuity of motion timing detection in the frame-level prediction network structure, a novel algorithm based on spatio-temporal feature pyramid network (ST-FPN) is proposed. In the frame-level action prediction, several 3D convolution-de-convolution (CDC) networks are used to sample spatial feature down to 1 dimension and sample temporal feature up to corresponding proposal level. Then the prediction scores of different CDC networks are fused by non-maximum suppression (NMS). The softmax classifier is used to classify frame-level actions, and then temporal action detection is obtained. The experimental results on dataset THUMOS14 show that the proposed algorithm improves the accuracy of temporal action detection.

Key words: temporal action detection, feature fusion, spatio-temporal feature pyramid network, 3D convolution-de-convolution, non-maximum suppression

CLC Number: