Journal of System Simulation ›› 2024, Vol. 36 ›› Issue (3): 649-658.doi: 10.16182/j.issn1004731x.joss.23-0223

• Papers • Previous Articles     Next Articles

Gesture Recognition for Dynamic Vision Sensor Based on Multi-dimensional Projection Spatiotemporal Event Frame

Kang Lai1,2(), Zhang Yakun3   

  1. 1.College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
    2.Laboratory for Big Data and Decision, National University of Defense Technology, Changsha 410073, China
    3.PLA 61081 Troops, Beijing 100089, China
  • Received:2023-02-28 Revised:2023-04-23 Online:2024-03-15 Published:2024-03-14

Abstract:

Vision-based gesture recognition is a commonly used means of human-computer interaction in the fields of virtual reality and game simulation. In practical applications, rapid changes in gesture movements will lead to blurred imaging with traditional RGB cameras or depth cameras, which brings great challenges to gesture recognition. To solve the above problems, a dynamic visual data gesture recognition method based on a multi-dimensional projection spatiotemporal event frame (STEF) is proposed by a using dynamic vision sensor to capture high-speed gesture movement information. The spatiotemporal information is embedded in the data projection surface and fused to form a multi-dimensional projection STEF, which overcomes the limitation of the time-domain information loss of the existing event frame expression method of dynamic visual information and improves the feature expression ability of dynamic visual sensing data. On this basis, advanced spiking neural networks are used to classify STEFs to realize gesture recognition. The recognition accuracy of the above method on the public dataset reaches 96.67%, which is better than similar methods, indicating that the proposed method can significantly improve the accuracy of gesture recognition in dynamic visual sensing data.

Key words: dynamic vision sensor, gesture recognition, multi-dimensional projection, spatiotemporal event frame, spiking neural network

CLC Number: