Fusion of Video and Inertial Sensing for Deep Learning-Based Human Action Recognition.

abstract

This paper presents a fusion framework that uses video images and inertial signals, captured simultaneously by a video camera and a wearable inertial sensor, to achieve more robust human action recognition than is possible when each sensing modality is used individually. The data captured by these sensors are turned into 3D video images and 2D inertial images, which are fed into a 3D convolutional neural network and a 2D convolutional neural network, respectively, for recognizing actions. Two types of fusion are considered: decision-level fusion and feature-level fusion. Experiments are conducted using the publicly available dataset UTD-MHAD, in which simultaneous video images and inertial signals are captured for a total of 27 actions. The results indicate that both the decision-level and feature-level fusion approaches achieve higher recognition accuracies than either sensing modality used individually. The highest accuracy, 95.6%, is obtained with the decision-level fusion approach.
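
The two fusion types named in the abstract can be illustrated with a minimal sketch. The abstract does not state which combination rule or feature sizes the paper uses, so everything below is an assumption for illustration: decision-level fusion is shown as a product of the two networks' softmax probabilities, feature-level fusion as a concatenation of their feature vectors, and the function names and dimensions are hypothetical. Only the class count of 27 comes from the abstract.

```python
import numpy as np

NUM_ACTIONS = 27  # number of action classes stated in the abstract


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - np.max(logits, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=-1, keepdims=True)


def decision_level_fusion(video_logits, inertial_logits):
    """Combine the class scores of two separately run classifiers.

    Assumed rule (not taken from the paper): multiply the softmax
    probabilities of the two modalities and pick the largest product.
    """
    fused = softmax(video_logits) * softmax(inertial_logits)
    return np.argmax(fused, axis=-1)


def feature_level_fusion(video_features, inertial_features):
    """Join the feature vectors of both modalities into one vector.

    Assumed rule (not taken from the paper): plain concatenation. The
    result would be passed to a shared classifier, which is omitted here.
    """
    return np.concatenate([video_features, inertial_features], axis=-1)


# Hypothetical scores: the video network slightly prefers class 3,
# the inertial network strongly prefers class 5.
video_logits = np.zeros(NUM_ACTIONS)
video_logits[3] = 2.0
video_logits[5] = 1.5
inertial_logits = np.zeros(NUM_ACTIONS)
inertial_logits[5] = 2.0
predicted_action = decision_level_fusion(video_logits, inertial_logits)

# Hypothetical feature sizes (128 and 64) for a batch of 4 samples.
fused_features = feature_level_fusion(np.ones((4, 128)), np.ones((4, 64)))
```

In this sketch the decision-level path needs only the output scores of each network, so the two networks can be run independently, whereas the feature-level path requires a classifier that operates on the joined features.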

authors

Jafari, Roozbeh

published proceedings

Sensors (Basel)

author list (cited authors)

Wei, H., Jafari, R., & Kehtarnavaz, N.

citation count

34

complete list of authors

Wei, Haoran||Jafari, Roozbeh||Kehtarnavaz, Nasser

publication date

August 2019

publisher

MDPI Publisher

published in

Sensors Journal

keywords

Algorithms
Decision-level And Feature-level Fusion For Action Recognition
Deep Learning
Deep Learning-based Action Recognition
Fusion Of Video And Inertial Sensing For Action Recognition
Humans
Neural Networks, Computer
Video Recording
Vision, Ocular
Wearable Electronic Devices

PubMed ID (PMID)

31450609

Digital Object Identifier (DOI)

10.3390/s19173680

start page

3680

end page

3680

volume

19

issue

17

URL

http://dx.doi.org/10.3390/s19173680
