Textual, and Multimodal Features for Detecting Sign Language in Video Sharing Sites Comparing Visual

abstract

2018 IEEE. Easy recording and sharing of video content has led to the creation and distribution of increasing quantities of sign language (SL) content. Current capabilities make locating SL videos on a desired topic dependent on the existence and correctness of metadata indicating both the language and topic of the video. Automated techniques to detect sign language content can aid this problem. This paper compares metadata-based classifiers and multimodal classifiers, using both early and late fusion techniques, with video content-based classifiers in the literature. Comparisons of applying TF-IDF, LDA, and NMF in the generation of metadata features indicates that NMF performs best, either when used independently or when combined with video features. Multimodal classifiers perform better than unimodal SL video classifiers. Experiments show multimodal features obtained results of up to 86% precision, 81% recall, and 84% F1 score. This represents an improvement on F1 score of roughly 9% in comparison with the video-based approach presented in the literature and an improvement of 6% over text-based features extracted using NMF.

name of conference

2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)

authors

published proceedings

IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018)

author list (cited authors)

Monteiro, C., Shipman, F., & Gutierrez-Osuna, R.

citation count

4

complete list of authors

Monteiro, Caio DD||Shipman, Frank||Gutierrez-Osuna, Ricardo

publication date

April 2018

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

keywords

Machine Learning
Multimodal Classification
Sign Language Detection

Digital Object Identifier (DOI)

10.1109/MIPR.2018.00010

International Standard Book Number (ISBN) 13

9781538618578

start page

7

end page

12

URL

http://dx.doi.org/10.1109/mipr.2018.00010

Comparing Visual, Textual, and Multimodal Features for Detecting Sign Language in Video Sharing Sites Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

Other

URL