Speed-Accuracy Tradeoffs for Detecting Sign Language Content in Video Sharing Sites

abstract

2017 ACM. Sign language is the primary medium of communication for many people who are deaf or hard of hearing. Members of this community access online sign language (SL) content posted on video sharing sites to stay informed. Unfortunately, locating SL videos can be difficult since the text-based search on video sharing sites is based on metadata rather than on the video content. Low cost or real-time video classification techniques would be invaluable for improving access to this content. Our prior work developed a technique to identify SL content based on video features alone but is computationally expensive. Here we describe and evaluate three optimization strategies that have the potential to reduce the computation time without overly impacting precision and recall. Two optimizations reduce the cost of face-detection, whereas the third focuses on analyzing shorter segments of the video. Our results identify a combination of these techniques that yields a 96% reduction in computation time while losing only 1% in F1 score. To further reduce computation, we additionally explore a keyframe-based approach that achieves comparable recall but lower precision than the above techniques, making it appropriate as an early filter in a staged classifier.

name of conference

Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility

authors

published proceedings

PROCEEDINGS OF THE 19TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY (ASSETS'17)

author list (cited authors)

Shipman, F. M., Duggina, S., Monteiro, C., & Gutierrez-Osuna, R.

citation count

6

complete list of authors

Shipman, Frank M||Duggina, Satyakiran||Monteiro, Caio DD||Gutierrez-Osuna, Ricardo

editor list (cited editors)

Hurst, A., Findlater, L., & Morris, M. R.

publication date

January 2017

publisher

Association for Computing Machinery (ACM) Publisher

keywords

Computer Vision
Sign Language Detection
Signal Processing

Digital Object Identifier (DOI)

10.1145/3132525.3132559

International Standard Book Number (ISBN) 13

9781450349260

start page

185

end page

189

URL

http://dl.acm.org/citation.cfm?id=3132525

user-defined tag

3 Good Health and Well-Being

Speed-Accuracy Tradeoffs for Detecting Sign Language Content in Video Sharing Sites Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

editor list (cited editors)

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

Other

URL

user-defined tag