Detection of Mouth Movements and Its Applications to Cross-Modal Analysis of Planning Meetings

abstract

Detection of meaningful meeting events is very important for cross-modal analysis of planning meetings. Many important events are related to speaker's communication behavior. In visual-audio based speaker detection, mouth positions and movements are needed as visual information. We present our techniques to detect mouth positions and movements of a talking person in meetings. First, we build a skin color model with the Gaussian distribution. After training with skin color samples, we obtain parameters for the model. A skin color filter is created corresponding to the model with a threshold. We detect face regions for all participants in the meeting. Second, We create a mouth template and perform image matching to find candidates of the mouth in each face region. Next, according to the fact that the skin color in lip areas is different from other areas in the face region, by comparing dissimilarities of skin color between candidates and the original color model, we decide the mouth area from the candidates. Finally, we detect mouth movements by computing normalized cross-correlation coefficients of mouth area between two successive frames. A real-time system has been implemented to track speaker's mouth positions and detection mouth movements. Applications also include video conferencing and improving human computer interaction (HCI). Examples in meeting environments and others are provided. 2009 IEEE.

name of conference

2009 International Conference on Multimedia Information Networking and Security

authors

Quek, Francis

published proceedings

2009 International Conference on Multimedia Information Networking and Security

author list (cited authors)

Xiong, Y., Fang, B., & Quek, F.

citation count

1

complete list of authors

Xiong, Yingen||Fang, Bing||Quek, Francis

publication date

January 2009

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

keywords

Eye Disease And Disorders Of Vision

Digital Object Identifier (DOI)

10.1109/mines.2009.258

International Standard Book Number (ISBN) 13

9780769538433

start page

225

end page

229

volume

1

URL

http://dx.doi.org/10.1109/mines.2009.258

Detection of Mouth Movements and Its Applications to Cross-Modal Analysis of Planning Meetings Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

volume

Other

URL