Anomaly Detection Approach for Pronunciation Verification of Disordered Speech using Speech Attribute Features

abstract

2018 International Speech Communication Association. All rights reserved. The automatic assessment of speech is a powerful tool in computer aided speech therapy for disorders such as Childhood Apraxia of Speech (CAS). However, the lack of sufficient annotated disordered speech data seriously impedes the accurate detection of pronunciation errors. To handle this deficiency, in this paper, we used the novel approach of tackling pronunciation verification as an anomaly detection problem. We achieved this by modeling only the correct pronunciation of each individual phoneme with a one-class Support Vector Machine (SVM) trained using a set of speech attributes features, namely the manner and place of articulation. These features are extracted from a bank of pre-trained Deep Neural Network (DNN) speech attributes classifiers. The one-class SVM model classifies each phoneme production as normal (correct) or an anomaly (incorrect). We evaluated the system using both native speech with artificial errors and disordered speech collected from children with apraxia of speech and compared it with the DNN Goodness of Pronunciation (GOP) algorithm. The results show that our approach reduces the false-rejection rates by around 35% when applied to disordered speech.

name of conference

Interspeech 2018

authors

Ji, Jim

published proceedings

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6

author list (cited authors)

Shahin, M., Ahmed, B., Ji, J. X., & Ballard, K.

citation count

4

complete list of authors

Shahin, Mostafa||Ahmed, Beena||Ji, Jim X||Ballard, Kirrie

publication date

January 2018

publisher

International Speech Communication Association Publisher

published in

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH Journal

keywords

Deep Learning
Disordered Speech
One Class Svm
Pronunciation Verification
Speech Attributes

Digital Object Identifier (DOI)

10.21437/Interspeech.2018-1319

International Standard Book Number (ISBN) 13

978-1-5108-7221-9

start page

1671

end page

1675

volume

2018-September

URL

http://dx.doi.org/10.21437/interspeech.2018-1319

Anomaly Detection Approach for Pronunciation Verification of Disordered Speech using Speech Attribute Features Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

volume

Other

URL