Reduction of non-native accents through statistical parametric articulatory synthesis.

abstract

This paper presents an articulatory synthesis method to transform utterances from a second language (L2) learner to appear as if they had been produced by the same speaker but with a native (L1) accent. The approach consists of building a probabilistic articulatory synthesizer (a mapping from articulators to acoustics) for the L2 speaker, then driving the model with articulatory gestures from a reference L1 speaker. To account for differences in the vocal tract of the two speakers, a Procrustes transform is used to bring their articulatory spaces into registration. In a series of listening tests, accent conversions were rated as being more intelligible and less accented than L2 utterances while preserving the voice identity of the L2 speaker. No significant effect was found between the intelligibility of accent-converted utterances and the proportion of phones outside the L2 inventory. Because the latter is a strong predictor of pronunciation variability in L2 speech, these results suggest that articulatory resynthesis can decouple those aspects of an utterance that are due to the speaker's physiology from those that are due to their linguistic gestures.

authors

Gutierrez-Osuna, Ricardo

published proceedings

J Acoust Soc Am

altmetric score

1.5

author list (cited authors)

Aryal, S., & Gutierrez-Osuna, R.

citation count

10

complete list of authors

Aryal, Sandesh||Gutierrez-Osuna, Ricardo

publication date

January 2015

publisher

Acoustical Society of America (ASA) Publisher

published in

Journal of the Acoustical Society of America Journal

keywords

Acoustics
Algorithms
Audiovisual Aids
Emotions
Equipment Design
Humans
Individuality
Language
Machine Learning
Models, Theoretical
Multilingualism
Pattern Recognition, Physiological
Phonation
Phonetics
Speech Intelligibility
Speech Production Measurement
Teaching
Voice Quality

PubMed Central ID

25618072

Digital Object Identifier (DOI)

10.1121/1.4904701

start page

433

end page

446

volume

137

issue

1

URL

http%3A%2F%2Fdx.doi.org%2F10.1121%2F1.4904701

Reduction of non-native accents through statistical parametric articulatory synthesis. Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

PubMed Central ID

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL