Sigma-lognormal modeling of speech

Carmona-Duarte, C.; Ferrer, M.A.; Plamondon, R.; Gómez-Rodellar, A.; Gómez-Vilda, P.

Título:	Sigma-lognormal modeling of speech
Autores/as:	Carmona-Duarte, C. Ferrer, M.A. Plamondon, R. Gómez-Rodellar, A. Gómez-Vilda, P.
Clasificación UNESCO:	3307 Tecnología electrónica
Palabras clave:	Aging Kinematic Theory Of Rapid Human Movements Modeling Of The Neuromotor System Sigma-Lognormal Model Speech Kinematics, et al.
Fecha de publicación:	2021
Proyectos:	TEC2016-77791
Publicación seriada:	Cognitive Computation
Resumen:	Human movement studies and analyses have been fundamental in many scientific domains, ranging from neuroscience to education, pattern recognition to robotics, health care to sports, and beyond. Previous speech motor models were proposed to understand how speech movement is produced and how the resulting speech varies when some parameters are changed. However, the inverse approach, in which the muscular response parameters and the subject’s age are derived from real continuous speech, is not possible with such models. Instead, in the handwriting field, the kinematic theory of rapid human movements and its associated Sigma-lognormal model have been applied successfully to obtain the muscular response parameters. This work presents a speech kinematics-based model that can be used to study, analyze, and reconstruct complex speech kinematics in a simplified manner. A method based on the kinematic theory of rapid human movements and its associated Sigma-lognormal model are applied to describe and to parameterize the asymptotic impulse response of the neuromuscular networks involved in speech as a response to a neuromotor command. The method used to carry out transformations from formants to a movement observation is also presented. Experiments carried out with the (English) VTR-TIMIT database and the (German) Saarbrucken Voice Database, including people of different ages, with and without laryngeal pathologies, corroborate the link between the extracted parameters and aging, on the one hand, and the proportion between the first and second formants required in applying the kinematic theory of rapid human movements, on the other. The results should drive innovative developments in the modeling and understanding of speech kinematics.
URI:	https://accedacris.ulpgc.es/handle/10553/77817
ISSN:	1866-9956
DOI:	10.1007/s12559-020-09803-8
Fuente:	Cognitive Computation[ISSN 1866-9956], n. 13, p. 488–503, (Enero 2021)
Colección:	Artículos

Unknown (2,3 MB)

Vista completa

Unknown (2,3 MB)

Citas SCOPUS^TM

Citas de WEB OF SCIENCE^TM
Citations

Visitas 5

Descargas

Google Scholar^TM

Altmetric

Comparte

Exporta metadatos

Dirección

Contacto

Legal

De interés

Unknown (2,3 MB)

Citas SCOPUSTM

Citas de WEB OF SCIENCETM Citations

Visitas 5

Descargas

Google ScholarTM

Altmetric

Comparte

Exporta metadatos

Dirección

Citas SCOPUS^TM

Citas de WEB OF SCIENCE^TM
Citations

Google Scholar^TM