Voice Pathology Detection Using Deep Learning: A Preliminary Study

Harar, Pavol; Alonso-Hernández, Jesús B.; Mekyska, Jiri; Galaz, Zoltan; Burget, Radim; Smekal, Zdenek

Title:	Voice Pathology Detection Using Deep Learning: A Preliminary Study
Authors:	Harar, Pavol; Alonso-Hernández, Jesús B.; Mekyska, Jiri; Galaz, Zoltan; Burget, Radim; Smekal, Zdenek
UNESCO Classification:	3307 Electronic technology
Issue Date:	2017
Publisher:	Institute of Electrical and Electronics Engineers (IEEE)
Conference:	5th IEEE International Work Conference on Bio-Inspired Intelligence, IWOBI 2017
Abstract:	This paper describes a preliminary investigation of voice pathology detection using deep neural networks (DNN). We used voice recordings of the sustained vowel /a/ produced at normal pitch from the German corpus Saarbruecken Voice Database (SVD). This corpus contains voice recordings and electroglottograph signals of more than 2,000 speakers. The idea behind this experiment is to apply convolutional layers in combination with recurrent Long Short-Term Memory (LSTM) layers to the raw audio signal. Each recording was split into 64 ms Hamming-windowed segments with 30 ms overlap. Our trained model achieved 71.36% accuracy with 65.04% sensitivity and 77.67% specificity on 206 validation files, and 68.08% accuracy with 66.75% sensitivity and 77.89% specificity on 874 testing files. This is a promising result in favor of this approach because it is comparable to a similar previously published experiment that used a different methodology. Further investigation is needed to achieve state-of-the-art results.
URI:	https://accedacris.ulpgc.es/handle/10553/69730
ISBN:	9781538608500
DOI:	10.1109/IWOBI.2017.7985525
Source:	2017 International Work Conference on Bio-Inspired Intelligence: Intelligent Systems for Biodiversity Conservation, IWOBI 2017 - Proceedings, Funchal, e17032869
Appears in Collections:	Conference proceedings
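
The segmentation step described in the abstract (64 ms Hamming-windowed segments with 30 ms overlap) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the function name `split_into_segments`, the 16 kHz sample rate, and the synthetic test tone are all hypothetical, and the record does not say how trailing samples that do not fill a whole window were handled (here they are simply dropped).

```python
import numpy as np


def split_into_segments(signal, sample_rate, window_ms=64.0, overlap_ms=30.0):
    """Split a 1-D signal into overlapping, Hamming-windowed segments.

    Hypothetical helper: window and overlap lengths follow the abstract,
    everything else (names, rounding, tail handling) is an assumption.
    """
    signal = np.asarray(signal, dtype=np.float64)
    window_len = int(round(sample_rate * window_ms / 1000.0))
    overlap_len = int(round(sample_rate * overlap_ms / 1000.0))
    hop = window_len - overlap_len  # step between consecutive segment starts
    if hop <= 0:
        raise ValueError("overlap must be shorter than the window")
    if signal.size < window_len:
        return np.empty((0, window_len))

    # Trailing samples that do not fill a complete window are dropped.
    n_segments = 1 + (signal.size - window_len) // hop
    starts = hop * np.arange(n_segments)
    idx = starts[:, None] + np.arange(window_len)[None, :]

    # Apply the same Hamming window to every segment (broadcast over rows).
    return signal[idx] * np.hamming(window_len)


# Assumed input: one second of a 100 Hz tone at an assumed 16 kHz sample rate.
sample_rate = 16_000
t = np.arange(sample_rate) / sample_rate
segments = split_into_segments(np.sin(2 * np.pi * 100.0 * t), sample_rate)
print(segments.shape)  # (28, 1024): 1024-sample windows, 544-sample hop
```

At the assumed 16 kHz rate, a 64 ms window is 1024 samples and a 30 ms overlap is 480 samples, so consecutive segments start 544 samples apart. Each row of the result would then be one input example for a network operating on raw audio.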
