Please use this identifier to cite or link to this item: http://hdl.handle.net/10553/69730
Title: Voice Pathology Detection Using Deep Learning: A Preliminary Study
Authors: Harar, Pavol
Alonso-Hernández, Jesús B. 
Mekyska, Jiri
Galaz, Zoltan
Burget, Radim
Smekal, Zdenek
UNESCO Clasification: 3307 Tecnología electrónica
Issue Date: 2017
Publisher: Institute of Electrical and Electronics Engineers (IEEE) 
Conference: 5th IEEE International Work Conference on Bio-Inspired Intelligence, IWOBI 2017 
Abstract: This paper describes a preliminary investigation of Voice Pathology Detection using Deep Neural Networks (DNN). We used voice recordings of sustained vowel /a/ produced at normal pitch from German corpus Saarbruecken Voice Database (SVD). This corpus contains voice recordings and electroglottograph signals of more than 2 000 speakers. The idea behind this experiment is the use of convolutional layers in combination with recurrent Long-Short-Term-Memory (LSTM) layers on raw audio signal. Each recording was split into 64 ms Hamming windowed segments with 30 ms overlap. Our trained model achieved 71.36% accuracy with 65.04% sensitivity and 77.67% specificity on 206 validation files and 68.08% accuracy with 66.75% sensitivity and 77.89% specificity on 874 testing files. This is a promising result in favor of this approach because it is comparable to similar previously published experiment that used different methodology. Further investigation is needed to achieve the state-of-the-art results.
URI: http://hdl.handle.net/10553/69730
ISBN: 9781538608500
DOI: 10.1109/IWOBI.2017.7985525
Source: 2017 International Work Conference on Bio-Inspired Intelligence: Intelligent Systems for Biodiversity Conservation, IWOBI 2017 - Proceedings, Funchal, e17032869
Appears in Collections:Actas de congresos
Show full item record

SCOPUSTM   
Citations

89
checked on Nov 24, 2024

Page view(s)

86
checked on Jun 8, 2024

Google ScholarTM

Check

Altmetric


Share



Export metadata



Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.