Document Type
Article
Publication Date
1-1-2022
Publication Title
Computer Methods and Programs in Biomedicine Update
Volume
2
Keywords
Classifier, Deep learning, EGG, Pathology detection, Speech, Voice generation
Abstract
This paper presents a convolutional neural network (CNN) based automated noninvasive voice pathology detection system. The proposed system functions in two steps. First, it discriminates pathological voices from healthy ones, and then, it classifies the discriminated pathological voices into one of the three pathologies. Two CNNs are used for these purposes; one works as a binary classifier to identify pathological voices. The other one works as a multiclass classifier for categorizing the voice pathologies. This work investigates the effectiveness of electroglottographic (EGG) and speech signals to detect and classify pathological voices using sustained vowel ('/a/') samples. EGG signals can assess the vibratory pattern of the vocal folds during voiced sound. On the other hand, the speech signals add spectral color to the EGG signals. Hence, their contributions for pathology identification and segregation differ, as demonstrated in this work. The Saarbrücken Voice Database (SVD) is used in this investigation. The results show that the proposed system achieves a higher accuracy (more than 9%) in identifying pathological voices from healthy ones with speech signals than EGG signals. However, categorizing pathological voices into different pathology types demonstrates higher accuracy (more than 12%) with EGG signals than speech signals. A comparative performance analysis of the proposed system is presented with these two signals in terms of clinical and statistical measures. The obtained results of this work are also compared with those of other related published works.
DOI
10.1016/j.cmpbup.2022.100074
E-ISSN
26669900
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Recommended Citation
Islam, Rumana; Abdel-Raheem, Esam; and Tarique, Mohammed. (2022). Voice pathology detection using convolutional neural networks with electroglottographic (EGG) and speech signals. Computer Methods and Programs in Biomedicine Update, 2.
https://scholar.uwindsor.ca/electricalengpub/489