Multi-band dysperiodicity analyses of disordered connected speech

Grenez, Francis; Alpan, Ali; Maryn, Youri; Kacha, Abdellah; Schoentgen, Jean

doi:10.1016/j.specom.2010.06.010



Reference: Speech Communication, 53, pages 131-141
Publication: Published, 2010

Peer-reviewed article

Abstract:

The objective is to analyse vocal dysperiodicities in connected speech produced by dysphonic speakers. The analysis involves a variogram-based method that enables tracking instantaneous vocal dysperiodicities. The dysperiodicity trace is summarized by means of the signal-to-dysperiodicity ratio, which has been shown to correlate strongly with the perceived degree of hoarseness of the speaker. Previously, this method has been evaluated on small corpora only. In this article, analyses have been carried out on two corpora comprising over 250 and 700 speakers. This has enabled carrying out multi-frequency band and multi-cue analyses without risking overfitting. The analysis results are compared to the cepstral peak prominence, which is a popular cue that indirectly summarizes vocal dysperiodicities frame-wise. A perceptual rating has been available for the first corpus whereas speakers in the second corpus have been categorized as normal or pathological only. For the first corpus, results show that the correlation with perceptual scores increases statistically significantly for multi-band analysis compared to conventional full-band analysis. Also, combining the cepstral peak prominence with the low-frequency band signal-to-dysperiodicity ratio statistically significantly increases their combined correlation with perceptual scores. The signal-to-dysperiodicity ratios of the two corpora have been separately submitted to principal component analysis. The results show that the first two principal components are interpretable in terms of the degree of dysphonia and the spectral slope, respectively. The clinical relevance of the principal components has been confirmed by linear discriminant analysis. © 2010 Elsevier B.V. All rights reserved.
