Voice quality enhancement for vocal tract rehabilitation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Vocal rehabilitation devices used by patients after Laryngectomy produce an unnatural sounding speech. Our study aims at increasing the quality of these synthetically generated voices by implementing human-like characteristics. A simplified source filter model, linear predictive coding coefficients and line spectral frequencies were used to model the vocal tract and manipulate the acoustic features of their resulting speech. Two different mapping functions were employed to convert between the features of synthetically generated voice and those of a human voice: A Gaussian mixture model and a linear regression model. The models were trained on a set of 50 human and 50 synthetic voice utterances. Both mapping functions yielded significant changes in the transformed synthetic voices and their spectra were similar to the human voices. The linear regression model mapping produced slightly better results compared to the Gaussian mixture model mapping. Listeners' tests confirmed this result, but indicated that voices re-synthesized from the transformed model coefficients, improved on the synthetic voice but still sounded unnatural. This may imply that the vocal tract model is lacking in information that produces the subjective perception of 'artificial speech'. Future work will investigate an elaborate model which will include the speech production excitation and radiation signals and the transformation of their features. These models have the potential to improve the conversion of synthetically generated electrolarynx voice into human sounding one.

Original languageEnglish
Title of host publication2018 3rd Biennial South African Biomedical Engineering Conference, SAIBMEC 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-4
Number of pages4
ISBN (Electronic)9781538625163
DOIs
Publication statusPublished - 23 May 2018
Externally publishedYes
Event3rd Biennial South African Biomedical Engineering Conference, SAIBMEC 2018 - Stellenbosch, South Africa
Duration: 4 Apr 20186 Apr 2018

Publication series

Name2018 3rd Biennial South African Biomedical Engineering Conference, SAIBMEC 2018

Conference

Conference3rd Biennial South African Biomedical Engineering Conference, SAIBMEC 2018
Country/TerritorySouth Africa
CityStellenbosch
Period4/04/186/04/18

Keywords

  • Gaussian mixture model
  • line spectral frequencies
  • linear predictive coding coefficients
  • linear regression
  • source-filter model
  • voice conversion

Fingerprint

Dive into the research topics of 'Voice quality enhancement for vocal tract rehabilitation'. Together they form a unique fingerprint.

Cite this