Automated Classification of Depression Severity Using Speech-A Comparison of Two Machine Learning Architectures

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Depression affects approximately 300 million people worldwide, resulting in significant suffering and economic costs. Millions of sufferers remain undiagnosed and untreated due to a shortage of trained personnel, social stigma, and expensive treatments. Two novel machine learning architectures, used to predict depression severity from audio recordings, are presented and compared in this study. The data was taken from the Distress Analysis Interview Corpus, which contains recordings of 189 participant interviews and their Public Health Questionnaire 8 depression scores. Feature extraction and feature selection were performed on the participants' speech, and two machine learning architectures were designed to provide prediction models for depression severity. In the first architecture, participants' data were initially classified into depressed or not-depressed classes, and a regression model was trained on each class. The second architecture sorted the data into depression severity classes, which were then used in addition to the original features to predict the depression scores. The second architecture outperformed the first in both the classification and regression stages, achieving an RMSE value of 4.1, a significant improvement over previous studies that reported RMSE values of 6.32 to 6.94 for the same data. The results demonstrate a potential for a speech-based depression screening tool, able to assist healthcare professionals in the diagnosis and monitoring of patients, and to provide a scalable depression screening method enabling individuals to recognise their illnesses and seek professional help.

Original languageEnglish
Title of host publication2020 IEEE International Conference on Healthcare Informatics, ICHI 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728153827
DOIs
Publication statusPublished - Nov 2020
Externally publishedYes
Event8th IEEE International Conference on Healthcare Informatics, ICHI 2020 - Virtual, Oldenburg, Germany
Duration: 30 Nov 20203 Dec 2020

Publication series

Name2020 IEEE International Conference on Healthcare Informatics, ICHI 2020

Conference

Conference8th IEEE International Conference on Healthcare Informatics, ICHI 2020
Country/TerritoryGermany
CityVirtual, Oldenburg
Period30/11/203/12/20

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Depression recognition
  • disease severity discrimination
  • multi-stage classifiers
  • Speech analytics
  • Speech signal processing for healthcare

Fingerprint

Dive into the research topics of 'Automated Classification of Depression Severity Using Speech-A Comparison of Two Machine Learning Architectures'. Together they form a unique fingerprint.

Cite this