Abstract
Depression affects approximately 300 million people worldwide, resulting in significant suffering and economic costs. Millions of sufferers remain undiagnosed and untreated due to a shortage of trained personnel, social stigma, and expensive treatments. Two novel machine learning architectures, used to predict depression severity from audio recordings, are presented and compared in this study. The data was taken from the Distress Analysis Interview Corpus, which contains recordings of 189 participant interviews and their Public Health Questionnaire 8 depression scores. Feature extraction and feature selection were performed on the participants' speech, and two machine learning architectures were designed to provide prediction models for depression severity. In the first architecture, participants' data were initially classified into depressed or not-depressed classes, and a regression model was trained on each class. The second architecture sorted the data into depression severity classes, which were then used in addition to the original features to predict the depression scores. The second architecture outperformed the first in both the classification and regression stages, achieving an RMSE value of 4.1, a significant improvement over previous studies that reported RMSE values of 6.32 to 6.94 for the same data. The results demonstrate a potential for a speech-based depression screening tool, able to assist healthcare professionals in the diagnosis and monitoring of patients, and to provide a scalable depression screening method enabling individuals to recognise their illnesses and seek professional help.
| Original language | English |
|---|---|
| Title of host publication | 2020 IEEE International Conference on Healthcare Informatics, ICHI 2020 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781728153827 |
| DOIs | |
| Publication status | Published - Nov 2020 |
| Externally published | Yes |
| Event | 8th IEEE International Conference on Healthcare Informatics, ICHI 2020 - Virtual, Oldenburg, Germany Duration: 30 Nov 2020 → 3 Dec 2020 |
Publication series
| Name | 2020 IEEE International Conference on Healthcare Informatics, ICHI 2020 |
|---|
Conference
| Conference | 8th IEEE International Conference on Healthcare Informatics, ICHI 2020 |
|---|---|
| Country/Territory | Germany |
| City | Virtual, Oldenburg |
| Period | 30/11/20 → 3/12/20 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Depression recognition
- disease severity discrimination
- multi-stage classifiers
- Speech analytics
- Speech signal processing for healthcare
Fingerprint
Dive into the research topics of 'Automated Classification of Depression Severity Using Speech-A Comparison of Two Machine Learning Architectures'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver