Abstract
Bipolar disorder is a mental health disorder that causes mood swings that range from depression to mania. Clinical diagnosis of bipolar disorder is based on patient interviews and reports obtained from the relatives of the patients. Subsequently, the diagnosis depends on the experience of the expert, and there is co-morbidity with other mental disorders. Automated processing in the diagnosis of bipolar disorder can help providing quantitative indicators, and allow easier observations of the patients for longer periods. In this paper, we create a multimodal decision system for three level mania classification based on recordings of the patients in acoustic, linguistic, and visual modalities. The system is evaluated on the Turkish Bipolar Disorder corpus we have recently introduced to the scientific community. Comprehensive analysis of unimodal and multimodal systems, as well as fusion techniques, are performed. Using acoustic, linguistic, and visual features in a multimodal fusion system, we achieved a 64.8% unweighted average recall score, which advances the state-of-the-art performance on this dataset.
Original language | English |
---|---|
Article number | 9835028 |
Pages (from-to) | 2119-2131 |
Number of pages | 13 |
Journal | IEEE Transactions on Affective Computing |
Volume | 13 |
Issue number | 4 |
Early online date | 24 Jul 2022 |
DOIs | |
Publication status | Published - 2022 |
Keywords
- Affective disorders
- Bipolar Disorder
- Depression
- Multimodal fusion
- Mania level prediction
- Feature extraction
- Acoustics
- Visualization
- multimodal fusion
- mania level prediction
- Behavioral sciences
- Task analysis
- bipolar disorder
- Interviews