Abstract
The analysis of spoken emotions is of increasing interest in human computer interaction, in order to drive the machine communication into a humane manner. It has manifold applications ranging from intelligent tutoring systems to affect sensitive robots, from smart call centers to patient telemonitoring. In general the study of computational paralinguistics, which covers the analysis of speaker states and traits, faces with real life challenges of inter-speaker and intercorpus variability. In this paper, a brief summary of the progress and future directions of my PhD study titled Adap- Tive Mixture Models for Speech Emotion Recognition that targets these challenges are given. An automatic mixture model selection method for Mixture of Factor Analyzers is proposed for modeling high dimensional data. To provide the mentioned statistical method a compact set of potent features, novel feature selection methods based on Canonical Correlation Analysis are introduced.
Original language | English |
---|---|
Title of host publication | ICMI 2014 - Proceedings of the 2014 International Conference on Multimodal Interaction |
Publisher | Association for Computing Machinery |
Pages | 359-363 |
Number of pages | 5 |
ISBN (Electronic) | 9781450328852 |
DOIs | |
Publication status | Published - 12 Nov 2014 |
Event | 16th ACM International Conference on Multimodal Interaction, ICMI 2014 - Istanbul, Turkey Duration: 12 Nov 2014 → 16 Nov 2014 |
Conference
Conference | 16th ACM International Conference on Multimodal Interaction, ICMI 2014 |
---|---|
Country/Territory | Turkey |
City | Istanbul |
Period | 12/11/14 → 16/11/14 |
Keywords
- Canonical correlation analysis
- Depression recognition
- Factor analysis
- Feature extraction
- Local fisher discriminant analysis
- Mixture modeling
- Mixture of factor analyzers
- Speech emotion recognition audio-visual fusion