Abstract
The field of paralinguistics is growing rapidly with a wide range of applications that go beyond recognition of emotions, laughter and personality. The research flourishes in multiple directions such as signal representation and classification, addressing the issues of the domain. Apart from the noise robustness, an important issue with real life data is the imbalanced nature: some classes of states/traits are under-represented. Combined with the high dimensionality of the feature vectors used in the state-of-the-art analysis systems, this issue poses the threat of over-fitting. While the kernel trick can be employed to handle the dimensionality issue, regular classifiers inherently aim to minimize the misclassification error and hence are biased towards the majority class. A solution to this problem is oversampling of the minority class(es). However, this brings increased memory/computational costs, while not bringing any new information to the classifier. In this work, we propose anew weighting scheme on instances of the original dataset, employing Weighted Kernel Extreme Learning Machine, and inspired from that, introducing theWeighted Partial Least Squares Regression based classifier. The proposed methods are applied on all three INTERSPEECH ComParE 2017 challenge corpora, giving better or competitive results compared to the challenge baselines.
Original language | English |
---|---|
Title of host publication | INTERSPEECH-2017 |
Pages | 3527-3531 |
Number of pages | 5 |
Volume | 2017-August |
DOIs | |
Publication status | Published - 1 Jan 2017 |
Event | 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 - Stockholm, Sweden Duration: 20 Aug 2017 → 24 Aug 2017 |
Publication series
Name | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
---|---|
ISSN (Print) | 2308-457X |
Conference
Conference | 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 |
---|---|
Country/Territory | Sweden |
City | Stockholm |
Period | 20/08/17 → 24/08/17 |
Funding
This work is partially supported by RFBR (project № 16-37-60100), grant of the President of Russia (№ MD-254.2017.8) and by the Government of Russia (grant № 074-U01).
Keywords
- Addressee
- Computational paralinguistics
- ELM
- Fisher vector
- Imbalanced data
- Snoring
- Weighted PLS