Abstract
This paper investigates different fusion strategies as well as provides insights on their effectiveness alongside standalone classifiers in the framework of paralinguistic analysis of infant vocalizations. The combinations of such systems as Support Vector Machines (SVM) and Extreme Learning Machines (ELM) based classifiers, as well as its weighted kernel version are explored, training systems on different acoustic feature representations and implementing weighted score-level fusion of the predictions. The proposed framework is tested on INTERSPEECH ComParE-2019 Baby Sounds corpus, which is a collection of Home Bank infant vocalization corpora annotated for five classes. Adhering to the challenge protocol, using a single test set submission we outperform the challenge baseline Unweighted Average Recall (UAR) score and achieve a comparable result to the state-of-the-art.
Original language | English |
---|---|
Pages | 509-513 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 29 Oct 2020 |
Event | ICMI 2020 Workshop on Bridging Social Sciences and AI for Understanding Child Behavior - Virtual Event, Utrecht, Netherlands Duration: 29 Oct 2020 → 29 Oct 2020 https://sites.google.com/view/wocbu/home |
Workshop
Workshop | ICMI 2020 Workshop on Bridging Social Sciences and AI for Understanding Child Behavior |
---|---|
Abbreviated title | WoCBU |
Country/Territory | Netherlands |
City | Utrecht |
Period | 29/10/20 → 29/10/20 |
Internet address |
Keywords
- baby sounds classification
- computational paralinguistics
- information fusion
- extreme learning machines
- support vector machines