Combining Clustering and Functionals based Acoustic Feature Representations for Classification of Baby Sounds

H. Kaya, Oxana Verkholyak, Maxim Markitantov, Alexey Karpov

Research output: Contribution to conference › Paper › Academic

Abstract

This paper investigates different fusion strategies and provides insights into their effectiveness relative to standalone classifiers in the framework of paralinguistic analysis of infant vocalizations. We explore combinations of classifiers based on Support Vector Machines (SVM) and Extreme Learning Machines (ELM), as well as the weighted kernel version of ELM, training the systems on different acoustic feature representations and applying weighted score-level fusion to their predictions. The proposed framework is tested on the INTERSPEECH ComParE-2019 Baby Sounds corpus, a collection of HomeBank infant vocalization corpora annotated for five classes. Adhering to the challenge protocol and using a single test set submission, we outperform the challenge baseline in terms of Unweighted Average Recall (UAR) and achieve a result comparable to the state of the art.
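
To make the two quantities named in the abstract concrete, the following is a minimal sketch of weighted score-level fusion of two classifiers, scored with UAR. It is not the authors' implementation: the function names, the convex-combination form of the fusion, and the grid search over the weight are assumptions made for illustration, and in practice the weight would be chosen on a development set rather than on test data.

```python
import numpy as np


def unweighted_average_recall(y_true, y_pred, n_classes):
    """UAR: the mean of per-class recalls, so every class counts equally.

    Classes that never occur in y_true are skipped.
    """
    recalls = []
    for c in range(n_classes):
        mask = y_true == c
        if mask.any():
            recalls.append(np.mean(y_pred[mask] == c))
    return float(np.mean(recalls))


def fuse_scores(scores_a, scores_b, weight):
    """Assumed fusion rule: convex combination of two score matrices.

    Both inputs have shape (n_samples, n_classes); weight lies in [0, 1].
    """
    return weight * scores_a + (1.0 - weight) * scores_b


def pick_fusion_weight(scores_a, scores_b, y_true, grid=None):
    """Hypothetical helper: grid-search the weight that maximizes UAR."""
    if grid is None:
        grid = np.linspace(0.0, 1.0, 11)
    n_classes = scores_a.shape[1]
    best_w, best_uar = 0.0, -1.0
    for w in grid:
        pred = fuse_scores(scores_a, scores_b, w).argmax(axis=1)
        uar = unweighted_average_recall(y_true, pred, n_classes)
        if uar > best_uar:  # keep the first weight that reaches the best UAR
            best_w, best_uar = float(w), uar
    return best_w, best_uar
```

Here `scores_a` and `scores_b` stand for the per-class outputs of any two systems (for example an SVM and an ELM trained on different feature representations). UAR is used instead of accuracy because it is not dominated by the majority class.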
Original language: English
Pages: 509-513
Number of pages: 5
Publication status: Published - 29 Oct 2020
Event: ICMI 2020 Workshop on Bridging Social Sciences and AI for Understanding Child Behavior - Virtual Event, Utrecht, Netherlands
Duration: 29 Oct 2020 - 29 Oct 2020
https://sites.google.com/view/wocbu/home

Workshop

Workshop: ICMI 2020 Workshop on Bridging Social Sciences and AI for Understanding Child Behavior
Abbreviated title: WoCBU
Country/Territory: Netherlands
City: Utrecht
Period: 29/10/20 - 29/10/20

Keywords

  • baby sounds classification
  • computational paralinguistics
  • information fusion
  • extreme learning machines
  • support vector machines
