Skip to main navigation Skip to search Skip to main content

UpStory: the uppsala storytelling dataset

  • Marc Fraile*
  • , Natalia Calvo-Barajas
  • , Anastasia Sophia Apeiron
  • , Giovanna Varni
  • , Joakim Lindblad
  • , Nataša Sladoje
  • , Ginevra Castellano
  • *Corresponding author for this work
  • Uppsala University
  • University of Trento

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Friendship and rapport play an important role in the formation of constructive social interactions, and have been widely studied in education due to their impact on learning outcomes. Given the growing interest in automating the analysis of such phenomena through Machine Learning, access to annotated interaction datasets is highly valuable. However, no dataset on child-child interactions explicitly capturing rapport currently exists. Moreover, despite advances in the automatic analysis of human behavior, no previous work has addressed the prediction of rapport in child-child interactions in educational settings. We present UpStory - the Uppsala Storytelling dataset: a novel dataset of naturalistic dyadic interactions between primary school aged children, with an experimental manipulation of rapport. Pairs of children aged 8-10 participate in a task-oriented activity: designing a story together, while being allowed free movement within the play area. We promote balanced collection of different levels of rapport by using a within-subjects design: self-reported friendships are used to pair each child twice, either minimizing or maximizing pair separation in the friendship network. The dataset contains data for 35 pairs, totaling 3 h 40 m of audiovisual recordings. It includes two video sources, and separate voice recordings per child. An anonymized version of the dataset is made publicly available, containing per-frame head pose, body pose, and face features. Finally, we confirm the informative power of the UpStory dataset by establishing baselines for the prediction of rapport. A simple approach achieves 68% test accuracy using data from one child, and 70% test accuracy aggregating data from a pair.

Original languageEnglish
Article number1547578
Number of pages25
JournalFrontiers in Robotics and AI
Volume12
DOIs
Publication statusPublished - 21 Jul 2025

Bibliographical note

Publisher Copyright:
Copyright © 2025 Fraile, Calvo-Barajas, Apeiron, Varni, Lindblad, Sladoje and Castellano.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was partly funded by the Centre for Interdisciplinary Mathematics, Uppsala University; the Marianne and Marcus Wallenberg Foundation and the Marcus and Amalia Wallenberg Foundation, partly via the Wallenberg AI, Autonomous Systems and Software Program-Humanities and Society (WASP-HS); the Horizon Europe SymAware project (project number: 101070802); and the Swedish Research Council (grant number 2020-03167).

FundersFunder number
Marcus och Amalia Wallenbergs minnesfond
Uppsala Universitet
Horizon Europe SymAware project101070802
Vetenskapsrådet2020-03167

    Keywords

    • child-child interaction
    • machine learning
    • multimodal dataset
    • rapport
    • social signals

    Fingerprint

    Dive into the research topics of 'UpStory: the uppsala storytelling dataset'. Together they form a unique fingerprint.

    Cite this