Abstract
Large-scale action recognition datasets contain more instances of adults than children, and models trained with these datasets may not perform well for children. In this study, we test if current state-of-the-art deep learning models have some systemic bias in decoding the activity being performed by an adult or a child. We collected a sports activity recognition dataset with child and adult labels. We fine-tuned a state-of-the-art action recognition classifier on two different segments of our dataset, containing only children or only adults. Our results show that cross-condition generalization performance of the resulting networks is not similar. Our results indicate that the child-specific segment is more complex to generalize than the adult-specific segment. The dataset and the code are made publicly available.
Original language | English |
---|---|
Title of host publication | 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) |
Publisher | IEEE |
Pages | 1563-1570 |
ISBN (Electronic) | 978-988-14768-9-0 |
Publication status | Published - 2021 |
Event | 13th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) - Duration: 13 Dec 2021 → … |
Conference
Conference | 13th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) |
---|---|
Period | 13/12/21 → … |
Keywords
- deep learning
- training
- activity recognition
- data collection
- video analysis