Revisiting Representation Learning and Identity Adversarial Training for Facial Behavior Understanding

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Facial Action Unit (AU) detection has gained significant attention as it enables the breakdown of complex facial expressions into individual muscle movements. In this paper, we revisit two fundamental factors in AU detection: diverse and large-scale data and subject identity regularization. Motivated by recent advances in foundation models, we highlight the importance of data and introduce Face 9 M, a diverse dataset comprising 9 million facial images from multiple public sources. Pretraining a masked autoencoder on Face9M yields strong performance in AU detection and facial expression tasks. More importantly, we emphasize that the Identity Adversarial Training (IAT) has not been well explored in AU tasks. To fill this gap, we first show that subject identity in AU datasets creates shortcut learning for the model and leads to suboptimal solutions to AU predictions. Secondly, we demonstrate that strong IAT regularization is necessary to learn identityinvariant features. Finally, we elucidate the design space of IAT and empirically show that IAT circumvents the identity-based shortcut learning and results in a better solution. Our proposed methods, Facial Masked Autoencoder (FMAE) and IAT, are simple, generic and effective. Remarkably, the proposed FMAEIAT approach achieves new state-of-the-art F1 scores on BP4D (67.1%), BP4D+ (66.8%), and DISFA (70.1%) databases, significantly outperforming previous work. We release the code and model at https://github.com/forever208/FMAE-IAT.

Original languageEnglish
Title of host publication2025 IEEE 19th International Conference on Automatic Face and Gesture Recognition, FG 2025
PublisherIEEE
ISBN (Electronic)9798331553418
ISBN (Print)9798331553418
DOIs
Publication statusPublished - 6 Aug 2025
Event19th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2025 - Tampa, United States
Duration: 26 May 202530 May 2025

Publication series

Name2025 IEEE 19th International Conference on Automatic Face and Gesture Recognition, FG 2025

Conference

Conference19th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2025
Country/TerritoryUnited States
CityTampa
Period26/05/2530/05/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Fingerprint

Dive into the research topics of 'Revisiting Representation Learning and Identity Adversarial Training for Facial Behavior Understanding'. Together they form a unique fingerprint.

Cite this