Temporal Contrastive Pretraining for Video Action Recognition - Normandie Université Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Temporal Contrastive Pretraining for Video Action Recognition

Résumé

In this paper, we propose a self-supervised method for video representation learning based on Contrastive Predictive Coding (CPC) [27]. Previously, CPC has been used to learn representations for different signals (audio, text or image). It benefits from the use of an autoregressive modeling and contrastive estimation to learn long-term relations inside raw signal while remaining robust to local noise. Our self-supervised task consists in predicting the latent representation of future segments of the video. As opposed to generative models, predicting directly in the feature space is easier and avoid incertitude problems for long-term predictions. Today, using CPC to learn representations for videos remains challenging due to the structure and the high dimensionality of the signal. We demonstrate experimentally that the representations learned by the network are useful for action recognition. We test it with different input types such as optical flows, image differences and raw images on different datasets (UCF-101 and HMDB51). It gives consistent results across the modalities. At last, we notice the utility of our pre-training method by achieving competitive results for action recognition using few labeled data.
Fichier principal
Vignette du fichier
LORRE_Temporal_Contrastive_Pretraining_for_Video_Action_Recognition_WACV_2020_paper.pdf (633.48 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03255934 , version 1 (09-06-2021)

Identifiants

Citer

Guillaume Lorre, Jaonary Rabarisoa, Astrid Orcesi, Samia Ainouz-Zemouche, Stephane Canu. Temporal Contrastive Pretraining for Video Action Recognition. IEEE/CVF Winter Conference on Applications of Computer Vision, Mar 2020, Snowmass, United States. ⟨10.1109/WACV45572.2020.9093278⟩. ⟨hal-03255934⟩
31 Consultations
156 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More