Contrastive predictive coding for video representation learning - Institut National des Sciences Appliquées de Rouen
Conference paper, Year: 2019

Contrastive predictive coding for video representation learning

Abstract

Contrastive Predictive Coding (CPC) (van den Oord et al., 2018) has been used successfully to learn representations for several kinds of signals (audio, text, images). It combines autoregressive modeling with contrastive estimation to learn long-term temporal relations within the raw signal while remaining robust to local noise. The result is a higher-level signal representation that is useful for solving downstream tasks. Using CPC to learn representations for videos remains challenging because of the structure and high dimensionality of the signal. In this work, we propose different implementations of CPC for video signals. The learned representation increases the performance of an action recognition classifier.
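
The contrastive estimation step mentioned in the abstract can be illustrated with a small sketch of the InfoNCE loss from van den Oord et al. (2018). This is not the authors' video implementation: the function name `info_nce_loss`, the plain dot-product scoring, and the use of the other items in the batch as negatives are all assumptions made for illustration. Each prediction stands for the output of the autoregressive model for a future time step, and each target stands for the encoded true future of the same sequence.

```python
import math


def info_nce_loss(predictions, targets):
    """Average InfoNCE loss over a batch (illustrative sketch, hypothetical name).

    predictions: list of N vectors, the predicted future embeddings.
    targets:     list of N vectors, the encoded true futures; targets[i] is
                 the positive for predictions[i], the other rows are negatives.
    """
    total = 0.0
    for i, pred in enumerate(predictions):
        # Dot-product score between this prediction and every candidate target
        # (assumed scoring function; any learned projection is applied upstream).
        scores = [sum(p * t for p, t in zip(pred, tgt)) for tgt in targets]
        # Log-sum-exp with the maximum subtracted for numerical stability.
        m = max(scores)
        log_norm = m + math.log(sum(math.exp(s - m) for s in scores))
        # Negative log-probability of picking the true future among all candidates.
        total += log_norm - scores[i]
    return total / len(predictions)


# Usage: uninformative predictions score every candidate equally,
# so the loss equals log(N) for a batch of N candidates.
zeros = [[0.0, 0.0] for _ in range(4)]
cands = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.5]]
baseline = info_nce_loss(zeros, cands)
```

Minimizing this quantity pushes each prediction to score its own future higher than the futures of other sequences, which is what makes the learned representation discriminative without labels.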
Main file
video_cpc.pdf (294.36 KB)
Origin: Files produced by the author(s)

Dates and versions

cea-03547497, version 1 (28-01-2022)

Identifiers

  • HAL Id: cea-03547497, version 1

Cite

Guillaume Lorre, Jaonary Rabarisoa, Astrid Orcesi, Samia Ainouz, Stéphane Canu. Contrastive predictive coding for video representation learning. ICML2019 - 36th International Conference on Machine Learning - Workshop on Self-Supervised Learning, Jun 2019, Long Beach, United States. ⟨cea-03547497⟩
