Supervised Pretraining for Deep Networks
Abstract
Gradient backpropagation works well only if the initial weights are close to a good solution. Greedy layer-wise pretraining of Deep Neural Networks (DNNs) with autoassociators is a well-known trick for setting appropriate initializations in deep learning. In the literature, however, this pretraining involves only the inputs, while the information conveyed by the labels is ignored. In this paper, we present new pretraining algorithms for DNNs that embed the label information: the weights of the input and hidden layers are initialized in the usual way by autoassociators, and, to set the initial values of the output layer, an autoassociator embedding the output vector into a dedicated space is learned. This space has the same dimension as the last hidden layer, which is set appropriately according to the output size. Empirical evidence shows that this initialization of the architecture, rather than random initialization, leads to better results in terms of generalization error.
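The following is a minimal sketch, not the authors' implementation, of the procedure the abstract describes: hidden layers are initialized by greedy autoassociators on the inputs, and the output layer is initialized from the decoder of an autoassociator trained on the label vectors. All layer widths, data shapes, learning rates, and epoch counts are illustrative assumptions.

```python
import torch
import torch.nn as nn

def pretrain_autoencoder(data, in_dim, hid_dim, epochs=20, lr=1e-2):
    """Train a one-hidden-layer autoassociator; return its encoder and decoder."""
    enc = nn.Linear(in_dim, hid_dim)
    dec = nn.Linear(hid_dim, in_dim)
    opt = torch.optim.SGD(list(enc.parameters()) + list(dec.parameters()), lr=lr)
    for _ in range(epochs):
        opt.zero_grad()
        recon = dec(torch.sigmoid(enc(data)))      # reconstruct the input
        loss = nn.functional.mse_loss(recon, data)
        loss.backward()
        opt.step()
    return enc, dec

# Toy data: 256 samples, 50 input features, 10 one-hot labels (assumed shapes).
X = torch.rand(256, 50)
Y = torch.eye(10)[torch.randint(0, 10, (256,))]

# Unsupervised step: greedy layer-wise pretraining of the hidden layers.
sizes = [50, 32, 16]                               # illustrative layer widths
encoders, h = [], X
for d_in, d_hid in zip(sizes[:-1], sizes[1:]):
    enc, _ = pretrain_autoencoder(h, d_in, d_hid)
    encoders.append(enc)
    h = torch.sigmoid(enc(h)).detach()             # feed codes to the next layer

# Supervised step: embed the labels into a space whose dimension matches the
# last hidden layer (16 here), then reuse the label *decoder* as the output layer.
lab_enc, lab_dec = pretrain_autoencoder(Y, 10, sizes[-1])

# Assemble the pretrained network: stacked input encoders + label decoder on top.
layers = []
for enc in encoders:
    layers += [enc, nn.Sigmoid()]
layers.append(lab_dec)                             # maps hidden space to labels
model = nn.Sequential(*layers)                     # ready for supervised fine-tuning
```

Under these assumptions, every weight matrix of `model`, including the output layer, starts from a data-dependent initialization rather than a random one; standard supervised fine-tuning with backpropagation would then proceed from this starting point.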