Multi-scale Gated Fully Convolutional DenseNets for semantic labeling of historical newspaper images - Normandie Université Accéder directement au contenu
Article Dans Une Revue Pattern Recognition Letters Année : 2020

Multi-scale Gated Fully Convolutional DenseNets for semantic labeling of historical newspaper images

Résumé

Historical newspaper image analysis is a challenging task due to the complex layout of newspapers and its variability among collections. While traditional approaches are rule-based methods with many successive steps, recent works show that deep learning approaches can be successfully used to provide a pixel labeling of the various fields occurring in a page. This allows the automatic extraction of the document structure and accessing the different semantic entities. Recent improvements proposed to strengthen convolutional neural network capacities such as gated mechanism may also apply well to to task at end. In this respect, we propose a fully convolutional neural network architecture (FCN) that outputs a pixel-labeling of the various semantic entities that occur in historical newspaper images. Our model is based on a novel Multi-Scale Gated Block architecture (MSGB), made of dense connections and gating mechanisms that handle a multi-scale analysis of the input image with self-attention. Evaluations conducted on 4 historical newspaper datasets including up to 11 semantic classes show that our proposition outperforms standard FCN architectures.
Fichier non déposé

Dates et versions

hal-02476841 , version 1 (13-02-2020)

Identifiants

Citer

Yann Soullard, Pierrick Tranouez, Clement Chatelain, Stéphane Nicolas, Thierry Paquet. Multi-scale Gated Fully Convolutional DenseNets for semantic labeling of historical newspaper images. Pattern Recognition Letters, 2020, 131, pp.435-441. ⟨10.1016/j.patrec.2020.01.026⟩. ⟨hal-02476841⟩
66 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More