A Comprehensive Study Of Imagenet Pre-training For Historical Document Image Analysis
2019 Β· Linda Studer, Michele Alberti, Vinaychandran Pondenkandath, et al.
Abstract
Automatic analysis of scanned historical documents comprises a wide range of image analysis tasks, which are often challenging for machine learning due to a lack of human-annotated learning samples. With the advent of deep neural networks, a promising way to cope with the lack of training data is to pre-train models on images from a different domain and then fine-tune them on historical documents. In the current research, a typical example of such cross-domain transfer learning is the use of neural networks that have been pre-trained on the ImageNet database for object recognition. It remains a mostly open question whether or not this pre-training helps to analyse historical documents, which have fundamentally different image properties when compared with ImageNet. In this paper, we present a comprehensive empirical survey on the effect of ImageNet pre-training for diverse historical document analysis tasks, including character recognition, style classification, manuscript dating, sema
Authors
(none)
Tags
Stats
Related papers
- Deep Learning Approaches For Image Retrieval And Pattern Spotting In Ancient Documents (2019)0.00
- Pattern Spotting And Image Retrieval In Historical Documents Using Deep Hashing (2022)2.26
- A Generic Image Retrieval Method For Date Estimation Of Historical Document Collections (2022)3.58
- ICDAR 2019 Competition On Image Retrieval For Historical Handwritten Documents (2019)11.29
- What Is The Right Way To Represent Document Images? (2016)0.00
- Image Retrieval And Pattern Spotting Using Siamese Neural Network (2019)11.58
- Leveraging Computer Vision Application In Visual Arts: A Case Study On The Use Of Residual Neural Network To Classify And Analyze Baroque Paintings (2022)3.58
- Lifelong Learning For Text Retrieval And Recognition In Historical Handwritten Document Collections (2019)5.24