Document Image Classification, With A Specific View On Applications Of Patent Images
2016 · Gabriela Csurka
Abstract
The main focus of this paper is document image classification and retrieval, where we analyze and compare different parameters for the Run-Length Histogram (RL) and Fisher Vector (FV) based image representations. We conduct an exhaustive experimental study using different document image datasets, including the MARG benchmarks, two datasets built on customer data, and the images from the Patent Image Classification task of CLEF-IP 2011. The aim of the study is to give guidelines on how best to choose the parameters so that the same features perform well on different tasks. As an example of such a need, we describe the Image-based Patent Retrieval task of CLEF-IP 2011, where we used the same image representation to predict the image type and retrieve relevant patents.
Related papers
- What Is The Right Way To Represent Document Images? (2016)
- Learning Efficient Representations For Image-based Patent Retrieval (2023)
- Large Language Model Informed Patent Image Retrieval (2024)
- Hierarchical Multi-positive Contrastive Learning For Patent Image Retrieval (2025)
- A Convolutional Neural Network-based Patent Image Retrieval Method For Design Ideation (2020)
- DesignCLIP: Multimodal Learning With CLIP For Design Patent Understanding (2025)
- PatentNet: A Large-scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database (2021)
- Fine-grained Image Classification And Retrieval By Combining Visual And Locally Pooled Textual Features (2020)