Towards A Text-based Quantitative And Explainable Histopathology Image Analysis
2024 Β· Anh Tien Nguyen, Trinh Thi Le Vuong, Jin Tae Kwak
Abstract
Recently, vision-language pre-trained models have emerged in computational pathology. Previous works generally focused on the alignment of image-text pairs via the contrastive pre-training paradigm. Such pre-trained models have been applied to pathology image classification in zero-shot learning or transfer learning fashion. Herein, we hypothesize that the pre-trained vision-language models can be utilized for quantitative histopathology image analysis through a simple image-to-text retrieval. To this end, we propose a Text-based Quantitative and Explainable histopathology image analysis, which we call TQx. Given a set of histopathology images, we adopt a pre-trained vision-language model to retrieve a word-of-interest pool. The retrieved words are then used to quantify the histopathology images and generate understandable feature embeddings due to the direct mapping to the text description. To evaluate the proposed method, the text-based embeddings of four histopathology image dataset
Authors
(none)
Tags
Stats
Related papers
- Pathalign: A Vision-language Model For Whole Slide Images In Histopathology (2024)0.00
- On The Importance Of Text Preprocessing For Multimodal Representation Learning And Pathology Report Generation (2025)0.00
- Accurate And Scalable Multimodal Pathology Retrieval Via Attentive Vision-language Alignment (2025)2.26
- Learning To Read Where To Look: Disease-aware Vision-language Pretraining For 3D CT (2026)0.00
- Learning Binary Semantic Embedding For Histology Image Classification And Retrieval (2020)3.58
- Multimodal Whole Slide Foundation Model For Pathology (2024)12.99
- HOMIE: Histopathology Omni-modal Embedding For Pathology Composed Retrieval (2025)0.00
- Zero-shot Whole Slide Image Retrieval In Histopathology Using Embeddings Of Foundation Models (2024)0.00