Human-aligned Image Models Improve Visual Decoding From The Brain
2025 · Nona Rajabi, Antônio H. Ribeiro, Miguel Vasco, et al.
Abstract
Decoding visual images from brain activity has significant potential for advancing brain-computer interaction and enhancing the understanding of human perception. Recent approaches align the representation spaces of images and brain activity to enable visual decoding. In this paper, we introduce the use of human-aligned image encoders to map brain signals to images. We hypothesize that these models more effectively capture perceptual attributes associated with the rapid visual stimuli presentations commonly used in visual brain data recording experiments. Our empirical results support this hypothesis, demonstrating that this simple modification improves image retrieval accuracy by up to 21% compared to state-of-the-art methods. Comprehensive experiments confirm consistent performance improvements across diverse EEG architectures, image encoders, alignment methods, participants, and brain imaging modalities
Authors
(none)
Tags
Stats
Related papers
- Brain-inspired Capture: Evidence-driven Neuromimetic Perceptual Simulation For Visual Decoding (2026)0.00
- Achieving Fine-grained Cross-modal Understanding Through Brain-inspired Hierarchical Representation Learning (2026)0.00
- See What You See: Self-supervised Cross-modal Retrieval Of Visual Stimuli From Brain Activity (2022)0.00
- Perception Encoder: The Best Visual Embeddings Are Not At The Output Of The Network (2025)6.71
- Unveiling Deep Semantic Uncertainty Perception For Language-anchored Multi-modal Vision-brain Alignment (2025)0.00
- Give: Guiding Visual Encoder To Perceive Overlooked Information (2024)0.00
- Context Sensitivity Improves Human-machine Visual Alignment (2026)0.00
- Hyperdimensional Cross-modal Alignment Of Frozen Language And Image Models For Efficient Image Captioning (2026)0.00