Uniecs: Unified Multimodal E-commerce Search Framework With Gated Cross-modal Fusion
2025 Β· Zihan Liang, Yufei Ma, Zhipeng Qian, et al.
Abstract
Current e-commerce multimodal retrieval systems face two key limitations: they optimize for specific tasks with fixed modality pairings, and lack comprehensive benchmarks for evaluating unified retrieval approaches. To address these challenges, we introduce UniECS, a unified multimodal e-commerce search framework that handles all retrieval scenarios across image, text, and their combinations. Our work makes three key contributions. First, we propose a flexible architecture with a novel gated multimodal encoder that uses adaptive fusion mechanisms. This encoder integrates different modality representations while handling missing modalities. Second, we develop a comprehensive training strategy to optimize learning. It combines cross-modal alignment loss (CMAL), cohesive local alignment loss (CLAL), intra-modal contrastive loss (IMCL), and adaptive loss weighting. Third, we create M-BEER, a carefully curated multimodal benchmark containing 50K product pairs for e-commerce search evaluatio
Authors
(none)
Tags
Stats
Related papers
- MRSE: An Efficient Multi-modality Retrieval System For Large Scale E-commerce (2024)0.00
- Semantic-enhanced Modality-asymmetric Retrieval For Online E-commerce Search (2025)0.00
- Modality Curation: Building Universal Embeddings For Advanced Multimodal Information Retrieval (2025)0.00
- E-MMKGR: A Unified Multimodal Knowledge Graph Framework For E-commerce Applications (2026)0.00
- MUST: An Effective And Scalable Framework For Multimodal Search Of Target Modality (2023)7.81
- ACE-BERT: Adversarial Cross-modal Enhanced BERT For E-commerce Retrieval (2021)0.00
- Joint Fusion And Encoding: Advancing Multimodal Retrieval From The Ground Up (2025)0.00
- Unified Learning-to-rank For Multi-channel Retrieval In Large-scale E-commerce Search (2026)0.00