Enterpriseem: Fine-tuned Embeddings For Enterprise Semantic Search
2024 Β· Kamalkumar Rathinasamy, Jayarama Nettar, Amit Kumar, et al.
Abstract
Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components. While pre-trained embeddings may exhibit proximity or disparity based on their original training objectives, they might not fully align with the unique characteristics of enterprise-specific data, leading to suboptimal alignment with the retrieval goals of enterprise environments. In this paper, we propose a comprehensive methodology for contextualizing pre-trained embedding models to enterprise environments, covering the entire process from data preparation to model fine-tuning and evaluation. By adapting the embeddings to better suit the retrieval tasks prevalent in enterprises, we aim to en
Authors
(none)
Tags
Stats
Related papers
- Unified Embedding Based Personalized Retrieval In Etsy Search (2023)2.26
- Pre-training Tasks For User Intent Detection And Embedding Retrieval In E-commerce Search (2022)9.41
- Advancing Retrieval-augmented Generation For Structured Enterprise And Internal Data (2025)1.20
- Mine And Refine: Optimizing Graded Relevance In E-commerce Search Retrieval (2026)0.00
- Evaluating Embedding Apis For Information Retrieval (2023)8.09
- Embracing Structure In Data For Billion-scale Semantic Product Search (2021)0.00
- Vectorsearch: Enhancing Document Retrieval With Semantic Embeddings And Optimized Search (2024)0.00
- Large Reasoning Embedding Models: Towards Next-generation Dense Retrieval Paradigm (2025)0.00