LinkedIn Post Embeddings: Industrial Scale Embedding Generation And Usage Across LinkedIn
2024 · Sudarshan Srinivasa Ramanujam, Akanksha Bindal, Yu Jiang, et al.
Abstract
A post embedding (a representation of text in embedding space that captures its semantic meaning) is a foundational component of LinkedIn, consumed by product surfaces for retrieval and ranking (e.g., ranking posts in the feed or video tab). This paper presents the post embeddings used at LinkedIn, where a pre-trained transformer-based large language model (LLM) is taken as input and fine-tuned using multi-task learning across a diverse set of semantic labeling tasks. We observe positive transfer, leading to improved performance across all tasks compared to training them independently. The generated post embeddings outperform baseline models in zero-shot learning, demonstrating their potential for broader applicability. Furthermore, the generated post embeddings outperform OpenAI's ADA-001 and ADA-002 embeddings on LinkedIn-specific datasets and tasks. We also describe the offline evaluation methodology and the deployment to our near-line infrastructure,
Related papers
- Large Scale Retrieval For The LinkedIn Feed Using Causal Language Models (2025)
- EnterpriseEM: Fine-tuned Embeddings For Enterprise Semantic Search (2024)
- Search-Adaptor: Embedding Customization For Information Retrieval (2023)
- Training LLMs To Be Better Text Embedders Through Bidirectional Reconstruction (2025)
- Liteembed: Adapting CLIP To Rare Classes (2026)
- Evaluating Embedding Models And Pipeline Optimization For AI Search Quality (2025)
- Learning A Unified Embedding For Visual Search At Pinterest (2019)
- PLUM: Adapting Pre-trained Language Models For Industrial-scale Generative Recommendations (2025)