Turkcolbert: A Benchmark Of Dense And Late-interaction Models For Turkish Information Retrieval

Abstract

Neural information retrieval systems excel in high-resource languages but remain underexplored for morphologically rich, lower-resource languages such as Turkish. Dense bi-encoders currently dominate Turkish IR, yet late-interaction models -- which retain token-level representations for fine-grained matching -- have not been systematically evaluated. We introduce TurkColBERT, the first comprehensive benchmark comparing dense encoders and late-interaction models for Turkish retrieval. Our two-stage adaptation pipeline fine-tunes English and multilingual encoders on Turkish NLI/STS tasks, then converts them into ColBERT-style retrievers using PyLate trained on MS MARCO-TR. We evaluate 10 models across five Turkish BEIR datasets covering scientific, financial, and argumentative domains. Results show strong parameter efficiency: the 1.0M-parameter colbert-hash-nano-tr is 600\(\times\) smaller than the 600M turkish-e5-large dense encoder while preserving over 71% of its average mAP. Late-in

Turkcolbert: A Benchmark Of Dense And Late-interaction Models For Turkish Information Retrieval

Abstract

Authors

Tags

Stats

Related papers