SynSQL-2.5M
Emerging · 2 papers using it
456 HF downloads
30 HF likes
First seen: 2025
SynSQL-2.5M - The First Million-Scale Cross-Domain Text-to-SQL Dataset
We introduce the first million-scale text-to-SQL dataset, SynSQL-2.5M, containing over 2.5 million diverse and high-quality data samples, spanning more than 16,000 databases from various domains. Building on SynSQL-2.5M, we introduce OmniSQL, a fami
🤗 Hugging Face · License: apache-2.0