July 31, 2025
: Upload model to modelscope and huggingface.
July 30, 2025
: Publish the paper to arxiv
Introduction
Large language models (LLMs) have demonstrated strong performance in translating natural language questions into SQL
queries (Text-to-SQL). In contrast, small language models (SLMs) ranging from 0.5B to 1.5B parameters currently
underperform on Text-to-SQL tasks due to their limited logical reasoning capabilities. However, SLMs offer inherent
advantages in inference speed and suitability for edge deployment. To explore their potential in Text-to-SQL
applications, we leverage recent advancements in post-training techniques. Specifically, we used the open-source
SynSQL-2.5M dataset to construct two derived datasets: SynSQL-Think-916K for SQL generation and
SynSQL-Merge-Think-310K
for SQL merge revision. We then applied supervised fine-tuning and reinforcement learning-based post-training to the
SLM, followed by inference using a corrective self-consistency approach. Experimental results validate the
effectiveness
and generalizability of our method, SLM-SQL. On the BIRD development set, the five evaluated models achieved an
average
improvement of 31.4 points. Notably, the 0.5B model reached 56.87% execution accuracy (EX), while the 1.5B model
achieved 67.08% EX. We will release our dataset, model, and code to github:
https://github.com/CycloneBoy/slm_sql
.
Framework
Main Results
Performance Comparison of different Text-to-SQL methods on BIRD dev and test dataset.
@misc{sheng2025slmsqlexplorationsmalllanguage,
title={SLM-SQL: An Exploration of Small Language Models for Text-to-SQL},
author={Lei Sheng and Shuai-Shuai Xu},
year={2025},
eprint={2507.22478},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2507.22478},
}
@misc{sheng2025cscsqlcorrectiveselfconsistencytexttosql,
title={CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning},
author={Lei Sheng and Shuai-Shuai Xu},
year={2025},
eprint={2505.13271},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.13271},
}
Runs of cycloneboy SLM-SQL-Base-1.5B on huggingface.co
58
Total runs
-3
24-hour runs
0
3-day runs
-1
7-day runs
-49
30-day runs
More Information About SLM-SQL-Base-1.5B huggingface.co Model
SLM-SQL-Base-1.5B huggingface.co is an AI model on huggingface.co that provides SLM-SQL-Base-1.5B's model effect (), which can be used instantly with this cycloneboy SLM-SQL-Base-1.5B model. huggingface.co supports a free trial of the SLM-SQL-Base-1.5B model, and also provides paid use of the SLM-SQL-Base-1.5B. Support call SLM-SQL-Base-1.5B model through api, including Node.js, Python, http.
SLM-SQL-Base-1.5B huggingface.co is an online trial and call api platform, which integrates SLM-SQL-Base-1.5B's modeling effects, including api services, and provides a free online trial of SLM-SQL-Base-1.5B, you can try SLM-SQL-Base-1.5B online for free by clicking the link below.
cycloneboy SLM-SQL-Base-1.5B online free url in huggingface.co:
SLM-SQL-Base-1.5B is an open source model from GitHub that offers a free installation service, and any user can find SLM-SQL-Base-1.5B on GitHub to install. At the same time, huggingface.co provides the effect of SLM-SQL-Base-1.5B install, users can directly use SLM-SQL-Base-1.5B installed effect in huggingface.co for debugging and trial. It also supports api for free installation.