cycloneboy / SLM-SQL-Base-1.5B

huggingface.co
Total runs: 58
24-hour runs: -3
7-day runs: -1
30-day runs: -49
Model's Last Updated: July 31 2025
text-generation

Introduction of SLM-SQL-Base-1.5B

Model Details of SLM-SQL-Base-1.5B

SLM-SQL: An Exploration of Small Language Models for Text-to-SQL

Important Links

๐Ÿ“– Arxiv Paper | ๐Ÿค— HuggingFace | ๐Ÿค– ModelScope |

News
  • July 31, 2025 : Upload model to modelscope and huggingface.
  • July 30, 2025 : Publish the paper to arxiv
Introduction

Large language models (LLMs) have demonstrated strong performance in translating natural language questions into SQL queries (Text-to-SQL). In contrast, small language models (SLMs) ranging from 0.5B to 1.5B parameters currently underperform on Text-to-SQL tasks due to their limited logical reasoning capabilities. However, SLMs offer inherent advantages in inference speed and suitability for edge deployment. To explore their potential in Text-to-SQL applications, we leverage recent advancements in post-training techniques. Specifically, we used the open-source SynSQL-2.5M dataset to construct two derived datasets: SynSQL-Think-916K for SQL generation and SynSQL-Merge-Think-310K for SQL merge revision. We then applied supervised fine-tuning and reinforcement learning-based post-training to the SLM, followed by inference using a corrective self-consistency approach. Experimental results validate the effectiveness and generalizability of our method, SLM-SQL. On the BIRD development set, the five evaluated models achieved an average improvement of 31.4 points. Notably, the 0.5B model reached 56.87% execution accuracy (EX), while the 1.5B model achieved 67.08% EX. We will release our dataset, model, and code to github: https://github.com/CycloneBoy/slm_sql .

Framework
slmsql_framework
Main Results
slm_sql_result slmsql_bird_main slmsql_spider_main

Performance Comparison of different Text-to-SQL methods on BIRD dev and test dataset.

slmsql_ablation_study
Model
Model Base Model Train Method Modelscope HuggingFace
SLM-SQL-Base-0.5B Qwen2.5-Coder-0.5B-Instruct SFT ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SLM-SQL-0.5B Qwen2.5-Coder-0.5B-Instruct SFT + GRPO ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
CscSQL-Merge-Qwen2.5-Coder-0.5B-Instruct Qwen2.5-Coder-0.5B-Instruct SFT + GRPO ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SLM-SQL-Base-1.5B Qwen2.5-Coder-1.5B-Instruct SFT ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SLM-SQL-1.5B Qwen2.5-Coder-1.5B-Instruct SFT + GRPO ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
CscSQL-Merge-Qwen2.5-Coder-1.5B-Instruct Qwen2.5-Coder-1.5B-Instruct SFT + GRPO ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SLM-SQL-Base-0.6B Qwen3-0.6B SFT ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SLM-SQL-0.6B Qwen3-0.6B SFT + GRPO ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SLM-SQL-Base-1.3B deepseek-coder-1.3b-instruct SFT ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SLM-SQL-1.3B deepseek-coder-1.3b-instruct SFT + GRPO ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SLM-SQL-Base-1B Llama-3.2-1B-Instruct SFT ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
Dataset
Dataset Modelscope HuggingFace
SynsQL-Think-916k ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
SynsQL-Merge-Think-310k ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
bird train and dev dataset ๐Ÿค– Modelscope ๐Ÿค— HuggingFace
TODO
  • Release inference code
  • Upload Model
  • Release training code
  • Fix bug
  • Update doc
Thanks to the following projects
Citation

@misc{sheng2025slmsqlexplorationsmalllanguage,
      title={SLM-SQL: An Exploration of Small Language Models for Text-to-SQL}, 
      author={Lei Sheng and Shuai-Shuai Xu},
      year={2025},
      eprint={2507.22478},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2507.22478}, 
}

@misc{sheng2025cscsqlcorrectiveselfconsistencytexttosql,
      title={CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning}, 
      author={Lei Sheng and Shuai-Shuai Xu},
      year={2025},
      eprint={2505.13271},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2505.13271}, 
}

Runs of cycloneboy SLM-SQL-Base-1.5B on huggingface.co

58
Total runs
-3
24-hour runs
0
3-day runs
-1
7-day runs
-49
30-day runs

More Information About SLM-SQL-Base-1.5B huggingface.co Model

More SLM-SQL-Base-1.5B license Visit here:

https://choosealicense.com/licenses/cc-by-nc-4.0

SLM-SQL-Base-1.5B huggingface.co

SLM-SQL-Base-1.5B huggingface.co is an AI model on huggingface.co that provides SLM-SQL-Base-1.5B's model effect (), which can be used instantly with this cycloneboy SLM-SQL-Base-1.5B model. huggingface.co supports a free trial of the SLM-SQL-Base-1.5B model, and also provides paid use of the SLM-SQL-Base-1.5B. Support call SLM-SQL-Base-1.5B model through api, including Node.js, Python, http.

SLM-SQL-Base-1.5B huggingface.co Url

https://huggingface.co/cycloneboy/SLM-SQL-Base-1.5B

cycloneboy SLM-SQL-Base-1.5B online free

SLM-SQL-Base-1.5B huggingface.co is an online trial and call api platform, which integrates SLM-SQL-Base-1.5B's modeling effects, including api services, and provides a free online trial of SLM-SQL-Base-1.5B, you can try SLM-SQL-Base-1.5B online for free by clicking the link below.

cycloneboy SLM-SQL-Base-1.5B online free url in huggingface.co:

https://huggingface.co/cycloneboy/SLM-SQL-Base-1.5B

SLM-SQL-Base-1.5B install

SLM-SQL-Base-1.5B is an open source model from GitHub that offers a free installation service, and any user can find SLM-SQL-Base-1.5B on GitHub to install. At the same time, huggingface.co provides the effect of SLM-SQL-Base-1.5B install, users can directly use SLM-SQL-Base-1.5B installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

SLM-SQL-Base-1.5B install url in huggingface.co:

https://huggingface.co/cycloneboy/SLM-SQL-Base-1.5B

Url of SLM-SQL-Base-1.5B

SLM-SQL-Base-1.5B huggingface.co Url

Provider of SLM-SQL-Base-1.5B huggingface.co

cycloneboy
ORGANIZATIONS

Other API from cycloneboy