trashpanda-org / Grog

huggingface.co
Total runs: 3
24-hour runs: 0
7-day runs: 1
30-day runs: -2
Model's Last Updated: March 24 2025
text-generation

Introduction of Grog

Model Details of Grog

Model Card for Greg

This model is a fine-tuned version of Hasnonname/Qwen2.5-14B-Kebab-v0 . It has been trained using TRL .

Quick start
from transformers import pipeline

question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
generator = pipeline("text-generation", model="trashpanda-org/Greg", device="cuda")
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
print(output["generated_text"])
Training procedure

Visualize in Weights & Biases

This model was trained with GRPO, a method introduced in DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models .

Framework versions
  • TRL: 0.15.1
  • Transformers: 4.49.0
  • Pytorch: 2.5.1
  • Datasets: 3.4.1
  • Tokenizers: 0.21.1
Citations

Cite GRPO as:

@article{zhihong2024deepseekmath,
    title        = {{DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models}},
    author       = {Zhihong Shao and Peiyi Wang and Qihao Zhu and Runxin Xu and Junxiao Song and Mingchuan Zhang and Y. K. Li and Y. Wu and Daya Guo},
    year         = 2024,
    eprint       = {arXiv:2402.03300},
}

Cite TRL as:

@misc{vonwerra2022trl,
    title        = {{TRL: Transformer Reinforcement Learning}},
    author       = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallouédec},
    year         = 2020,
    journal      = {GitHub repository},
    publisher    = {GitHub},
    howpublished = {\url{https://github.com/huggingface/trl}}
}

Runs of trashpanda-org Grog on huggingface.co

3
Total runs
0
24-hour runs
0
3-day runs
1
7-day runs
-2
30-day runs

More Information About Grog huggingface.co Model

Grog huggingface.co

Grog huggingface.co is an AI model on huggingface.co that provides Grog's model effect (), which can be used instantly with this trashpanda-org Grog model. huggingface.co supports a free trial of the Grog model, and also provides paid use of the Grog. Support call Grog model through api, including Node.js, Python, http.

trashpanda-org Grog online free

Grog huggingface.co is an online trial and call api platform, which integrates Grog's modeling effects, including api services, and provides a free online trial of Grog, you can try Grog online for free by clicking the link below.

trashpanda-org Grog online free url in huggingface.co:

https://huggingface.co/trashpanda-org/Grog

Grog install

Grog is an open source model from GitHub that offers a free installation service, and any user can find Grog on GitHub to install. At the same time, huggingface.co provides the effect of Grog install, users can directly use Grog installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

Grog install url in huggingface.co:

https://huggingface.co/trashpanda-org/Grog

Url of Grog

Provider of Grog huggingface.co

trashpanda-org
ORGANIZATIONS

Other API from trashpanda-org

huggingface.co

Total runs: 6
Run Growth: 4
Growth Rate: 66.67%
Updated:March 25 2025