FreedomIntelligence / TinyDeepSeek-3.3B-base

huggingface.co
Total runs: 4
24-hour runs: 0
7-day runs: 0
30-day runs: 0
Model's Last Updated: March 11 2025

Introduction of TinyDeepSeek-3.3B-base

Model Details of TinyDeepSeek-3.3B-base

TinyDeepSeek: Reproduction of DeepSeek-R1 and Beyond

📃 Paper • 🤗 TinyDeepSeek-0.5B-base • 🤗 TinyDeepSeek-3.3B-base
🤗 TinyDeepSeek-3.3B-checkpoints • 🤗 TinyDeepSeek-0.5B-checkpoints

Innovation and Open source are the best tributes and reproductions of DeepSeek .

The open-source ethos, rooted in technological equity, upholds two key principles: free access for all developers and the opportunity for technical contributions . While DeepSeek exemplifies the first principle, the second principle is hindered by restrictive training strategies, unclear data sources, and the high costs of model training. These constraints limit the open-source community's capacity to contribute and impede technological progress.

To overcome these challenges, we launched a comprehensive reproduction project of DeepSeek. This initiative involves training models from scratch, replicating DeepSeek's architecture and algorithms. We will fully open-source the training code, datasets, and models, offering code framework, reference solution, and base models for low-cost continual exploration.

🌈 Update
  • [2025.03.11] TinyDeepSeek repo is published!🎉
Reproduction of DeepSeek-R1
Architecture
Click to expand

arch

As shown in above Figure, the DeepSeek technical report introduces three architectural designs:

  • A. Multi-Head Latent Attention (MLA)

  • B. Load Balancing Strategy without Auxiliary Loss : Please refer code for implementation.

  • C. Multi-Token Prediction : Please refer code for implementation.

Our Detailed architectural parameter are shown in Table below.

arch

Data Construction
Click to expand

arch

Model Training
bash examples/pretrainStage1.sh
bash examples/pretrainStage2.sh
bash examples/sft.sh

bash examples/rl.sh (TBD)
Results

TBD

Beyond Reproduction
Scale Up RL to Pretrain

RWO: Reward Weighted Optimization

For the training, please include the include the following flags in the training command.

  • For Pretrain please prepare data item with key 'text_evaluation':{"knowledge":4, "reasoning":3, ...}.
  • For SFT, please provide data file 'General.json' and reward file 'General_reward.json' in the same directory.
--reward_weighted_optimization True \
--remove_unused_columns False \
📃 To do
  • Release Evaluation Results
  • Release RL Training Code and Model
  • Release Pretrain Data quality annotation label
  • Release TinyDeepSeek Technical Report
Acknowledgment
Citation

Please use the following citation if you intend to use our dataset for training or evaluation:

@misc{tinydeepseek,
  title={TinyDeepSeek},
  author={FreedomIntelligence Team},
  year = {2025},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/FreedomIntelligence/TinyDeepSeek}},
}

Runs of FreedomIntelligence TinyDeepSeek-3.3B-base on huggingface.co

4
Total runs
0
24-hour runs
0
3-day runs
0
7-day runs
0
30-day runs

More Information About TinyDeepSeek-3.3B-base huggingface.co Model

More TinyDeepSeek-3.3B-base license Visit here:

https://choosealicense.com/licenses/apache-2.0

TinyDeepSeek-3.3B-base huggingface.co

TinyDeepSeek-3.3B-base huggingface.co is an AI model on huggingface.co that provides TinyDeepSeek-3.3B-base's model effect (), which can be used instantly with this FreedomIntelligence TinyDeepSeek-3.3B-base model. huggingface.co supports a free trial of the TinyDeepSeek-3.3B-base model, and also provides paid use of the TinyDeepSeek-3.3B-base. Support call TinyDeepSeek-3.3B-base model through api, including Node.js, Python, http.

FreedomIntelligence TinyDeepSeek-3.3B-base online free

TinyDeepSeek-3.3B-base huggingface.co is an online trial and call api platform, which integrates TinyDeepSeek-3.3B-base's modeling effects, including api services, and provides a free online trial of TinyDeepSeek-3.3B-base, you can try TinyDeepSeek-3.3B-base online for free by clicking the link below.

FreedomIntelligence TinyDeepSeek-3.3B-base online free url in huggingface.co:

https://huggingface.co/FreedomIntelligence/TinyDeepSeek-3.3B-base

TinyDeepSeek-3.3B-base install

TinyDeepSeek-3.3B-base is an open source model from GitHub that offers a free installation service, and any user can find TinyDeepSeek-3.3B-base on GitHub to install. At the same time, huggingface.co provides the effect of TinyDeepSeek-3.3B-base install, users can directly use TinyDeepSeek-3.3B-base installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

TinyDeepSeek-3.3B-base install url in huggingface.co:

https://huggingface.co/FreedomIntelligence/TinyDeepSeek-3.3B-base

Url of TinyDeepSeek-3.3B-base

Provider of TinyDeepSeek-3.3B-base huggingface.co

FreedomIntelligence
ORGANIZATIONS

Other API from FreedomIntelligence