Innovation and open source are the best tributes to and reproductions of DeepSeek.
The open-source ethos, rooted in technological equity, upholds two key principles: free access for all developers and the opportunity for technical contributions. While DeepSeek exemplifies the first principle, the second principle is hindered by restrictive training strategies, unclear data sources, and the high costs of model training. These constraints limit the open-source community's capacity to contribute and impede technological progress.
To overcome these challenges, we launched a comprehensive reproduction project of DeepSeek. This initiative involves training models from scratch, replicating DeepSeek's architecture and algorithms. We will fully open-source the training code, datasets, and models, offering a code framework, reference solutions, and base models for low-cost continual exploration.
🌈 Update
[2025.03.11] TinyDeepSeek repo is published! 🎉
Reproduction of DeepSeek-R1
Architecture
As shown in the figure above, the DeepSeek technical report introduces three architectural designs:
A. Multi-Head Latent Attention (MLA)
B. Load Balancing Strategy without Auxiliary Loss: Please refer to the code for implementation.
C. Multi-Token Prediction: Please refer to the code for implementation.
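To make design B more concrete, here is a minimal pure-Python sketch of bias-based, auxiliary-loss-free load balancing as described in the DeepSeek-V3 technical report: a per-expert bias is added to the affinity scores only when selecting the top-k experts, and after each batch the bias is nudged down for overloaded experts and up for underloaded ones. This is an illustration, not the code from this repository. The function names (`route_top_k`, `update_bias`, `simulate`), the update speed `gamma`, and the synthetic skewed scores are all assumptions made for the example.

```python
import random


def route_top_k(scores, bias, k):
    """Select k experts per token with biased scores; gate with unbiased scores."""
    assignments = []
    for token_scores in scores:
        # The bias influences only WHICH experts are selected.
        ranked = sorted(
            range(len(token_scores)),
            key=lambda e: token_scores[e] + bias[e],
            reverse=True,
        )
        chosen = ranked[:k]
        # Gating weights are normalized from the original (unbiased) scores.
        total = sum(token_scores[e] for e in chosen)
        assignments.append({e: token_scores[e] / total for e in chosen})
    return assignments


def update_bias(bias, assignments, gamma):
    """Lower the bias of overloaded experts and raise it for underloaded ones."""
    load = [0] * len(bias)
    for gates in assignments:
        for e in gates:
            load[e] += 1
    mean_load = sum(load) / len(load)
    new_bias = []
    for b, n in zip(bias, load):
        if n > mean_load:
            new_bias.append(b - gamma)
        elif n < mean_load:
            new_bias.append(b + gamma)
        else:
            new_bias.append(b)
    return new_bias, load


def simulate(steps=50, n_tokens=256, n_experts=4, k=2, gamma=0.01, seed=0):
    """Run routing on synthetic scores where expert 0 is systematically preferred."""
    rng = random.Random(seed)
    bias = [0.0] * n_experts
    loads = []
    for _ in range(steps):
        # Hypothetical positive affinities (assumption): expert 0 is skewed high.
        scores = [
            [rng.uniform(0.5, 1.0) if e == 0 else rng.uniform(0.05, 1.0)
             for e in range(n_experts)]
            for _ in range(n_tokens)
        ]
        assignments = route_top_k(scores, bias, k)
        bias, load = update_bias(bias, assignments, gamma)
        loads.append(load)
    return loads[0], loads[-1], bias


if __name__ == "__main__":
    first, last, bias = simulate()
    print("per-expert load, first step:", first)
    print("per-expert load, last step: ", last)
    print("learned bias:", [round(b, 2) for b in bias])
```

Running the script prints the per-expert token counts before and after the bias has adapted. Because the bias never enters the gating weights, balancing the load this way adds no extra term to the training loss.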
Our detailed architectural parameters are shown in the table below.
Data Construction
Pretrain:
Meta & Labeled Data: TBD
Process Code: Please refer to Link for implementation.