K2 huggingface.co api & LLM360 K2 github AI Model

Introduction of K2

Model Details of K2

K2: a fully-reproducible large language model outperforming Llama 2 70B using 35% less compute

LLM360 demystifies the training recipe used for Llama 2 70B with K2. K2 is fully transparent, meaning we’ve open-sourced all artifacts, including code, data, model checkpoints, intermediate results, and more.

About K2:

65 billion parameter LLM
Tokens: 1.4T
Languages: English
Models Released: base, chat model
Trained in 2 stages
License: Apache 2.0

K2 was developed as a collaboration between MBZUAI , Petuum , and LLM360 .

LLM360 Model Performance and Evaluation Collection

The LLM360 Performance and Evaluation Collection is a robust evaluations set consisting of general and domain specific evaluations to assess model knowledge and function.

Evaluations include standard best practice benchmarks, medical, math, and coding knowledge. More about the evaluations can be found here .

Detailed analysis can be found on the K2 Weights and Biases project here

Open LLM Leaderboard

Evaluation	Score	Raw Score
IFEval	22.52	23
BBH	28.22	50
Math Lvl 5	2.04	2
GPQA	3.58	28
MUSR	8.55	40
MMLU-PRO	22.27	30
Average	14.53	35.17

K2 Gallery

The K2 gallery allows one to browse the output of various prompts on intermediate K2 checkpoints, which provides an intuitive understanding on how the model develops and improves over time. This is inspired by The Bloom Book.

View K2 gallery here

Datasets and Mix

The following data mix was used to train K2 and achieve results in line with Llama 2 70B.

The full data sequence can be found here

Dataset	Starting Tokens	Multiplier	Total Tokens	% of Total
dm-math	4.33B	3x	13B	1%
pubmed-abstracts	4.77B	3x	14.3B	1.1%
uspto	4.77B	3x	14.3B	1.1%
pubmed-central	26B	1x	26B	2%
redpajama.arxiv	27.3B	1x	27.3B	2.1%
starcoder.spm	67.6B	0.5x	33.8B	2.6%
starcoder.fim	67.6B	0.5x	33.8B	2.6%
redpajama.stackexchange	61.1B	1x	61.1B	4.7%
starcoder	132.6B	0.5x	66.3B	5.1%
pile-of-law	76.7B	1x	76.7B	5.9%
redpajama.book	80.6B	1x	80.6B	6.2%
s2orc	107.9B	1x	107.9B	8.3%
redpajama.wikipedia	22.1B	6x	132.6B	10.2%
refinedweb	612.3B	1x	612.3B	47.1%
Totals	-	-	1.3T	100%

LLM360 Reasearch Suite

Stage 2 - Last 10 Checkpoints

Checkpoints
Checkpoint 380	Checkpoint 375
Checkpoint 379	Checkpoint 374
Checkpoint 378	Checkpoint 373
Checkpoint 377	Checkpoint 372
Checkpoint 376	Checkpoint 371

Stage 1 - Last 10 Checkpoints

Checkpoints
Checkpoint 360	Checkpoint 355
Checkpoint 359	Checkpoint 354
Checkpoint 358	Checkpoint 353
Checkpoint 357	Checkpoint 352
Checkpoint 356	Checkpoint 351

[to find all branches: git branch -a]

LLM360 Pretraining Suite

We provide step-by-step reproducation tutorials for tech enthusiasts, AI practitioners and academic or industry researchers who want to learn pretraining techniques here .

LLM360 Developer Suite

We provide step-by-step finetuning tutorials for tech enthusiasts, AI practitioners and academic or industry researchers here .

Loading K2

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("LLM360/K2")
model = AutoModelForCausalLM.from_pretrained("LLM360/K2")

prompt = 'what is the highest mountain on earth?'

input_ids = tokenizer(prompt, return_tensors="pt").input_ids
gen_tokens = model.generate(input_ids, do_sample=True, max_new_tokens=128)

print("-"*20 + "Output for model"  + 20 * '-')
print(tokenizer.batch_decode(gen_tokens)[0])

About LLM360

LLM360 is an open research lab enabling community-owned AGI through open-source large model research and development.

LLM360 enables community-owned AGI by creating standards and tools to advance the bleeding edge of LLM capability and empower knowledge transfer, research, and development.

We believe in a future where artificial general intelligence (AGI) is created by the community, for the community. Through an open ecosystem of equitable computational resources, high quality data, and flowing technical knowledge, we can ensure ethical AGI development and universal access for all innovators.

Visit us

Citation

BibTeX:

@article{K2,
      title={LLM360 K2-65B: Scaling Up Fully Transparent Open-Source LLMs}, 
      author={
      Zhengzhong Liu and Bowen Tan
      and Hongyi Wang and Willie Neiswanger and Tianhua Tao
      and Haonan Li and Fajri Koto and Yuqi Wang and Suqi Sun
      and Omkar Pangarkar and Richard Fan and Yi Gu and Victor Miller
      and Liqun Ma and Liping Tang and Nikhil Ranjan and Yonghao Zhuang
      and Guowei He and Renxi Wang and Mingkai Deng and Robin Algayres 
      and Yuanzhi Li and Zhiqiang Shen and Preslav Nakov
      and Eric Xing      
      },
      year={2024},
}

Runs of LLM360 K2 on huggingface.co

124

Total runs

24-hour runs

-1

3-day runs

7-day runs

30-day runs

More Information About K2 huggingface.co Model

More K2 license Visit here:

https://choosealicense.com/licenses/apache-2.0

K2 huggingface.co

K2 huggingface.co is an AI model on huggingface.co that provides K2's model effect (), which can be used instantly with this LLM360 K2 model. huggingface.co supports a free trial of the K2 model, and also provides paid use of the K2. Support call K2 model through api, including Node.js, Python, http.

K2 huggingface.co Url

https://huggingface.co/LLM360/K2

LLM360 K2 online free

K2 huggingface.co is an online trial and call api platform, which integrates K2's modeling effects, including api services, and provides a free online trial of K2, you can try K2 online for free by clicking the link below.

LLM360 K2 online free url in huggingface.co:

https://huggingface.co/LLM360/K2

K2 install

K2 is an open source model from GitHub that offers a free installation service, and any user can find K2 on GitHub to install. At the same time, huggingface.co provides the effect of K2 install, users can directly use K2 installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

K2 install url in huggingface.co:

https://huggingface.co/LLM360/K2

huggingface.co

LLM360/guru-32b-step240

Total runs: 49.2K

Run Growth: 0

Growth Rate: 0.00%

Updated:May 15 2025

huggingface.co

LLM360/guru-7b-step320

Total runs: 37.8K

Run Growth: 0

Growth Rate: 0.00%

Updated:May 15 2025

huggingface.co

LLM360/K2-V2-Instruct

Total runs: 21.5K

Run Growth: -104

Growth Rate: -0.48%

Updated:January 27 2026

huggingface.co

LLM360/guru-32b-step340

Total runs: 17.8K

Run Growth: 0

Growth Rate: 0.00%

Updated:May 15 2025

huggingface.co

LLM360/Crystal

Total runs: 3.6K

Run Growth: -2.6K

Growth Rate: -73.50%

Updated:October 05 2024

huggingface.co

LLM360/Amber

Total runs: 2.1K

Run Growth: -79.3K

Growth Rate: -3760.47%

Updated:October 11 2025

huggingface.co

LLM360/CrystalChat-7B-Web2Code

Total runs: 1.8K

Run Growth: 1.7K

Growth Rate: 94.56%

Updated:October 23 2024

huggingface.co

LLM360/K2-Think-V2

Total runs: 1.5K

Run Growth: -1.5K

Growth Rate: -99.66%

Updated:March 03 2026

huggingface.co

LLM360/guru-32b-anneal-step80

Total runs: 1.2K

Run Growth: 0

Growth Rate: 0.00%

Updated:May 16 2025

huggingface.co

LLM360/CrystalChat

Total runs: 1.2K

Run Growth: 1.0K

Growth Rate: 88.59%

Updated:October 05 2024

huggingface.co

LLM360/K2-V2

Total runs: 1.0K

Run Growth: 573

Growth Rate: 56.45%

Updated:January 27 2026

huggingface.co

LLM360/AmberChat

Total runs: 619

Run Growth: 351

Growth Rate: 56.70%

Updated:October 05 2024

huggingface.co

LLM360/AmberSafe

Total runs: 485

Run Growth: 431

Growth Rate: 88.87%

Updated:October 05 2024

huggingface.co

LLM360/K2-Think

Total runs: 214

Run Growth: -749

Growth Rate: -350.00%

Updated:November 19 2025

huggingface.co

LLM360/CrystalCoder

Total runs: 97

Run Growth: 0

Growth Rate: 0.00%

Updated:June 25 2024

huggingface.co

LLM360/guru-7B

Total runs: 46

Run Growth: 37

Growth Rate: 80.43%

Updated:June 20 2025

huggingface.co

LLM360/K2-Chat

Total runs: 38

Run Growth: 0

Growth Rate: 0.00%

Updated:January 13 2025

huggingface.co

LLM360/MegaMath-Llama-3.2-3B

Total runs: 13

Run Growth: 10

Growth Rate: 76.92%

Updated:April 16 2025

huggingface.co

LLM360/guru-32b-anneal-step30

Total runs: 13

Run Growth: 0

Growth Rate: 0.00%

Updated:May 16 2025

huggingface.co

LLM360/guru-32B

Total runs: 10

Run Growth: 1

Growth Rate: 10.00%

Updated:June 20 2025

huggingface.co

LLM360/MegaMath-Llama-3.2-1B

Total runs: 10

Run Growth: 5

Growth Rate: 50.00%

Updated:April 16 2025

huggingface.co

LLM360/K2-Spike-2

Total runs: 3

Run Growth: 3

Growth Rate: 100.00%

Updated:June 12 2024

huggingface.co

LLM360/K2-Spike-1

Total runs: 3

Run Growth: 2

Growth Rate: 66.67%

Updated:June 12 2024

huggingface.co

LLM360/Reason-Codegen-7B-kodcode12k

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:March 26 2025

huggingface.co

LLM360/k2-vision-65b

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 12 2024

huggingface.co

LLM360/guru-7b-step380

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 15 2025

huggingface.co

LLM360/guru_RL

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:June 04 2025

huggingface.co

LLM360/Reason-Codegen-32B-kodcode12k

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:March 26 2025

huggingface.co

LLM360/megamath_models

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:March 25 2025

LLM360 / K2

Introduction of K2

Model Details of K2

K2: a fully-reproducible large language model outperforming Llama 2 70B using 35% less compute

About K2:

LLM360 Model Performance and Evaluation Collection

Open LLM Leaderboard

K2 Gallery

Datasets and Mix

LLM360 Reasearch Suite

Stage 2 - Last 10 Checkpoints

Stage 1 - Last 10 Checkpoints

LLM360 Pretraining Suite

LLM360 Developer Suite

Loading K2

About LLM360

Citation

Runs of LLM360 K2 on huggingface.co

More Information About K2 huggingface.co Model

More K2 license Visit here:

K2 huggingface.co

K2 huggingface.co Url

LLM360 K2 online free

LLM360 K2 online free url in huggingface.co:

K2 install

K2 install url in huggingface.co:

Url of K2

K2 huggingface.co Url

Provider of K2 huggingface.co

Other API from LLM360