bigcode / astraios-3b-ia3

huggingface.co
Total runs: 9
24-hour runs: 0
7-day runs: 1
30-day runs: 4
Model's Last Updated: January 02 2024

Introduction of astraios-3b-ia3

Model Details of astraios-3b-ia3

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Astraios

Table of Contents

  1. Model Summary
  2. Use
  3. Training
  4. Citation

Model Summary

Astraios-3B-IA3 is an instruction tuned model with 15.5B parameters created by finetuning StarCoderBase on CommitPackFT & OASST as described in the Astraios paper.

  • Repository: bigcode-project/astraios
  • Paper: Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
  • Languages: 80+ Programming languages
  • ✨Astraios:
    Data CommitPackFT+OASST Filtered version of CommitPack and OASST for high-quality commit messages that resemble instructions
    Model Astraios-1B Collection of StarCoderBase-1B models instruction tuned on CommitPackFT + OASST with different tuning methods
    Astraios-3B Collection of StarCoderBase-3B (3B parameters) models instruction tuned on CommitPackFT + OASST with different tuning methods
    Astraios-7B Collection of StarCoderBase-7B (7B parameters) models instruction tuned on CommitPackFT + OASST with different tuning methods
    Astraios-16B Collection of StarCoderBase-16B (16B parameters) models instruction tuned on CommitPackFT + OASST with different tuning methods
    Evaluation BigCloneBench Dataset for clone detection; We use 2,000 samples for evaluation
    Devign Dataset for defect detection; We use 2,000 samples for evaluation
    HumanEvalPack Extension of OpenAI's HumanEval to cover 3 scenarios across 6 languages
    ReCode Dataset for the robustness of code generation, covering 4 variants
    Asleep At The Keyboard Datasets for security of code generation; We use DoW for evaluation

Use

Intended use

The model follows instructions provided in the input. You should always preface your input with "Question: " and finish it with "Answer:", for example: "Question: Please write a function in Python that performs bubble sort.

Answer:"

Feel free to share your generations in the Community tab!

Generation
# pip install -q transformers
# pip install -e git+https://github.com/bigcode-project/astraios#subdirectory=peft
from peft import PeftModel 
from transformers import AutoModelForCausalLM, AutoTokenizer

peft_checkpoint = "bigcode/astraios-3b-ia3"
checkpoint = "bigcode/starcoderbase-3b"
model = AutoModelForCausalLM.from_pretrained(checkpoint)
model = PeftModel.from_pretrained(model, peft_checkpoint)
device = "cuda" # for GPU usage or "cpu" for CPU usage

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint).to(device)

inputs = tokenizer.encode("Question: Please write a function in Python that performs bubble sort.

Answer:", return_tensors="pt").to(device)
outputs = model.generate(inputs)
print(tokenizer.decode(outputs[0]))

Training

Model
  • Architecture: GPT-2 model with multi-query attention and Fill-in-the-Middle objective
  • Steps: 250k pretraining & 200 instruction tuning
  • Precision: fp32
Hardware
  • Pretraining:
    • GPUs: 512 Tesla A100
    • Training time: 24 days
  • Instruction tuning:
    • GPUs: 8 Tesla A100
Software

Citation


Runs of bigcode astraios-3b-ia3 on huggingface.co

9
Total runs
0
24-hour runs
-1
3-day runs
1
7-day runs
4
30-day runs

More Information About astraios-3b-ia3 huggingface.co Model

More astraios-3b-ia3 license Visit here:

https://choosealicense.com/licenses/bigcode-openrail-m

astraios-3b-ia3 huggingface.co

astraios-3b-ia3 huggingface.co is an AI model on huggingface.co that provides astraios-3b-ia3's model effect (), which can be used instantly with this bigcode astraios-3b-ia3 model. huggingface.co supports a free trial of the astraios-3b-ia3 model, and also provides paid use of the astraios-3b-ia3. Support call astraios-3b-ia3 model through api, including Node.js, Python, http.

astraios-3b-ia3 huggingface.co Url

https://huggingface.co/bigcode/astraios-3b-ia3

bigcode astraios-3b-ia3 online free

astraios-3b-ia3 huggingface.co is an online trial and call api platform, which integrates astraios-3b-ia3's modeling effects, including api services, and provides a free online trial of astraios-3b-ia3, you can try astraios-3b-ia3 online for free by clicking the link below.

bigcode astraios-3b-ia3 online free url in huggingface.co:

https://huggingface.co/bigcode/astraios-3b-ia3

astraios-3b-ia3 install

astraios-3b-ia3 is an open source model from GitHub that offers a free installation service, and any user can find astraios-3b-ia3 on GitHub to install. At the same time, huggingface.co provides the effect of astraios-3b-ia3 install, users can directly use astraios-3b-ia3 installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

astraios-3b-ia3 install url in huggingface.co:

https://huggingface.co/bigcode/astraios-3b-ia3

Url of astraios-3b-ia3

astraios-3b-ia3 huggingface.co Url

Provider of astraios-3b-ia3 huggingface.co

bigcode
ORGANIZATIONS

Other API from bigcode

huggingface.co

Total runs: 86.7K
Run Growth: -18.0K
Growth Rate: -20.82%
Updated:March 04 2024
huggingface.co

Total runs: 15.7K
Run Growth: -3.4K
Growth Rate: -21.77%
Updated:June 11 2024
huggingface.co

Total runs: 12.4K
Run Growth: 2.7K
Growth Rate: 21.57%
Updated:October 09 2024
huggingface.co

Total runs: 7.4K
Run Growth: 1.8K
Growth Rate: 24.78%
Updated:October 13 2023
huggingface.co

Total runs: 2.2K
Run Growth: 1.9K
Growth Rate: 85.91%
Updated:August 18 2023
huggingface.co

Total runs: 612
Run Growth: -61
Growth Rate: -9.97%
Updated:May 10 2023
huggingface.co

Total runs: 399
Run Growth: 356
Growth Rate: 89.22%
Updated:August 18 2023
huggingface.co

Total runs: 271
Run Growth: 262
Growth Rate: 96.68%
Updated:August 05 2023
huggingface.co

Total runs: 206
Run Growth: 196
Growth Rate: 95.15%
Updated:August 05 2023
huggingface.co

Total runs: 77
Run Growth: -5.3K
Growth Rate: -6851.95%
Updated:July 24 2023
huggingface.co

Total runs: 3
Run Growth: -3
Growth Rate: -100.00%
Updated:January 02 2024
huggingface.co

Total runs: 0
Run Growth: 0
Growth Rate: 0.00%
Updated:February 28 2024
huggingface.co

Total runs: 0
Run Growth: 0
Growth Rate: 0.00%
Updated:February 24 2025