michaelfeil / ct2fast-gpt_bigcode-santacoder

huggingface.co
Total runs: 7
24-hour runs: 1
7-day runs: 2
30-day runs: 5
Model's Last Updated: May 31 2023
text-generation

Introduction of ct2fast-gpt_bigcode-santacoder

Model Details of ct2fast-gpt_bigcode-santacoder

# Fast-Inference with Ctranslate2

Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on CPU or GPU.

quantized version of bigcode/gpt_bigcode-santacoder

pip install hf-hub-ctranslate2>=2.0.8 ctranslate2>=3.14.0

Converted on 2023-05-31 using

ct2-transformers-converter --model bigcode/gpt_bigcode-santacoder --output_dir /home/michael/tmp-ct2fast-gpt_bigcode-santacoder --force --copy_files tokenizer.json README.md tokenizer_config.json special_tokens_map.json .gitattributes --quantization float16 --trust_remote_code

Checkpoint compatible to ctranslate2>=3.14.0 and hf-hub-ctranslate2>=2.0.8

  • compute_type=int8_float16 for device="cuda"
  • compute_type=int8 for device="cpu"
from hf_hub_ctranslate2 import TranslatorCT2fromHfHub, GeneratorCT2fromHfHub
from transformers import AutoTokenizer

model_name = "michaelfeil/ct2fast-gpt_bigcode-santacoder"
# use either TranslatorCT2fromHfHub or GeneratorCT2fromHfHub here, depending on model.
model = GeneratorCT2fromHfHub(
        # load in int8 on CUDA
        model_name_or_path=model_name, 
        device="cuda",
        compute_type="int8_float16",
        # tokenizer=AutoTokenizer.from_pretrained("bigcode/gpt_bigcode-santacoder")
)
outputs = model.generate(
    text=["How do you call a fast Flan-ingo?", "User: How are you doing? Bot:"],
    max_length=64, 
    include_prompt_in_result=False
)
print(outputs)

Licence and other remarks:

This is just a quantized version. Licence conditions are intended to be idential to original huggingface repo.

Original description

SantaCoder

banner

Play with the model on the SantaCoder Space Demo .

Table of Contents

  1. Model Summary
  2. Use
  3. Limitations
  4. Training
  5. License
  6. Citation

Model Summary

This is the Megatron-version of SantaCoder . We refer the reader to the SantaCoder model page for full documentation about this model

There are two versions (branches) of the model:

  • main : Uses the gpt_bigcode model. Requires the bigcode fork of transformers .
  • main_custom : Packaged with its modeling code. Requires transformers>=4.27 . Alternatively, it can run on older versions by setting the configuration parameter activation_function = "gelu_pytorch_tanh" .

Use

Intended use

The model was trained on GitHub code. As such it is not an instruction model and commands like "Write a function that computes the square root." do not work well. You should phrase commands like they occur in source code such as comments (e.g. # the following function computes the sqrt ) or write a function signature and docstring and let the model complete the function body.

Attribution & Other Requirements

The pretraining dataset of the model was filtered for permissive licenses only. Nevertheless, the model can generate source code verbatim from the dataset. The code's license might require attribution and/or other specific requirements that must be respected. We provide a search index that let's you search through the pretraining data to identify where generated code came from and apply the proper attribution to your code.

Limitations

The model has been trained on source code in Python, Java, and JavaScript. The predominant language in source is English although other languages are also present. As such the model is capable to generate code snippets provided some context but the generated code is not guaranteed to work as intended. It can be inefficient, contain bugs or exploits.

Training

Model
  • Architecture: GPT-2 model with multi-query attention and Fill-in-the-Middle objective
  • Pretraining steps: 600K
  • Pretraining tokens: 236 billion
  • Precision: float16
Hardware
  • GPUs: 96 Tesla V100
  • Training time: 6.2 days
  • Total FLOPS: 2.1 x 10e21
Software

License

The model is licenses under the CodeML Open RAIL-M v0.1 license. You can find the full license here .

Runs of michaelfeil ct2fast-gpt_bigcode-santacoder on huggingface.co

7
Total runs
1
24-hour runs
1
3-day runs
2
7-day runs
5
30-day runs

More Information About ct2fast-gpt_bigcode-santacoder huggingface.co Model

More ct2fast-gpt_bigcode-santacoder license Visit here:

https://choosealicense.com/licenses/openrail

ct2fast-gpt_bigcode-santacoder huggingface.co

ct2fast-gpt_bigcode-santacoder huggingface.co is an AI model on huggingface.co that provides ct2fast-gpt_bigcode-santacoder's model effect (), which can be used instantly with this michaelfeil ct2fast-gpt_bigcode-santacoder model. huggingface.co supports a free trial of the ct2fast-gpt_bigcode-santacoder model, and also provides paid use of the ct2fast-gpt_bigcode-santacoder. Support call ct2fast-gpt_bigcode-santacoder model through api, including Node.js, Python, http.

ct2fast-gpt_bigcode-santacoder huggingface.co Url

https://huggingface.co/michaelfeil/ct2fast-gpt_bigcode-santacoder

michaelfeil ct2fast-gpt_bigcode-santacoder online free

ct2fast-gpt_bigcode-santacoder huggingface.co is an online trial and call api platform, which integrates ct2fast-gpt_bigcode-santacoder's modeling effects, including api services, and provides a free online trial of ct2fast-gpt_bigcode-santacoder, you can try ct2fast-gpt_bigcode-santacoder online for free by clicking the link below.

michaelfeil ct2fast-gpt_bigcode-santacoder online free url in huggingface.co:

https://huggingface.co/michaelfeil/ct2fast-gpt_bigcode-santacoder

ct2fast-gpt_bigcode-santacoder install

ct2fast-gpt_bigcode-santacoder is an open source model from GitHub that offers a free installation service, and any user can find ct2fast-gpt_bigcode-santacoder on GitHub to install. At the same time, huggingface.co provides the effect of ct2fast-gpt_bigcode-santacoder install, users can directly use ct2fast-gpt_bigcode-santacoder installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

ct2fast-gpt_bigcode-santacoder install url in huggingface.co:

https://huggingface.co/michaelfeil/ct2fast-gpt_bigcode-santacoder

Url of ct2fast-gpt_bigcode-santacoder

ct2fast-gpt_bigcode-santacoder huggingface.co Url

Provider of ct2fast-gpt_bigcode-santacoder huggingface.co

michaelfeil
ORGANIZATIONS

Other API from michaelfeil