CohereLabs / tiny-aya-base

text-generation

Model Card for Tiny Aya Base

Model Summary

Cohere Labs Tiny Aya is an open weights research release of a pretrained 3.35 billion parameter model optimized for efficient, strong, and balanced multilingual representation across 70+ languages, including many lower-resourced ones. The model is designed to support downstream adaptation, instruction tuning, and local deployment under realistic compute constraints.

Developed by: Cohere and Cohere Labs

For more details about this model family, please check out our blog post, tech report, and our chat models.

Try Cohere Labs Tiny Aya

You can try out the chat models in our hosted Hugging Face Space.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "CohereLabs/tiny-aya-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Tokenize the prompt and move it to the model's device.
prompt = "The capital of Spain is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Sample a completion; a low temperature keeps the output focused.
outputs = model.generate(
    **inputs,
    max_new_tokens=128,
    do_sample=True,
    temperature=0.3,
    top_p=0.9,
    top_k=50,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
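
On memory-constrained hardware you may want to load the weights in half precision, matching the bfloat16 setting used in the pipeline example below. A minimal sketch (device_map="auto" assumes the accelerate package is installed; it is not required by the model card):

import torch
from transformers import AutoModelForCausalLM

# Optional memory-saving variant (an assumption, not from the model card):
# load weights in bfloat16 and let accelerate place them on available devices.
model = AutoModelForCausalLM.from_pretrained(
    "CohereLabs/tiny-aya-base",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)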

You can also use the model directly via the Transformers pipeline abstraction:

from transformers import pipeline

# The pipeline handles tokenization, generation, and decoding in one call.
generator = pipeline(
    "text-generation",
    model="CohereLabs/tiny-aya-base",
    torch_dtype="bfloat16",
)

output = generator(
    "The capital of Spain is",
    max_new_tokens=128,
    do_sample=True,
    temperature=0.3,
    top_p=0.9,
    repetition_penalty=1.1,
)
print(output[0]["generated_text"])
Model Details

Input: Text only.

Output: The model generates text.

Model Architecture: This is an auto-regressive language model that uses an optimized transformer architecture. (The chat variants of this family are further aligned to human preferences for helpfulness and safety through supervised fine-tuning (SFT) and preference training; this release is the pretrained base model.) Three out of every four layers use sliding window attention (window size 4096) with RoPE for efficient local context modeling and relative positional encoding. Every fourth layer uses global attention without positional embeddings, enabling unrestricted token interactions across the entire sequence.
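
As a quick sanity check on this attention layout, you can read the relevant settings from the model config. This is a sketch only: the field names below (sliding_window, sliding_window_pattern) are assumptions based on similar Cohere configs in Transformers and may differ for this model.

from transformers import AutoConfig

config = AutoConfig.from_pretrained("CohereLabs/tiny-aya-base")

# Field names are assumptions; getattr keeps this safe if they differ.
print(getattr(config, "sliding_window", None))          # expected 4096 per the card
print(getattr(config, "sliding_window_pattern", None))  # e.g. 4 -> every 4th layer is global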

Languages covered: The model has been trained on 70+ languages, with a focus on: English, Dutch, French, Italian, Portuguese, Romanian, Spanish, Czech, Polish, Ukrainian, Russian, Greek, German, Danish, Swedish, Norwegian, Catalan, Galician, Welsh, Irish, Basque, Croatian, Latvian, Lithuanian, Slovak, Slovenian, Estonian, Finnish, Hungarian, Serbian, Bulgarian, Arabic, Persian, Urdu, Turkish, Maltese, Hebrew, Hindi, Marathi, Bengali, Gujarati, Punjabi, Tamil, Telugu, Nepali, Tagalog, Malay, Indonesian, Vietnamese, Javanese, Khmer, Thai, Lao, Chinese, Burmese, Japanese, Korean, Amharic, Hausa, Igbo, Malagasy, Shona, Swahili, Wolof, Xhosa, Yoruba, and Zulu.

Context Length: Tiny Aya supports an 8K context window and up to 8K output tokens.
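
A minimal sketch for staying inside that window: count prompt tokens and budget max_new_tokens from the remainder (this assumes "8K" means 8192 tokens, which the card does not state explicitly).

from transformers import AutoTokenizer

CONTEXT_LIMIT = 8192  # assumption: "8K" = 8192 tokens

tokenizer = AutoTokenizer.from_pretrained("CohereLabs/tiny-aya-base")
prompt = "Some long multilingual prompt..."
prompt_len = len(tokenizer(prompt)["input_ids"])

# Leave room for generation inside the shared context window.
budget = max(0, CONTEXT_LIMIT - prompt_len)
print(f"prompt tokens: {prompt_len}, generation budget: {budget}")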

Usage and Limitations
Intended Usage

Tiny Aya is a family of massively multilingual small language models built to bring capable AI to languages that are often underserved by existing models. The models support languages across West, East and Southeast Asian, African, European, and Middle Eastern language families, with a deliberate emphasis on low-resource language performance.

Intended applications include multilingual text generation, conversational AI, summarization, translation and cross-lingual tasks, as well as research in multilingual NLP and low-resource language modeling. The models are also suited for efficient deployment in multilingual regions, helping bridge the digital language divide for underrepresented language communities.
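
Because this is a base model rather than an instruction-tuned one, cross-lingual tasks such as translation are best framed as few-shot completions. A minimal sketch; the prompt contents are illustrative and not taken from the model card:

from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="CohereLabs/tiny-aya-base",
    torch_dtype="bfloat16",
)

# Few-shot, completion-style prompt: the base model continues the pattern.
prompt = (
    "English: Good morning.\nSpanish: Buenos días.\n"
    "English: Where is the train station?\nSpanish: ¿Dónde está la estación de tren?\n"
    "English: Thank you very much.\nSpanish:"
)
output = generator(prompt, max_new_tokens=20, do_sample=False)
print(output[0]["generated_text"])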

Strengths

Tiny Aya demonstrates strong open-ended generation quality across its full language coverage, with particularly notable performance on low-resource languages. The model performs well on translation, summarization, and cross-lingual tasks, benefiting from training signal shared across language families and scripts.

Limitations

Reasoning tasks. The model's strongest performance is on open-ended generation and conversational tasks. Chain-of-thought reasoning tasks such as multilingual math (MGSM) are comparatively weaker.

Factual knowledge. As with any language model, outputs may contain incorrect or outdated statements, particularly in lower-resource languages with thinner training data coverage.

Uneven resource distribution. High-resource languages benefit from richer training signal and tend to exhibit more consistent quality across tasks. The lowest-resource languages in the model's coverage may show greater variability, and culturally specific nuance, sarcasm, or figurative language may be less reliably handled in these languages.

Task complexity. The model performs best with clear prompts and instructions. Highly complex or open-ended reasoning, particularly in lower-resource languages, remains challenging.

Model Card Contact

For errors or additional questions about details in this model card, contact [email protected].

Terms of Use:

We hope that the release of this model will make community-based research efforts more accessible by releasing the weights of a highly performant 3.35 billion parameter model to researchers all over the world. This model is governed by a CC-BY-NC license (Non-Commercial, https://choosealicense.com/licenses/cc-by-nc-4.0) with an acceptable use addendum, and also requires adhering to Cohere Labs' Acceptable Use Policy. If you are interested in commercial use, please contact Cohere's Sales team.

Try it now:

You can try Tiny Aya in our dedicated Hugging Face Space.
