SHAMI-MT: A Machine Translation Model from MSA to Syrian Dialect

Model Description

SHAMI-MT is a specialized machine translation model designed to translate from Modern Standard Arabic (MSA) to Syrian dialect. Built on the robust AraT5v2-base-1024 architecture, this model bridges the gap between formal Arabic and the rich dialectal variations of Syrian Arabic.

Model Details
  • Model Type : Sequence-to-Sequence Translation
  • Base Model : UBC-NLP/AraT5v2-base-1024
  • Language : Arabic (MSA → Syrian Dialect)
  • License : Apache 2.0
  • Library : Transformers
Dataset

The model was trained on the Nâbra dataset, a comprehensive corpus of Syrian Arabic dialects with morphological annotations.

Nâbra Dataset Details

Citation:

Nayouf, A., Hammouda, T., Jarrar, M., Zaraket, F., & Kurdy, M. B. (2023). Nâbra: Syrian Arabic dialects with morphological annotations. arXiv preprint arXiv:2310.17315.

Key Statistics:

  • Tokens : ~60,000
  • Dialects Covered : Multiple Syrian regional dialects including:
    • Aleppo
    • Damascus
    • Deir-ezzur
    • Hama
    • Homs
    • Huran
    • Latakia
    • Mardin
    • Raqqah
    • Suwayda

Data Sources:

  • Social media posts
  • Movie and TV series scripts
  • Song lyrics
  • Local proverbs
Training Details

The model was fine-tuned from the UBC-NLP/AraT5v2-base-1024 checkpoint with the following training metrics (an illustrative configuration sketch follows the list):

  • Total Training Steps : 10,384
  • Epochs : 22
  • Final Training Loss : 1.396
  • Final Evaluation Loss : 0.771
  • Learning Rate : Cosine schedule starting at 5e-5
  • Batch Size : 256
  • Total FLOPs : 1.58e+17
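
For reference, the reported hyperparameters roughly correspond to a Hugging Face Seq2SeqTrainingArguments configuration along the lines of the sketch below. This is a hypothetical reconstruction, not the authors' published training script; the output path, commented-out trainer wiring, and dataset objects are assumptions.

from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

model_name = "UBC-NLP/AraT5v2-base-1024"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

args = Seq2SeqTrainingArguments(
    output_dir="shami-mt",            # assumed output path
    num_train_epochs=22,              # reported: 22 epochs
    per_device_train_batch_size=256,  # reported batch size
    learning_rate=5e-5,               # reported starting learning rate
    lr_scheduler_type="cosine",       # reported cosine schedule
    predict_with_generate=True,
)

# The MSA→Shami sentence pairs from Nâbra would be tokenized into
# train_ds / eval_ds (not shown) before training:
# trainer = Seq2SeqTrainer(
#     model=model,
#     args=args,
#     train_dataset=train_ds,
#     eval_dataset=eval_ds,
#     data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
# )
# trainer.train()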
Training Progress

The model showed consistent improvement throughout training:

  • Initial loss: 12.93 → Final loss: 1.40
  • Evaluation loss steadily decreased from 1.44 to 0.77
  • Gradient norms remained stable throughout training
Usage
Installation
pip install transformers torch sentencepiece
Inference Code
from transformers import T5Tokenizer, AutoModelForSeq2SeqLM

# Load model and tokenizer
tokenizer = T5Tokenizer.from_pretrained("Omartificial-Intelligence-Space/Shami-MT")
model = AutoModelForSeq2SeqLM.from_pretrained("Omartificial-Intelligence-Space/Shami-MT")

# Example usage
ar_prompt = "مرحبا بك هنا"  # MSA input
input_ids = tokenizer(ar_prompt, return_tensors="pt").input_ids
outputs = model.generate(input_ids, max_new_tokens=128)  # allow up to 128 new tokens

print("Input (MSA):", ar_prompt)
print("Tokenized input:", tokenizer.tokenize(ar_prompt))
print("Output (Syrian Dialect):", tokenizer.decode(outputs[0], skip_special_tokens=True))
Generation Parameters

For optimal results, you can adjust generation parameters:

outputs = model.generate(
    input_ids,
    max_length=128,
    num_beams=4,
    temperature=0.7,
    do_sample=True,
    pad_token_id=tokenizer.pad_token_id,
    eos_token_id=tokenizer.eos_token_id
)
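
Note that combining num_beams with do_sample=True selects Transformers' beam-sample decoding, which injects randomness into beam search; dropping do_sample and temperature falls back to plain beam search, which is preferable when reproducible translations are needed.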
Evaluation Results
  • Test Set : 1,500 unseen sentences
  • Evaluation Method : GPT-4.1 as automated judge
  • Average Score : 4.01/5.0
  • Evaluation Criteria : Translation quality, dialectal accuracy, and semantic preservation

The model was evaluated using GPT-4.1 as an automated judge with the following structured prompt:

"You are a language evaluation assistant. Compare the predicted Shami sentence to the reference.
Please return a rating from 0 to 5 and a short comment.

MSA Input: [input sentence]
Model Prediction (Shami dialect): [model output]
Ground Truth (Shami dialect): [reference translation]

Respond in this format:
Score: <number from 0 to 5>
Comment: <brief explanation of the score>"
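
As an illustration of how such a judge loop could be scripted, the sketch below fills the template for each test sentence and parses the judge's reply. The ask_llm callable, the parsing regexes, and the overall structure are assumptions for illustration, not the authors' published evaluation code.

import re

# The structured prompt shown above, with format placeholders.
JUDGE_PROMPT = (
    "You are a language evaluation assistant. Compare the predicted Shami "
    "sentence to the reference.\n"
    "Please return a rating from 0 to 5 and a short comment.\n\n"
    "MSA Input: {msa}\n"
    "Model Prediction (Shami dialect): {pred}\n"
    "Ground Truth (Shami dialect): {ref}\n\n"
    "Respond in this format:\n"
    "Score: <number from 0 to 5>\n"
    "Comment: <brief explanation of the score>"
)

def parse_judgment(reply):
    """Extract the numeric score and the comment from a judge reply."""
    score = re.search(r"Score:\s*([0-5](?:\.\d+)?)", reply)
    comment = re.search(r"Comment:\s*(.+)", reply, re.DOTALL)
    return (float(score.group(1)) if score else None,
            comment.group(1).strip() if comment else "")

def judge_translation(msa, pred, ref, ask_llm):
    """ask_llm is any callable that sends a prompt to the judge model
    (e.g. GPT-4.1 behind an API client) and returns its text reply."""
    reply = ask_llm(JUDGE_PROMPT.format(msa=msa, pred=pred, ref=ref))
    return parse_judgment(reply)

Averaging the parsed scores over the 1,500-sentence test set yields the reported 4.01/5.0 average.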

Score Distribution Analysis:

  • Excellent (5.0) : High-quality translations with perfect dialectal conversion
  • Good (4.0-4.9) : Minor dialectal variations or stylistic differences
  • Average (3.0-3.9) : Acceptable translations with some dialectal inconsistencies
  • Below Average (2.0-2.9) : Noticeable errors in dialect or meaning
  • Poor (0-1.9) : Significant translation errors or loss of meaning
Performance Highlights
  • Strong Dialectal Conversion : Successfully transforms MSA into authentic Syrian dialect
  • Semantic Preservation : Maintains original meaning while adapting linguistic style
  • Regional Adaptability : Handles various Syrian sub-dialects effectively
  • Consistent Quality : Stable performance across different text types and domains
Applications

This model is particularly useful for:

  • Content Localization : Adapting MSA content for Syrian audiences
  • Cultural Preservation : Maintaining and promoting Syrian dialectal variations
  • Educational Tools : Teaching differences between MSA and Syrian dialect
  • Research : Syrian Arabic NLP and dialectology studies
Regional Coverage

The model handles multiple Syrian sub-dialects, making it versatile for different regions within Syria:

🏛️ Urban Centers : Damascus, Aleppo
🏔️ Northern Regions : Latakia, Mardin
🏜️ Eastern Areas : Deir-ezzur, Raqqah
🌄 Central/Southern : Hama, Homs, Huran, Suwayda

Limitations
  • Trained specifically on Syrian dialect variations
  • Performance may vary for other Arabic dialects
  • Limited to text-based translation (no speech support)
  • Dataset size constraints may affect handling of very rare dialectal expressions
Citation

If you use this model in your research, please cite:

@misc{shami-mt-2024,
  title={SHAMI-MT: A Machine Translation Model From MSA to Syrian Dialect},
  author={Omartificial Intelligence Space},
  year={2024},
  publisher={Hugging Face},
  url={https://huggingface.co/Omartificial-Intelligence-Space/Shami-MT}
}

@article{nayouf2023nabra,
  title={Nâbra: Syrian Arabic dialects with morphological annotations},
  author={Nayouf, Amal and Hammouda, Tymaa Hasanain and Jarrar, Mustafa and Zaraket, Fadi A and Kurdy, Mohamad-Bassam},
  journal={arXiv preprint arXiv:2310.17315},
  year={2023}
}

@misc{onajar2025shamiMT,
  title={Shami-MT-2MSA: A Machine Translation from Syrian Dialect to MSA},
  author={Sibaee, Serry and Nacar, Omer},
  year={2025}
}
Contact & Support

For questions, issues, or contributions, please visit the model repository or contact the development team.
