Cedille is a project to bring large language models to non-English languages.
fr-boris
Boris is a 6B parameter autoregressive language model based on the GPT-J architecture and trained using the
mesh-transformer-jax
codebase.
Boris was trained on around 78B tokens of French text from the
C4
dataset. We started training from GPT-J, which has been trained on
The Pile
. As a consequence the model still has good performance in English language. Boris makes use of the unmodified GPT-2 tokenizer.
Boris is named after the great French writer
Boris Vian
.
Thanks for citing our work if you make use of Cedille
@misc{muller2022cedille,
title={Cedille: A large autoregressive French language model},
author={Martin M{\"{u}}ller and Florian Laurent},
year={2022},
eprint={2202.03371},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
fr-boris huggingface.co is an AI model on huggingface.co that provides fr-boris's model effect (), which can be used instantly with this Cedille fr-boris model. huggingface.co supports a free trial of the fr-boris model, and also provides paid use of the fr-boris. Support call fr-boris model through api, including Node.js, Python, http.
fr-boris huggingface.co is an online trial and call api platform, which integrates fr-boris's modeling effects, including api services, and provides a free online trial of fr-boris, you can try fr-boris online for free by clicking the link below.
Cedille fr-boris online free url in huggingface.co:
fr-boris is an open source model from GitHub that offers a free installation service, and any user can find fr-boris on GitHub to install. At the same time, huggingface.co provides the effect of fr-boris install, users can directly use fr-boris installed effect in huggingface.co for debugging and trial. It also supports api for free installation.