Pretrained bidirectional encoder for the Russian language.
The model was trained with the standard masked language modeling (MLM) objective on large text corpora, including open social data.
See the Training Details section for more information.
⚠️ This model contains only the encoder part without any pretrained head.
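To make the MLM objective mentioned above concrete, here is a minimal sketch of BERT-style token masking in plain Python. The card does not state the masking ratios that were actually used, so the 15% selection rate and the 80/10/10 split are assumptions taken from the commonly used recipe. The `mask_id` argument is a hypothetical placeholder, and special tokens are not excluded from masking for brevity.

```python
import random

def mask_tokens(token_ids, mask_id, vocab_size, mask_prob=0.15, seed=None):
    """Return (corrupted_ids, labels) for one sequence.

    labels[i] holds the original token where the model must predict it,
    and None where no loss would be computed.
    """
    rng = random.Random(seed)
    corrupted = list(token_ids)
    labels = [None] * len(token_ids)
    for i, tok in enumerate(token_ids):
        if rng.random() >= mask_prob:
            continue  # position not selected for prediction
        labels[i] = tok
        roll = rng.random()
        if roll < 0.8:
            corrupted[i] = mask_id                    # 80%: replace with the mask token
        elif roll < 0.9:
            corrupted[i] = rng.randrange(vocab_size)  # 10%: replace with a random token
        # remaining 10%: keep the original token unchanged
    return corrupted, labels

# Toy input; mask_id=4 is an illustrative value, not the model's real mask id
corrupted, labels = mask_tokens(list(range(100, 120)), mask_id=4, vocab_size=50266, seed=0)
```

During pretraining the encoder sees `corrupted` and is trained to recover the original tokens at the positions where `labels` is set.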
Languages:
Mostly Russian, with a small fraction of other languages
License:
Apache 2.0
How to Get Started with the Model
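from transformers import AutoTokenizer, AutoModel

# Load the tokenizer and the bare encoder (no task head) from the Hugging Face Hub
tokenizer = AutoTokenizer.from_pretrained("deepvk/roberta-base")
model = AutoModel.from_pretrained("deepvk/roberta-base")

text = "Привет, мир!"  # "Hello, world!"
inputs = tokenizer(text, return_tensors='pt')
# Despite the name, this holds encoder hidden states, not task predictions
predictions = model(**inputs)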
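Because the checkpoint ships without a head, the output of the snippet above is a sequence of per-token hidden states. One common way to turn these into a single sentence vector is masked mean pooling. The sketch below is an illustrative choice, not something the card prescribes, and `mean_pool` is a hypothetical helper name. It runs on dummy tensors so that it does not need to download the model.

```python
import torch

def mean_pool(last_hidden_state: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    """Average token vectors over real (non-padding) positions."""
    mask = attention_mask.unsqueeze(-1).to(last_hidden_state.dtype)  # (batch, seq, 1)
    summed = (last_hidden_state * mask).sum(dim=1)                   # (batch, dim)
    counts = mask.sum(dim=1).clamp(min=1e-9)                         # avoid division by zero
    return summed / counts

# Dummy tensors standing in for the encoder output (batch=2, seq=4, dim=768)
hidden = torch.ones(2, 4, 768)
mask = torch.tensor([[1, 1, 1, 0], [1, 1, 0, 0]])
embeddings = mean_pool(hidden, mask)  # shape: (2, 768)
```

With the real model, the same helper would be called as `mean_pool(predictions.last_hidden_state, inputs["attention_mask"])`.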
Training Details
Training Data
500 GB of raw text in total.
A mix of the following data: Wikipedia, Books, Twitter comments, Pikabu, Proza.ru, Film subtitles, News websites, and Social corpus.
Training Hyperparameters

| Argument | Value |
|---|---|
| Training regime | fp16 mixed precision |
| Training framework | Fairseq |
| Optimizer | Adam |
| Adam betas | 0.9, 0.98 |
| Adam eps | 1e-6 |
| Num training steps | 500k |
The model was trained on a machine with 8xA100 for approximately 22 days.
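As a rough back-of-the-envelope reading of these figures, the snippet below converts them into GPU-hours and average wall-clock time per training step. It uses only the numbers stated in the card and treats "approximately 22 days" as exactly 22, so the results are estimates.

```python
# Figures taken from the card
num_gpus = 8
days = 22              # "approximately 22 days"
num_steps = 500_000

gpu_hours = num_gpus * days * 24
seconds_per_step = days * 24 * 3600 / num_steps

print(f"{gpu_hours} GPU-hours, ~{seconds_per_step:.1f} s per training step")
# prints "4224 GPU-hours, ~3.8 s per training step"
```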
Architecture details

| Argument | Value |
|---|---|
| Encoder layers | 12 |
| Encoder attention heads | 12 |
| Encoder embed dim | 768 |
| Encoder ffn embed dim | 3072 |
| Activation function | GELU |
| Attention dropout | 0.1 |
| Dropout | 0.1 |
| Max positions | 512 |
| Vocab size | 50266 |
| Tokenizer type | Byte-level BPE |
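The values above are enough to estimate the size of the encoder. The sketch below assumes a standard RoBERTa-style layout: learned position embeddings, biased linear projections, and two LayerNorms per layer. It ignores details such as the position-table padding offset, token-type embeddings, and any pooler, so the exact count in the released checkpoint may differ slightly. `estimate_encoder_params` is a hypothetical helper name.

```python
def estimate_encoder_params(vocab=50266, max_pos=512, dim=768, ffn=3072, layers=12):
    # Token embeddings + position embeddings + embedding LayerNorm (weight and bias)
    embeddings = vocab * dim + max_pos * dim + 2 * dim
    # Q, K, V and output projections, each with a bias
    attention = 4 * (dim * dim + dim)
    # Two linear layers of the feed-forward block, each with a bias
    feed_forward = (dim * ffn + ffn) + (ffn * dim + dim)
    # Two LayerNorms per layer, each with weight and bias
    layer_norms = 2 * 2 * dim
    per_layer = attention + feed_forward + layer_norms
    return embeddings + layers * per_layer

total = estimate_encoder_params()
print(f"~{total / 1e6:.1f}M parameters")  # prints "~124.1M parameters"
```

Roughly 39M of these parameters sit in the embedding matrix, which reflects the 50266-entry vocabulary.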
Evaluation
We evaluated the model on the Russian SuperGLUE dev set.
The best result in each task is marked in bold.
All models have the same size except the distilled version of DeBERTa.