microsoft / deberta-large

huggingface.co
Total runs: 7.2K
24-hour runs: 0
7-day runs: 282
30-day runs: -404
Model's Last Updated: September 26 2022
fill-mask

Introduction of deberta-large

Model Details of deberta-large

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

DeBERTa improves the BERT and RoBERTa models using disentangled attention and enhanced mask decoder. It outperforms BERT and RoBERTa on majority of NLU tasks with 80GB training data.

Please check the official repository for more details and updates.

Fine-tuning on NLU tasks

We present the dev results on SQuAD 1.1/2.0 and several GLUE benchmark tasks.

Model SQuAD 1.1 SQuAD 2.0 MNLI-m/mm SST-2 QNLI CoLA RTE MRPC QQP STS-B
F1/EM F1/EM Acc Acc Acc MCC Acc Acc/F1 Acc/F1 P/S
BERT-Large 90.9/84.1 81.8/79.0 86.6/- 93.2 92.3 60.6 70.4 88.0/- 91.3/- 90.0/-
RoBERTa-Large 94.6/88.9 89.4/86.5 90.2/- 96.4 93.9 68.0 86.6 90.9/- 92.2/- 92.4/-
XLNet-Large 95.1/89.7 90.6/87.9 90.8/- 97.0 94.9 69.0 85.9 90.8/- 92.3/- 92.5/-
DeBERTa-Large 1 95.5/90.1 90.7/88.0 91.3/91.1 96.5 95.3 69.5 91.0 92.6/94.6 92.3/- 92.8/92.5
DeBERTa-XLarge 1 -/- -/- 91.5/91.2 97.0 - - 93.1 92.1/94.3 - 92.9/92.7
DeBERTa-V2-XLarge 1 95.8/90.8 91.4/88.9 91.7/91.6 97.5 95.8 71.1 93.9 92.0/94.2 92.3/89.8 92.9/92.9
DeBERTa-V2-XXLarge 1,2 96.1/91.4 92.2/89.7 91.7/91.9 97.2 96.0 72.0 93.5 93.1/94.9 92.7/90.3 93.2/93.1

Notes.
cd transformers/examples/text-classification/
export TASK_NAME=mrpc
python -m torch.distributed.launch --nproc_per_node=8 run_glue.py   --model_name_or_path microsoft/deberta-v2-xxlarge   \\
--task_name $TASK_NAME   --do_train   --do_eval   --max_seq_length 128   --per_device_train_batch_size 4   \\
--learning_rate 3e-6   --num_train_epochs 3   --output_dir /tmp/$TASK_NAME/ --overwrite_output_dir --sharded_ddp --fp16
Citation

If you find DeBERTa useful for your work, please cite the following paper:

@inproceedings{
he2021deberta,
title={DEBERTA: DECODING-ENHANCED BERT WITH DISENTANGLED ATTENTION},
author={Pengcheng He and Xiaodong Liu and Jianfeng Gao and Weizhu Chen},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=XPZIaotutsD}
}

Runs of microsoft deberta-large on huggingface.co

7.2K
Total runs
0
24-hour runs
50
3-day runs
282
7-day runs
-404
30-day runs

More Information About deberta-large huggingface.co Model

More deberta-large license Visit here:

https://choosealicense.com/licenses/mit

deberta-large huggingface.co

deberta-large huggingface.co is an AI model on huggingface.co that provides deberta-large's model effect (), which can be used instantly with this microsoft deberta-large model. huggingface.co supports a free trial of the deberta-large model, and also provides paid use of the deberta-large. Support call deberta-large model through api, including Node.js, Python, http.

deberta-large huggingface.co Url

https://huggingface.co/microsoft/deberta-large

microsoft deberta-large online free

deberta-large huggingface.co is an online trial and call api platform, which integrates deberta-large's modeling effects, including api services, and provides a free online trial of deberta-large, you can try deberta-large online for free by clicking the link below.

microsoft deberta-large online free url in huggingface.co:

https://huggingface.co/microsoft/deberta-large

deberta-large install

deberta-large is an open source model from GitHub that offers a free installation service, and any user can find deberta-large on GitHub to install. At the same time, huggingface.co provides the effect of deberta-large install, users can directly use deberta-large installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

deberta-large install url in huggingface.co:

https://huggingface.co/microsoft/deberta-large

Url of deberta-large

deberta-large huggingface.co Url

Provider of deberta-large huggingface.co

microsoft
ORGANIZATIONS

Other API from microsoft

huggingface.co

Total runs: 652.7K
Run Growth: -982.2K
Growth Rate: -146.44%
Updated:December 08 2025
huggingface.co

Total runs: 575.6K
Run Growth: 218.2K
Growth Rate: 38.12%
Updated:February 03 2022
huggingface.co

Total runs: 531.0K
Run Growth: 429.5K
Growth Rate: 81.40%
Updated:April 08 2024
huggingface.co

Total runs: 484.5K
Run Growth: -221.9K
Growth Rate: -42.77%
Updated:November 25 2025
huggingface.co

Total runs: 306.0K
Run Growth: 9.8K
Growth Rate: 3.18%
Updated:February 14 2024
huggingface.co

Total runs: 187.3K
Run Growth: -3.1K
Growth Rate: -1.63%
Updated:November 08 2023
huggingface.co

Total runs: 128.6K
Run Growth: -160.0K
Growth Rate: -123.33%
Updated:February 03 2023
huggingface.co

Total runs: 122.1K
Run Growth: -349.4K
Growth Rate: -281.90%
Updated:September 26 2022
huggingface.co

Total runs: 109.1K
Run Growth: -38.1K
Growth Rate: -34.60%
Updated:February 29 2024
huggingface.co

Total runs: 84.2K
Run Growth: -34.3K
Growth Rate: -40.32%
Updated:August 28 2025
huggingface.co

Total runs: 78.5K
Run Growth: -16.0K
Growth Rate: -19.77%
Updated:November 25 2025
huggingface.co

Total runs: 72.1K
Run Growth: -29.7K
Growth Rate: -40.63%
Updated:December 03 2025
huggingface.co

Total runs: 42.4K
Run Growth: -11.8K
Growth Rate: -28.43%
Updated:December 23 2021
huggingface.co

Total runs: 41.0K
Run Growth: 10.0K
Growth Rate: 24.27%
Updated:October 09 2025
huggingface.co

Total runs: 31.0K
Run Growth: -8.1K
Growth Rate: -26.25%
Updated:October 11 2025