DeBERTa: Decoding-enhanced BERT with Disentangled Attention
DeBERTa improves the BERT and RoBERTa models using disentangled attention and an enhanced mask decoder. With 80GB of training data, it outperforms BERT and RoBERTa on a majority of NLU tasks.
1 Following RoBERTa, for RTE, MRPC, and STS-B we fine-tune starting from DeBERTa-Large-MNLI, DeBERTa-XLarge-MNLI, DeBERTa-V2-XLarge-MNLI, and DeBERTa-V2-XXLarge-MNLI. Results on SST-2, QQP, QNLI, and SQuADv2 also improve slightly when starting from MNLI fine-tuned models; however, for those four tasks we report only the numbers fine-tuned from the pretrained base models.
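The starting-checkpoint rule in note 1 can be sketched as a small lookup. This is only an illustration: the helper name `starting_checkpoint`, the task-name spellings, and the use of the Hub identifiers `microsoft/deberta-large` and `microsoft/deberta-large-mnli` for the Large model are assumptions made for this sketch, not part of the DeBERTa release.

```python
# Hypothetical helper illustrating note 1; names are assumptions, not DeBERTa APIs.

# Tasks that, following RoBERTa, are fine-tuned starting from an MNLI fine-tuned model.
MNLI_START_TASKS = {"rte", "mrpc", "sts-b"}

# Assumed Hub identifiers for the Large model and its MNLI fine-tuned variant.
PRETRAINED = "microsoft/deberta-large"
MNLI_FINETUNED = "microsoft/deberta-large-mnli"


def starting_checkpoint(task: str) -> str:
    """Return the checkpoint to start fine-tuning from for a given task name."""
    name = task.strip().lower()
    return MNLI_FINETUNED if name in MNLI_START_TASKS else PRETRAINED


if __name__ == "__main__":
    for task in ("RTE", "MRPC", "STS-B", "SST-2", "QQP", "QNLI", "SQuADv2"):
        print(f"{task:8s} -> {starting_checkpoint(task)}")
```

The other model sizes named in note 1 would follow the same pattern with their own pair of checkpoints.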
2 To try the XXLarge model with HF transformers, you need to specify the --sharded_ddp option.
If you find DeBERTa useful for your work, please cite the following paper:
@inproceedings{
he2021deberta,
title={DEBERTA: DECODING-ENHANCED BERT WITH DISENTANGLED ATTENTION},
author={Pengcheng He and Xiaodong Liu and Jianfeng Gao and Weizhu Chen},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=XPZIaotutsD}
}