Note: This model does contain the pretrained weights for the QASS layer (see paper for details). For the model without those weights, see tau/splinter-base.
Model description
Splinter is a model that is pretrained in a self-supervised fashion for few-shot question answering. That is, it was pretrained on raw text only, with no human labeling of any kind (which is why it can use large amounts of publicly available data); inputs and labels are generated from the text by an automatic process.
More precisely, it was pretrained with the Recurring Span Selection (RSS) objective, which emulates the span selection process involved in extractive question answering. Given a text, clusters of recurring spans (n-grams that appear more than once in the text) are first identified. For each such cluster, all of its instances but one are replaced with a special [QUESTION] token, and the model should select the correct (i.e., unmasked) span for each masked one. The model also defines the Question-Aware Span selection (QASS) layer, which selects spans conditioned on a specific question (in order to perform multiple predictions).
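The masking step of RSS described above can be illustrated with a small, self-contained sketch. This is not the authors' preprocessing code: the function names, the whitespace tokenization, the `max_len` cap, the longest-first greedy matching, and the label format are all assumptions made for illustration, and the paper's actual span filtering and masking budget are not reproduced here.

```python
import random
from collections import defaultdict

QUESTION = "[QUESTION]"

def find_recurring_spans(tokens, max_len=3):
    """Map each n-gram occurring more than once to its start positions.

    Assumption: longer n-grams are claimed first, and a token belongs to
    at most one cluster, so nested shorter repeats are skipped.
    """
    taken = [False] * len(tokens)
    clusters = {}
    for n in range(max_len, 0, -1):
        positions = defaultdict(list)
        for i in range(len(tokens) - n + 1):
            positions[tuple(tokens[i:i + n])].append(i)
        for gram, starts in positions.items():
            free = []
            for s in starts:
                # Skip instances that overlap a claimed span or each other.
                if not any(taken[s:s + n]) and (not free or s >= free[-1] + n):
                    free.append(s)
            if len(free) > 1:
                clusters[gram] = free
                for s in free:
                    taken[s:s + n] = [True] * n
    return clusters

def mask_recurring_spans(tokens, max_len=3, seed=0):
    """Replace all but one instance of each recurring span with [QUESTION].

    Returns the masked tokens and one label per [QUESTION] token:
    (question_index, answer_start, answer_end), with inclusive indices
    into the masked sequence pointing at the unmasked instance.
    """
    rng = random.Random(seed)
    plan = {}  # masked start -> (span length, start of the kept instance)
    for gram, starts in find_recurring_spans(tokens, max_len).items():
        keep = rng.choice(starts)
        for s in starts:
            if s != keep:
                plan[s] = (len(gram), keep)

    out, new_index, pending = [], {}, []
    i = 0
    while i < len(tokens):
        if i in plan:
            n, keep = plan[i]
            pending.append((len(out), keep, n))
            out.append(QUESTION)
            i += n
        else:
            new_index[i] = len(out)  # original index -> masked index
            out.append(tokens[i])
            i += 1
    labels = [(q, new_index[k], new_index[k + n - 1]) for q, k, n in pending]
    return out, labels

tokens = "the cat sat near the cat and the dog saw the cat".split()
masked, labels = mask_recurring_spans(tokens)
for q, start, end in labels:
    print(masked[q], "->", masked[start:end + 1])
```

In this toy input, the bigram "the cat" occurs three times, so two instances become [QUESTION] tokens and each label points back at the single instance left in place. Which instance is kept depends on the seed.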
Intended uses & limitations
The prime use for this model is few-shot extractive QA.
Pretraining
The model was pretrained on a v3-8 TPU for 2.4M steps. The training data is based on Wikipedia and BookCorpus. See the paper for more details.
BibTeX entry and citation info
@inproceedings{ram-etal-2021-shot,
title = "Few-Shot Question Answering by Pretraining Span Selection",
author = "Ram, Ori and
Kirstain, Yuval and
Berant, Jonathan and
Globerson, Amir and
Levy, Omer",
booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)",
month = aug,
year = "2021",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2021.acl-long.239",
doi = "10.18653/v1/2021.acl-long.239",
pages = "3066--3079",
}