This is the
ScholarBERT_100
variant of the ScholarBERT model family.
The model is pretrained on a large collection of scientific research articles (
221B tokens
).
This is a
cased
(case-sensitive) model. The tokenizer will not convert all inputs to lower-case by default.
The model is based on the same architecture as
BERT-large
and has a total of 340M parameters.
Model Architecture
Hyperparameter
Value
Layers
24
Hidden Size
1024
Attention Heads
16
Total Parameters
340M
Training Dataset
The vocab and the model are pertrained on
100% of the PRD
scientific literature dataset.
The PRD dataset is provided by Public.Resource.Org, Inc. (“Public Resource”),
a nonprofit organization based in California. This dataset was constructed from a corpus
of journal article files, from which We successfully extracted text from 75,496,055 articles from 178,928 journals.
The articles span across Arts & Humanities, Life Sciences & Biomedicine, Physical Sciences,
Social Sciences, and Technology. The distribution of articles is shown below.
BibTeX entry and citation info
If using this model, please cite this paper:
@misc{hong2023diminishing,
title={The Diminishing Returns of Masked Language Models to Science},
author={Zhi Hong and Aswathy Ajith and Gregory Pauloski and Eamon Duede and Kyle Chard and Ian Foster},
year={2023},
eprint={2205.11342},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Runs of globuslabs ScholarBERT on huggingface.co
2.8K
Total runs
0
24-hour runs
0
3-day runs
-2
7-day runs
2.8K
30-day runs
More Information About ScholarBERT huggingface.co Model
ScholarBERT huggingface.co is an AI model on huggingface.co that provides ScholarBERT's model effect (), which can be used instantly with this globuslabs ScholarBERT model. huggingface.co supports a free trial of the ScholarBERT model, and also provides paid use of the ScholarBERT. Support call ScholarBERT model through api, including Node.js, Python, http.
ScholarBERT huggingface.co is an online trial and call api platform, which integrates ScholarBERT's modeling effects, including api services, and provides a free online trial of ScholarBERT, you can try ScholarBERT online for free by clicking the link below.
globuslabs ScholarBERT online free url in huggingface.co:
ScholarBERT is an open source model from GitHub that offers a free installation service, and any user can find ScholarBERT on GitHub to install. At the same time, huggingface.co provides the effect of ScholarBERT install, users can directly use ScholarBERT installed effect in huggingface.co for debugging and trial. It also supports api for free installation.