We identified and fixed an issue related to a wrong permutation of some projections, which affects generation quality. To use the new model revision, please load as follows:
Evo is a biological foundation model capable of long-context modeling and design.
Evo uses the
StripedHyena architecture
to enable modeling of sequences at a single-nucleotide, byte-level resolution with near-linear scaling of compute and memory relative to context length.
Evo has 7 billion parameters and is trained on OpenGenome, a prokaryotic whole-genome dataset containing ~300 billion tokens.
Technical details about Evo can be found in our preprint and our accompanying blog posts. Evo was collaboratively developed by the
Arc Institute
and TogetherAI.
As part of our commitment to open science, we release
weights of 15 intermediate pretraining checkpoints
for phase 1 and phase 2 of pretraining. The checkpoints are available as branches of the corresponding HuggingFace repository.
Evo-1 (Phase 1)
is our first model in the Evo family, trained at a context length of 8k.
Checkpoint Name
Description
evo-1-8k-base
A model pretrained with 8,192 context. We use this model as the base model for molecular-scale finetuning tasks.
evo-1-131k-base
A model pretrained with 131,072 context using
evo-1-8k-base
as the initialization. We use this model to reason about and generate sequences at the genome scale.
Model Architecture
StripedHyena is a deep signal processing, hybrid architecture composed of multi-head attention and gated convolutions arranged in
Hyena
blocks, improving over decoder-only Transformers.
StripedHyena is designed to leverage the specialization of each of its layer classes, with Hyena layers implementing the bulk of the computation required for sequence processing and attention layers supplementing the ability to perform targeted pattern recall.
Some highlights of the architecture:
Efficient autoregressive generation
via a recurrent mode (>500k generation with a single 80GB GPU)
Significantly faster training and finetuning
at long context (>3x at 131k)
Improved scaling laws over state-of-the-art architectures
(e.g., Transformer++) on both natural language and biological sequences.
Robust to training beyond the compute-optimal frontier
e.g., training way beyond Chinchilla-optimal token amounts (see preprint for details -- more details to come)
One of the advantages of deep signal processing models is their flexibility. Different parametrizations of convolutions can be used depending on the memory, expressivity and causality requirements of pretraining, finetuning or inference workloads.
StripedHyena is a mixed precision model. Make sure to keep your
poles
and
residues
in
float32
precision, especially for longer prompts or training.
Disclaimer
To use StripedHyena outside of the playground, you will need to install custom kernels. Please follow the instructions from the
standalone repository
.
Cite
@article{nguyen2024sequence,
author = {Eric Nguyen and Michael Poli and Matthew G. Durrant and Armin W. Thomas and Brian Kang and Jeremy Sullivan and Madelena Y. Ng and Ashley Lewis and Aman Patel and Aaron Lou and Stefano Ermon and Stephen A. Baccus and Tina Hernandez-Boussard and Christopher Ré and Patrick D. Hsu and Brian L. Hie},
journal = {Arc Institute manuscripts},
title = {Sequence modeling and design from molecular to genome scale with Evo},
url = {https://arcinstitute.org/manuscripts/Evo},
year = {2024},
}
Runs of togethercomputer evo-1-8k-base on huggingface.co
3.3K
Total runs
-12
24-hour runs
-175
3-day runs
-441
7-day runs
187
30-day runs
More Information About evo-1-8k-base huggingface.co Model
evo-1-8k-base huggingface.co is an AI model on huggingface.co that provides evo-1-8k-base's model effect (), which can be used instantly with this togethercomputer evo-1-8k-base model. huggingface.co supports a free trial of the evo-1-8k-base model, and also provides paid use of the evo-1-8k-base. Support call evo-1-8k-base model through api, including Node.js, Python, http.
evo-1-8k-base huggingface.co is an online trial and call api platform, which integrates evo-1-8k-base's modeling effects, including api services, and provides a free online trial of evo-1-8k-base, you can try evo-1-8k-base online for free by clicking the link below.
togethercomputer evo-1-8k-base online free url in huggingface.co:
evo-1-8k-base is an open source model from GitHub that offers a free installation service, and any user can find evo-1-8k-base on GitHub to install. At the same time, huggingface.co provides the effect of evo-1-8k-base install, users can directly use evo-1-8k-base installed effect in huggingface.co for debugging and trial. It also supports api for free installation.