multimolecule / bpfold

Model's Last Updated: June 01 2026

BPfold

Pre-trained model for RNA secondary structure prediction using base pair motif energy.

Disclaimer

This is an UNOFFICIAL implementation of "Deep generalizable prediction of RNA secondary structure via base pair motif energy" by Heqin Zhu, Fenghe Tang, Quan Quan, Ke Chen, Peng Xiong, and S. Kevin Zhou.

The OFFICIAL repository of BPfold is at heqin-zhu/BPfold.

The MultiMolecule implementation preserves the released BPfold architecture, base-pair motif energy feature construction, and canonical/non-canonical post-processing semantics.

The team releasing BPfold did not write a model card for this model, so this model card was written by the MultiMolecule team.

Model Details

BPfold predicts RNA base-pair contact maps from a single RNA sequence. It augments a transformer encoder with two L x L base-pair motif energy maps computed from three-neighbor base-pair motifs. MultiMolecule exposes BPfold as a single checkpoint and stores the motif-energy lookup tables inside it.

The model uses:

  • token order : follows the MultiMolecule tokenizer.
  • unknown bases : tokenized as N and treated as U during BPfold feature construction, matching the upstream fallback; padding follows attention_mask.
  • self-attention : dynamic position bias with adjacency bias from motif-energy maps.
  • pairwise convolutions : three residual 2D convolution layers over the adjacency maps before the transformer blocks.
  • post-processing : constrained refinement for canonical pairs, plus the optional BPfold non-canonical pass and mixed canonical/non-canonical outputs.
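
The input conventions listed above can be illustrated with a minimal sketch. It is not the MultiMolecule or BPfold implementation: `normalize_bases` and `canonical_pair_mask` are hypothetical helpers, treating every character outside ACGU as unknown is an assumption, and the minimum loop length of 3 unpaired bases is an assumed, commonly used value rather than one stated in this card. The sketch maps unknown bases to U and builds the L x L mask of positions that could form a canonical (A-U, G-C, G-U) pair.

```python
from typing import List

# Watson-Crick and wobble pairs, in both orientations.
CANONICAL_PAIRS = {
    ("A", "U"), ("U", "A"),
    ("G", "C"), ("C", "G"),
    ("G", "U"), ("U", "G"),
}


def normalize_bases(sequence: str) -> str:
    """Uppercase the sequence and replace anything outside ACGU with U.

    Assumption: every non-ACGU character counts as an unknown base (N).
    """
    return "".join(base if base in "ACGU" else "U" for base in sequence.upper())


def canonical_pair_mask(sequence: str, min_loop: int = 3) -> List[List[int]]:
    """Return a symmetric L x L 0/1 mask of candidate canonical pairs.

    min_loop is the assumed minimum number of unpaired bases between
    the two partners of a pair (hypothetical default, not from the card).
    """
    seq = normalize_bases(sequence)
    n = len(seq)
    mask = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + min_loop + 1, n):
            if (seq[i], seq[j]) in CANONICAL_PAIRS:
                mask[i][j] = mask[j][i] = 1
    return mask
```

A mask like this only marks which positions are allowed to pair; the motif-energy maps described in this card carry real-valued energies and are built from lookup tables stored in the checkpoint.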

Model Specification

| Num Layers | Hidden Size | Num Parameters (M) | FLOPs (G) | MACs (G) |
| ---------- | ----------- | ------------------ | --------- | -------- |
| 12         | 256         | 47.77              | 87.78     | 42.74    |

FLOPs and MACs are computed with multimolecule.utils for one 600 nt sequence.

Usage

The model file depends on the multimolecule library. You can install it using pip:

pip install multimolecule

RNA Secondary Structure Pipeline

import multimolecule  # importing multimolecule registers its models and pipelines
from transformers import pipeline

predictor = pipeline("rna-secondary-structure", model="multimolecule/bpfold")
output = predictor("GGUAAAACAGCCUGU")

PyTorch Inference

from multimolecule import BpfoldModel, RnaTokenizer

tokenizer = RnaTokenizer.from_pretrained("multimolecule/bpfold")
model = BpfoldModel.from_pretrained("multimolecule/bpfold")
# Named `inputs` to avoid shadowing the Python builtin `input`.
inputs = tokenizer("GGUAAAACAGCCUGU", return_tensors="pt")

output = model(**inputs)
contact_map = output.contact_map  # (1, L, L) base-pair probability matrix
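
To turn a contact map into a list of base pairs, one simple option is to threshold the probabilities and keep the most confident non-conflicting pairs. The sketch below is an illustration under stated assumptions, not BPfold's constrained refinement: `decode_pairs` and `to_dot_bracket` are hypothetical helpers, the 0.5 threshold and the minimum loop length of 3 are assumed values, and crossing (pseudoknotted) pairs are simply dropped so the result fits plain dot-bracket notation. It operates on a nested list of probabilities, such as what `contact_map[0].tolist()` would give if the contact map is an L x L tensor.

```python
from typing import List, Sequence, Tuple


def decode_pairs(
    probs: Sequence[Sequence[float]],
    threshold: float = 0.5,  # assumed cutoff, not taken from the model card
    min_loop: int = 3,       # assumed minimum number of unpaired loop bases
) -> List[Tuple[int, int]]:
    """Greedily pick high-probability, non-overlapping, non-crossing pairs."""
    n = len(probs)
    candidates = [
        (probs[i][j], i, j)
        for i in range(n)
        for j in range(i + min_loop + 1, n)
        if probs[i][j] >= threshold
    ]
    candidates.sort(reverse=True)  # most confident candidates first

    paired = set()
    pairs: List[Tuple[int, int]] = []
    for _, i, j in candidates:
        if i in paired or j in paired:
            continue  # each base may pair at most once
        if any(k < i < l < j or i < k < j < l for k, l in pairs):
            continue  # skip pairs that cross an accepted pair
        pairs.append((i, j))
        paired.update((i, j))
    return sorted(pairs)


def to_dot_bracket(length: int, pairs: Sequence[Tuple[int, int]]) -> str:
    """Render nested pairs as a dot-bracket string."""
    chars = ["."] * length
    for i, j in pairs:
        chars[i], chars[j] = "(", ")"
    return "".join(chars)
```

The greedy selection is a deliberate simplification: it is easy to follow, but it does not handle non-canonical pairs or pseudoknots, which the model's own post-processing is described as supporting.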

Training Details

BPfold was trained for RNA secondary structure prediction with base-pair motif energy priors.

Training Data
  • RNAStrAlign: 37,149 RNAs from eight RNA families were filtered to remove redundant sequences and invalid secondary structures, yielding 29,647 unique RNAs. Sequences longer than 600 nt were removed for training, leaving 19,313 training RNAs.
  • bpRNA-1m: 102,318 RNAs from 2,588 families were deduplicated with CD-HIT at 80% sequence identity and split into TR0/TS0 with 12,114/1,305 RNAs.
  • evaluation data: ArchiveII contains 3,966 RNAs; Rfam12.3-14.10 contains 10,791 RNAs from 1,992 families; bpRNA-new contains 5,401 RNAs; PDB contains 116 high-resolution RNAs split into TS1/TS2/TS3.

Training Procedure
  • objective: binary cross entropy over base-pair contact maps.
  • optimizer: Adam.
  • learning rate: 5e-4.
  • training epochs: 150.
  • batch size: 48.
  • positive-class weight: 300.
  • batching: length-matching mini-batches to reduce padding.
  • sequence features: token embeddings converted to the MultiMolecule tokenizer order.
  • structural priors: two L x L energy maps from three-neighbor base-pair motifs.
  • post-processing: constrained refinement for canonical pairs, minimum loop length, non-overlapping pairs, and isolated-pair removal.
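
Two of the settings above, the positive-class weight and length-matched batching, can be sketched in plain Python. This is a minimal illustration, not the upstream training code: `weighted_bce` and `length_matched_batches` are hypothetical helpers, and averaging the loss over all map entries and sorting sequences by length before chunking are assumptions. Base pairs are rare in an L x L contact map, which is why positive entries are up-weighted.

```python
import math
from typing import List, Sequence


def weighted_bce(
    probs: Sequence[Sequence[float]],
    targets: Sequence[Sequence[int]],
    pos_weight: float = 300.0,  # positive-class weight listed in the card
    eps: float = 1e-7,          # clamp to avoid log(0); assumed value
) -> float:
    """Mean binary cross entropy with up-weighted positive entries."""
    total = 0.0
    count = 0
    for p_row, t_row in zip(probs, targets):
        for p, t in zip(p_row, t_row):
            p = min(max(p, eps), 1.0 - eps)
            total -= pos_weight * t * math.log(p) + (1 - t) * math.log(1.0 - p)
            count += 1
    return total / count


def length_matched_batches(
    sequences: Sequence[str],
    batch_size: int = 48,  # batch size listed in the card
) -> List[List[str]]:
    """Group sequences of similar length so each batch needs little padding."""
    ordered = sorted(sequences, key=len)
    return [ordered[i:i + batch_size] for i in range(0, len(ordered), batch_size)]
```

In PyTorch, a comparable weighting is available through the `pos_weight` argument of `torch.nn.BCEWithLogitsLoss`, which expects logits rather than probabilities.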

Citation

@article{zhu2025bpfold,
  title   = {Deep generalizable prediction of {RNA} secondary structure via base pair motif energy},
  author  = {Zhu, Heqin and Tang, Fenghe and Quan, Quan and Chen, Ke and Xiong, Peng and Zhou, S. Kevin},
  journal = {Nature Communications},
  volume  = {16},
  number  = {1},
  pages   = {5856},
  year    = {2025},
  doi     = {10.1038/s41467-025-60048-1},
  url     = {https://doi.org/10.1038/s41467-025-60048-1}
}

The artifacts distributed in this repository are part of the MultiMolecule project. If you use MultiMolecule in your research, you must cite the MultiMolecule project.

License

This model is licensed under the GNU Affero General Public License.

For additional terms and clarifications, please refer to our License FAQ.

SPDX-License-Identifier: AGPL-3.0-or-later
