The MultiMolecule team has confirmed that the provided model and checkpoints are producing the same intermediate representations as the original implementation.
The team releasing BPNet did not write this model card for this model so this model card has been written by the MultiMolecule team.
Model Details
BPNet is a convolutional neural network (CNN) trained to predict base-resolution transcription-factor binding signal (ChIP-nexus) from primary DNA sequence. It uses a convolutional motif stem followed by a stack of dilated residual convolutions that aggregate ~1 kb of genomic context.
BPNet predicts a single base-resolution signal task whose output is factorized into two terminal branches that share the dilated-convolution backbone:
a
profile
branch predicting the
shape
of the binding signal as per-position multinomial logits, trained with a multinomial negative log-likelihood;
a
count
branch predicting the
total magnitude
of the signal as a scalar per task and strand (in log space), trained with mean-squared error.
The usable base-resolution prediction recombines the two branches as
softmax(profile_logits, positions) * exp(count_logits)
, exposed via
BPNetForProfilePrediction.postprocess
. Please refer to the
Training Details
section for more information on the training process.
Developed by
: Žiga Avsec, Melanie Weilert, Avanti Shrikumar, Sabrina Krueger, Amr Alexandari, Khyati Dalal, Robin Fropf, Charles McAnany, Julien Gagneur, Anshul Kundaje, Julia Zeitlinger
The recombined
track
is the usable base-resolution prediction. The last dimension stacks
num_tasks
(Oct4, Sox2, Nanog, Klf4) by
num_strands
(forward, reverse).
Training Details
BPNet was trained to predict the base-resolution ChIP-nexus binding profiles of the pluripotency transcription factors Oct4, Sox2, Nanog and Klf4 in mouse embryonic stem cells.
Training Data
The published BPNet-OSKN model was trained on ChIP-nexus profiles for Oct4, Sox2, Nanog and Klf4, using 1 kb genomic windows centered on detected binding peaks. The training regions and trained Keras checkpoint are distributed as part of the
BPNet manuscript data
.
Training Procedure
Training
The model was trained with a composite loss: a multinomial negative log-likelihood on the per-position profile shape plus a mean-squared-error regression on the log total counts.
Count head: per-task global average pooling + linear layer producing log-count scalars
Optimizer: Adam
Citation
@article{avsec2021baseresolution,
author = {Avsec, {\v{Z}}iga and Weilert, Melanie and Shrikumar, Avanti and Krueger, Sabrina and Alexandari, Amr and Dalal, Khyati and Fropf, Robin and McAnany, Charles and Gagneur, Julien and Kundaje, Anshul and Zeitlinger, Julia},
title = {Base-resolution models of transcription-factor binding reveal soft motif syntax},
journal = {Nature Genetics},
volume = 53,
number = 3,
pages = {354--366},
year = 2021,
publisher = {Nature Publishing Group},
doi = {10.1038/s41588-021-00782-6}
}
The artifacts distributed in this repository are part of the MultiMolecule project.
If you use MultiMolecule in your research, you must cite the MultiMolecule project as follows:
@software{chen_2024_12638419,
author = {Chen, Zhiyuan and Zhu, Sophia Y.},
title = {MultiMolecule},
doi = {10.5281/zenodo.12638419},
publisher = {Zenodo},
url = {https://doi.org/10.5281/zenodo.12638419},
year = 2024,
month = may,
day = 4
}
Contact
Please use GitHub issues of
MultiMolecule
for any questions or comments on the model card.
Please contact the authors of the
BPNet paper
for questions or comments on the paper/model.
bpnet huggingface.co is an AI model on huggingface.co that provides bpnet's model effect (), which can be used instantly with this multimolecule bpnet model. huggingface.co supports a free trial of the bpnet model, and also provides paid use of the bpnet. Support call bpnet model through api, including Node.js, Python, http.
bpnet huggingface.co is an online trial and call api platform, which integrates bpnet's modeling effects, including api services, and provides a free online trial of bpnet, you can try bpnet online for free by clicking the link below.
multimolecule bpnet online free url in huggingface.co:
bpnet is an open source model from GitHub that offers a free installation service, and any user can find bpnet on GitHub to install. At the same time, huggingface.co provides the effect of bpnet install, users can directly use bpnet installed effect in huggingface.co for debugging and trial. It also supports api for free installation.