Bernini is a unified framework for video generation and editing that combines an MLLM-based semantic planner with a DiT-based renderer.
Compared with the renderer-only Bernini-R release,
Bernini-Diffusers
packages the full semantic-planning pipeline: a Qwen2.5-VL planner, Bernini planning weights, and Wan2.2 diffusion components in one self-contained directory. This makes it the recommended release when you need stronger instruction following, multi-step semantic planning, and better handling of complex video editing requests.
🧾 Model card
Field
Description
Model type
Full video generation/editing pipeline with an MLLM-based semantic planner and a DiT-based renderer.
On video editing, Bernini reaches the first tier among leading closed-source commercial models in our internal arena evaluation based on blind human pairwise comparisons.
📦 Package layout
This release is a
self-contained diffusers-format directory
. Pass the downloaded
Bernini-Diffusers
directory directly to
--config
.
--use_pe
enhances the prompt through an OpenAI-compatible endpoint and is recommended for best generation quality.
export BERNINI_PE_API_KEY=... # or OPENAI_API_KEYexport BERNINI_PE_BASE_URL=... # or OPENAI_BASE_URLexport BERNINI_PE_MODEL=... # vision-capable chat model
The
scripts/bernini/
directory in the Bernini repo provides ready-to-run task launchers for the full pipeline:
run_t2i.sh
run_i2i.sh
run_t2v.sh
run_v2v.sh
run_rv2v.sh
run_r2v.sh
run_gradio.sh
You can override the model directory with:
export BERNINI_CONFIG=/path/to/Bernini-Diffusers
📑 Citation
If you use Bernini in your research, please cite:
@article{bernini,
title = {Bernini: Latent Semantic Planning for Video Diffusion},
author = {Chenchen Liu and Junyi Chen and Lei Li and Lu Chi and Mingzhen Sun and Zhuoying Li and Yi Fu and Ruoyu Guo and Yiheng Wu and Ge Bai and Zehuan Yuan},
journal = {arXiv preprint arXiv:2605.22344},
year = {2026}
}
🙏 Acknowledgements
Bernini builds on several outstanding open-source projects:
Bernini-Diffusers huggingface.co is an AI model on huggingface.co that provides Bernini-Diffusers's model effect (), which can be used instantly with this ByteDance Bernini-Diffusers model. huggingface.co supports a free trial of the Bernini-Diffusers model, and also provides paid use of the Bernini-Diffusers. Support call Bernini-Diffusers model through api, including Node.js, Python, http.
Bernini-Diffusers huggingface.co is an online trial and call api platform, which integrates Bernini-Diffusers's modeling effects, including api services, and provides a free online trial of Bernini-Diffusers, you can try Bernini-Diffusers online for free by clicking the link below.
ByteDance Bernini-Diffusers online free url in huggingface.co:
Bernini-Diffusers is an open source model from GitHub that offers a free installation service, and any user can find Bernini-Diffusers on GitHub to install. At the same time, huggingface.co provides the effect of Bernini-Diffusers install, users can directly use Bernini-Diffusers installed effect in huggingface.co for debugging and trial. It also supports api for free installation.