mudler / vibevoice.cpp-models

Model's Last Updated: April 29 2026

vibevoice.cpp — quantized model bundle

Brought to you by the LocalAI team — the creators of LocalAI, the open-source AI engine that runs any model — LLMs, vision, voice, image, video — on any hardware. No GPU required.

Quantized GGUF weights for vibevoice.cpp, a C/C++ port of Microsoft VibeVoice (TTS + ASR) on top of ggml.

| File | Source | Quant | Size |
|---|---|---|---|
| vibevoice-realtime-0.5B-q8_0.gguf | microsoft/VibeVoice-Realtime-0.5B | Q8_0 (matmul) + F16 | ~1.6 GB |
| vibevoice-asr-q8_0.gguf | microsoft/VibeVoice-ASR | Q8_0 (matmul) + F16 | ~13 GB |
| voice-en-Carter_man.gguf | upstream voice prompt cache | F16 | 8 MB |
| voice-en-Emma.gguf | upstream voice prompt cache | F16 | 6 MB |
| tokenizer.gguf | Qwen2.5 BPE + VibeVoice specials | | 6 MB |

Quantization scheme

scripts/quantize_gguf.py in the source repo selectively quantizes only the LM matmul weights — attention q/k/v/o, ffn gate/up/down, and lm_head — to Q8_0. Everything else (1-D conv kernels, RMSNorm scales, biases, layer-scale gammas, token embeddings, small scalars) passes through unchanged. The conv1d implementation in vibevoice.cpp casts kernels to F16 inline rather than dequantizing on the fly, so quantizing those would corrupt the convolution outputs.
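The selection rule described above can be sketched as a name-and-shape predicate. This is only an illustration: the tensor names and regular expressions below are hypothetical, and the real scripts/quantize_gguf.py may identify matmul weights differently.

```python
import re

# Hypothetical tensor-name patterns for the LM matmul weights.
# The actual names in the GGUF files may differ.
QUANTIZE_PATTERNS = [
    re.compile(r"attn_(q|k|v|o)\.weight$"),       # attention projections
    re.compile(r"ffn_(gate|up|down)\.weight$"),   # feed-forward projections
    re.compile(r"lm_head\.weight$"),              # output head
]

def should_quantize(name, n_dims):
    """Return True only for 2-D tensors whose name matches a matmul pattern."""
    if n_dims != 2:
        # Conv kernels, norm scales, biases and scalars are never 2-D matmuls.
        return False
    return any(p.search(name) for p in QUANTIZE_PATTERNS)

# Illustrative (name, number of dimensions) pairs, also hypothetical.
examples = [
    ("blk.0.attn_q.weight", 2),
    ("blk.0.ffn_down.weight", 2),
    ("lm_head.weight", 2),
    ("blk.0.attn_norm.weight", 1),
    ("token_embd.weight", 2),
    ("decoder.conv1.weight", 3),
]
for name, n_dims in examples:
    print(name, "Q8_0" if should_quantize(name, n_dims) else "unchanged")
```

Using an allow-list rather than a deny-list means any tensor the script does not recognize is left unchanged, which is the safe default given the conv1d constraint.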

Q8_0 was chosen because it can be implemented in pure Python with gguf-py, and it gives a ~60% size reduction on the 7B ASR model with no measurable quality regression in the closed-loop TTS → ASR roundtrip test.

Quickstart

```bash
git clone --recursive https://github.com/mudler/vibevoice.cpp
cd vibevoice.cpp && cmake -B build -DVIBEVOICE_BUILD_TESTS=ON && cmake --build build -j

# Pull this bundle
mkdir -p models && cd models
hf download mudler/vibevoice.cpp-models --local-dir .
cd ..

# TTS
build/bin/vibevoice-cli tts \
    --model models/vibevoice-realtime-0.5B-q8_0.gguf \
    --voice models/voice-en-Carter_man.gguf \
    --tokenizer models/tokenizer.gguf \
    --text "Hello world this is a test of the synthesis system." \
    --out hello.wav

# ASR
build/bin/vibevoice-cli asr \
    --model models/vibevoice-asr-q8_0.gguf \
    --tokenizer models/tokenizer.gguf \
    --audio hello.wav
# -> [{"Start":0,"End":2.8,"Speaker":0,"Content":"Hello world, this is a test of the synthesis system."}]
```

Closed-loop verification

The test_closed_loop ctest in vibevoice.cpp runs TTS → ASR end-to-end and asserts that source-word recall in the recovered transcript is at least 80%. With this bundle (both Q8_0 models) it passes at 10/10 (100%).
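A source-word recall check of this kind can be sketched in a few lines of Python. The exact metric used by test_closed_loop is not spelled out here, so the definition below (case-insensitive, punctuation-stripped, set membership) is an assumption, and the function names are hypothetical. The input is the JSON segment list shown in the ASR example.

```python
import json
import re

def words(text):
    """Lowercase the text and split it into words, dropping punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())

def source_word_recall(source, asr_json):
    """Return (hits, total): how many source words appear in the transcript."""
    segments = json.loads(asr_json)
    recovered = {w for seg in segments for w in words(seg["Content"])}
    src = words(source)
    hits = sum(1 for w in src if w in recovered)
    return hits, len(src)

source = "Hello world this is a test of the synthesis system."
asr_output = (
    '[{"Start":0,"End":2.8,"Speaker":0,'
    '"Content":"Hello world, this is a test of the synthesis system."}]'
)
hits, total = source_word_recall(source, asr_output)
print(f"{hits}/{total}")  # 10/10
assert total > 0 and hits / total >= 0.8  # same threshold as the ctest
```

Normalizing case and punctuation before comparing matters because the ASR model inserts punctuation (the comma after "world") that was not in the TTS input.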

License

Weights are derived from Microsoft VibeVoice (VibeVoice-Realtime-0.5B and VibeVoice-ASR); follow the upstream model licenses for use. The conversion and quantization tooling is released under the MIT license as part of vibevoice.cpp.

Model page: https://huggingface.co/mudler/vibevoice.cpp-models