- Combined tags, e.g. `[singing] [happy] ...` or `[singing] [sad] ...`
- Original OmniVoice capabilities (multilingual zero-shot TTS, voice cloning, voice design, 600+ languages) are preserved: the base speech head was protected during finetuning with a continuity mix of plain speech and singing.
## Drop-in replacement
This checkpoint is fully compatible with the upstream `k2-fsa/OmniVoice` code: same architecture (Qwen3-0.6B LM + HiggsAudioV2 audio tokenizer at 24 kHz), same inference API. Just replace the model id:
```python
import soundfile as sf

from omnivoice.models.omnivoice import OmniVoice

model = OmniVoice.from_pretrained("ModelsLab/omnivoice-singing").to("cuda").eval()

# Normal speech (unchanged behavior)
audios = model.generate(
    text="The quick brown fox jumps over the lazy dog.",
    language="English",
)

# Singing
audios = model.generate(
    text="[singing] Twinkle twinkle little star, how I wonder what you are.",
    language="English",
)

# Emotional speech
audios = model.generate(
    text="[happy] I just got the best news of my entire year!",
    language="English",
)

# Combined
audios = model.generate(
    text="[singing] [sad] Quiet rain falls on the stone, memories of days now gone.",
    language="English",
)

sf.write("out.wav", audios[0], model.sampling_rate)
```
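Voice cloning from the base model is preserved as well. The sketch below is hypothetical: `ref_audio` is an assumed keyword, and the actual argument name for zero-shot cloning should be checked against the upstream `generate` signature:

```python
# Hypothetical sketch: clone the voice in speaker.wav while applying a tag.
# `ref_audio` is an ASSUMED parameter name, not confirmed by this model card;
# consult the upstream k2-fsa/OmniVoice API for the real cloning argument.
audios = model.generate(
    text="[happy] What a beautiful morning this turned out to be!",
    language="English",
    ref_audio="speaker.wav",  # assumption: path to a short reference clip
)
```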
The CLI works the same way:
```bash
omnivoice-infer --model ModelsLab/omnivoice-singing \
  --text "[happy] Hello there, how wonderful to see you today!" \
  --language English \
  --output out.wav
```
## Supported tags

| Tag | Source data | Strength |
|---|---|---|
| `[singing]` | GTSinger English (6,755 clips, ~8 h) | strong |
| `[happy]` | CREMA-D + RAVDESS + Expresso (~2,900 clips) | strong |
| `[sad]` | CREMA-D + RAVDESS + Expresso (~2,900 clips) | strong |
| `[angry]` | CREMA-D + RAVDESS (~1,500 clips) | strong |
| `[nervous]` | CREMA-D fear + RAVDESS fearful (~1,400 clips) | strong |
| `[whisper]` | Expresso whisper (~1,500 clips) | strong |
| `[calm]` | RAVDESS calm (~190 clips) | weak (limited data) |
| `[excited]` | RAVDESS surprised (~190 clips) | weak (limited data) |
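A quick way to audition every tag is to loop over the table above with the same `generate` call used earlier; the prompt text and output file naming here are illustrative only:

```python
import soundfile as sf

# One sample per supported tag; the prompt sentence is arbitrary.
tags = ["singing", "happy", "sad", "angry", "nervous", "whisper", "calm", "excited"]
for tag in tags:
    audios = model.generate(
        text=f"[{tag}] The stars are out tonight and the city is quiet.",
        language="English",
    )
    sf.write(f"sample_{tag}.wav", audios[0], model.sampling_rate)
```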
A guidance scale of 3.0 (up from the default 2.0) is recommended to make tag behavior more pronounced.
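A sketch of how that might look, assuming the scale is exposed as a `guidance_scale` keyword on `generate` (the actual argument name should be verified against the upstream signature):

```python
# Assumption: `guidance_scale` is the keyword for the guidance strength.
audios = model.generate(
    text="[singing] [happy] Morning light is breaking over the hills today.",
    language="English",
    guidance_scale=3.0,  # default is 2.0; 3.0 makes tag behavior stronger
)
```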
Best eval loss was 4.72 (step 750); the final eval loss was 4.88 (step 2500). The published checkpoint is the final emotion step 2500, which, despite the higher loss, subjectively produces the cleanest emotional tag behavior while preserving speech and singing quality.
## Known limitations
- `[calm]` and `[excited]` had only ~190 training samples each (only one dataset contributed), so their behavior is weaker than that of the other emotion tags.
- Cross-language singing (sung Hindi, Gujarati, etc.) is extrapolation: it works, but quality varies.
- Like the base model, output quality is bounded by the HiggsAudioV2 tokenizer (24 kHz, ~2 kbps, speech-domain tuned). Music and drum content is not supported by design.
## License

Apache 2.0. Downstream users must also comply with the individual licenses of the training datasets (GTSinger, CREMA-D, RAVDESS, Expresso).