MaskGCT huggingface.co api & amphion MaskGCT github AI Model

Introduction of MaskGCT

Model Details of MaskGCT

MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer

Quickstart

Clone and install

git clone https://github.com/open-mmlab/Amphion.git
# create env
bash ./models/tts/maskgct/env.sh

Model download

We provide the following pretrained checkpoints:

Model Name	Description
Acoustic Codec	Converting speech to semantic tokens.
Semantic Codec	Converting speech to acoustic tokens and reconstructing waveform from acoustic tokens.
MaskGCT-T2S	Predicting semantic tokens with text and prompt semantic tokens.
MaskGCT-S2A	Predicts acoustic tokens conditioned on semantic tokens.

You can download all pretrained checkpoints from HuggingFace or use huggingface api.

from huggingface_hub import hf_hub_download

# download semantic codec ckpt
semantic_code_ckpt = hf_hub_download("amphion/MaskGCT" filename="semantic_codec/model.safetensors")

# download acoustic codec ckpt
codec_encoder_ckpt = hf_hub_download("amphion/MaskGCT", filename="acoustic_codec/model.safetensors")
codec_decoder_ckpt = hf_hub_download("amphion/MaskGCT", filename="acoustic_codec/model_1.safetensors")

# download t2s model ckpt
t2s_model_ckpt = hf_hub_download("amphion/MaskGCT", filename="t2s_model/model.safetensors")

# download s2a model ckpt
s2a_1layer_ckpt = hf_hub_download("amphion/MaskGCT", filename="s2a_model/s2a_model_1layer/model.safetensors")
s2a_full_ckpt = hf_hub_download("amphion/MaskGCT", filename="s2a_model/s2a_model_full/model.safetensors")

Basic Usage

You can use the following code to generate speech from text and a prompt speech.

from models.tts.maskgct.maskgct_utils import *
from huggingface_hub import hf_hub_download
import safetensors
import soundfile as sf

if __name__ == "__main__":

    # build model
    device = torch.device("cuda:0")
    cfg_path = "./models/tts/maskgct/config/maskgct.json"
    cfg = load_config(cfg_path)
    # 1. build semantic model (w2v-bert-2.0)
    semantic_model, semantic_mean, semantic_std = build_semantic_model(device)
    # 2. build semantic codec
    semantic_codec = build_semantic_codec(cfg.model.semantic_codec, device)
    # 3. build acoustic codec
    codec_encoder, codec_decoder = build_acoustic_codec(cfg.model.acoustic_codec, device)
    # 4. build t2s model
    t2s_model = build_t2s_model(cfg.model.t2s_model, device)
    # 5. build s2a model
    s2a_model_1layer = build_s2a_model(cfg.model.s2a_model.s2a_1layer, device)
    s2a_model_full =  build_s2a_model(cfg.model.s2a_model.s2a_full, device)

    # download checkpoint
    ...

    # load semantic codec
    safetensors.torch.load_model(semantic_codec, semantic_code_ckpt)
    # load acoustic codec
    safetensors.torch.load_model(codec_encoder, codec_encoder_ckpt)
    safetensors.torch.load_model(codec_decoder, codec_decoder_ckpt)
    # load t2s model
    safetensors.torch.load_model(t2s_model, t2s_model_ckpt)
    # load s2a model
    safetensors.torch.load_model(s2a_model_1layer, s2a_1layer_ckpt)
    safetensors.torch.load_model(s2a_model_full, s2a_full_ckpt)

    # inference
    prompt_wav_path = "./models/tts/maskgct/wav/prompt.wav"
    save_path = "[YOUR SAVE PATH]"
    prompt_text = " We do not break. We never give in. We never back down."
    target_text = "In this paper, we introduce MaskGCT, a fully non-autoregressive TTS model that eliminates the need for explicit alignment information between text and speech supervision."
    # Specify the target duration (in seconds). If target_len = None, we use a simple rule to predict the target duration.
    target_len = 18
    recovered_audio = maskgct_inference(prompt_wav_path, prompt_text, target_text, "en", "en", target_len=target_len)
    sf.write(save_path, recovered_audio, 24000)

Runs of amphion MaskGCT on huggingface.co

700

Total runs

24-hour runs

3-day runs

7-day runs

-4

30-day runs

More Information About MaskGCT huggingface.co Model

More MaskGCT license Visit here:

https://choosealicense.com/licenses/cc-by-nc-4.0

MaskGCT huggingface.co

MaskGCT huggingface.co is an AI model on huggingface.co that provides MaskGCT's model effect (), which can be used instantly with this amphion MaskGCT model. huggingface.co supports a free trial of the MaskGCT model, and also provides paid use of the MaskGCT. Support call MaskGCT model through api, including Node.js, Python, http.

MaskGCT huggingface.co Url

https://huggingface.co/amphion/MaskGCT

amphion MaskGCT online free

MaskGCT huggingface.co is an online trial and call api platform, which integrates MaskGCT's modeling effects, including api services, and provides a free online trial of MaskGCT, you can try MaskGCT online for free by clicking the link below.

amphion MaskGCT online free url in huggingface.co:

https://huggingface.co/amphion/MaskGCT

MaskGCT install

MaskGCT is an open source model from GitHub that offers a free installation service, and any user can find MaskGCT on GitHub to install. At the same time, huggingface.co provides the effect of MaskGCT install, users can directly use MaskGCT installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

MaskGCT install url in huggingface.co:

https://huggingface.co/amphion/MaskGCT

huggingface.co

amphion/Vevo

Total runs: 71

Run Growth: 9

Growth Rate: 13.43%

Updated:4월 13 2025

huggingface.co

amphion/Metis

Total runs: 45

Run Growth: 26

Growth Rate: 59.09%

Updated:4월 13 2025

huggingface.co

amphion/TaDiCodec-TTS-MGM

Total runs: 38

Run Growth: 32

Growth Rate: 84.21%

Updated:9월 02 2025

huggingface.co

amphion/TaDiCodec

Total runs: 38

Run Growth: 20

Growth Rate: 55.56%

Updated:9월 02 2025

huggingface.co

amphion/anyaccomp

Total runs: 26

Run Growth: 14

Growth Rate: 56.00%

Updated:12월 22 2025

huggingface.co

amphion/TaDiCodec-TTS-AR-Qwen2.5-0.5B

Total runs: 25

Run Growth: 17

Growth Rate: 70.83%

Updated:9월 02 2025

huggingface.co

amphion/Vevo1.5

Total runs: 19

Run Growth: -6

Growth Rate: -31.58%

Updated:4월 13 2025

huggingface.co

amphion/TaDiCodec-TTS-AR-Qwen2.5-3B

Total runs: 8

Run Growth: -4

Growth Rate: -50.00%

Updated:8월 27 2025

huggingface.co

amphion/valle

Total runs: 4

Run Growth: 0

Growth Rate: 0.00%

Updated:7월 19 2024

huggingface.co

amphion/anyenhance

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:1월 14 2025

huggingface.co

amphion/vits_hifitts

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:2월 24 2024

huggingface.co

amphion/deepfake_detection

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:1월 04 2025

huggingface.co

amphion/dualcodec

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:10월 14 2025

huggingface.co

amphion/INTP

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:9월 09 2025

huggingface.co

amphion/naturalspeech3_facodec

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:3월 13 2024

huggingface.co

amphion/diffwave

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:12월 21 2023

huggingface.co

amphion/VevoSing

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:4월 10 2025

huggingface.co

amphion/hifigan_ljspeech

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:2월 24 2024

huggingface.co

amphion/fastspeech2_ljspeech

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:2월 24 2024

huggingface.co

amphion/text_to_audio

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:12월 18 2023

huggingface.co

amphion/Ints

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:5월 18 2025

huggingface.co

amphion/valle_libritts

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:1월 11 2024

huggingface.co

amphion/naturalspeech2_libritts

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:12월 19 2023

huggingface.co

amphion/valle_librilight_6k

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:1월 24 2024

huggingface.co

amphion/BigVGAN_singing_bigdata

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:12월 21 2023

huggingface.co

amphion/singing_voice_conversion

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:12월 21 2023

huggingface.co

amphion/vits_ljspeech

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:2월 24 2024

huggingface.co

amphion/hifigan_speech_bigdata

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:12월 21 2023

huggingface.co

amphion/Vevo2

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:9월 08 2025

huggingface.co

amphion/dualcodec-tts

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:6월 03 2025

amphion / MaskGCT

Introduction of MaskGCT

Model Details of MaskGCT

MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer

Quickstart

Runs of amphion MaskGCT on huggingface.co

More Information About MaskGCT huggingface.co Model

More MaskGCT license Visit here:

MaskGCT huggingface.co

MaskGCT huggingface.co Url

amphion MaskGCT online free

amphion MaskGCT online free url in huggingface.co:

MaskGCT install

MaskGCT install url in huggingface.co:

Url of MaskGCT

MaskGCT huggingface.co Url

Provider of MaskGCT huggingface.co

Other API from amphion