
    Text To Audio Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    200 models available



    Marvis-AI/marvis-tts-250m-v0.1-transformers

    625
    22
    transformers
    Text To Audio

    forkjoin-ai/vibevoice-1.5b

    615
    llama-cpp
    Text To Audio

    declare-lab/TangoFlux

    610
    106
    Text To Audio

    forkjoin-ai/qwen3-tts-12hz-1.7b-voicedesign

    609
    llama-cpp
    Text To Audio

    HKUSTAudio/AudioX-MAF

    567
    8
    Text To Audio

    yuhuacheng/clap-musicgen

    561
    Text To Audio

    ACE-Step/acestep-v15-turbo-shift1

    530
    14
    transformers
    Text To Audio

    schalor/speecht5_finetuned

    530
    transformers
    Text To Audio

    nateraw/musicgen-songstarter-v0.2

    527
    170
    audiocraft
    Text To Audio

    forkjoin-ai/vibevoice-realtime-0.5b

    504
    llama-cpp
    Text To Audio

    tencent/SongGeneration

    490
    337
    tencent-song-generation
    Text To Audio

    echarlaix/tiny-random-vits

    480
    transformers
    Text To Audio

    JayLL13/VoxCPM-1.5-VN

    471
    18
    Text To Audio

    facebook/magnet-medium-10secs

    444
    9
    audiocraft
    Text To Audio

    facebook/magnet-small-10secs

    440
    25
    audiocraft
    Text To Audio

    Marvis-AI/marvis-tts-250m-v0.2-MLX-8bit

    427
    4
    transformers
    Text To Audio

    HKUSTAudio/AudioX-MAF-MMDiT

    373
    9
    Text To Audio

    ford442/stable-audio-open-1.0

    365
    stable-audio-tools
    Text To Audio

    ylacombe/musicgen-stereo-melody

    349
    transformers
    Text To Audio

    forkjoin-ai/qwen3-tts-12hz-0.6b-customvoice

    332
    llama-cpp
    Text To Audio

    tencent/HunyuanVideo-Foley

    330
    163
    hunyuanvideo-foley
    Text To Audio

    forkjoin-ai/qwen2-audio-7b-instruct-gguf

    330
    llama-cpp
    Text To Audio

    ACE-Step/ACE-Step-v1-chinese-rap-LoRA

    316
    34
    diffusers
    Text To Audio

    mustafoyev202/speecht5_finetuned

    311
    1
    transformers