NEWVectors or files. Pick a path.Start →

    Text To Audio Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    364 models available

    Showing 193216 of 364 models

    Text To Audio

    Mohira/tts_turkish_dataset

    72
    transformers
    Text To Audio

    marcorez8/acestep-v15-xl-base-bf16

    70
    1
    transformers
    Text To Audio

    lichang0928/QA-MDT

    70
    14
    diffusers
    Text To Audio

    forkjoin-ai/qwen3-tts-12hz-0.6b-base

    69
    llama-cpp
    Text To Audio

    michjosh/speecht5-hausa-tts

    68
    transformers
    Text To Audio

    aufklarer/Stable-Audio-3-DiT-Medium-MLX-8bit

    68
    mlx
    Text To Audio

    mlx-community/SongGeneration-v2-medium-4bit

    68
    mlx
    Text To Audio

    piyazon/TTS-CV-Unique-Ug-2

    67
    4
    transformers
    Text To Audio

    Dalision/Omni2Sound

    67
    5
    Text To Audio

    LeeAeron/VibeVoice-Large-Q8

    67
    1
    transformers
    Text To Audio

    jongwooko/Flex-Omni-7B

    66
    2
    transformers
    Text To Audio

    rhymeswithlion/MIDI-LLM_Llama-3.2-1B-Q8_0-GGUF

    66
    1
    transformers
    Text To Audio

    olawale-ahmed/pidgin_speecht5_tts_anonxx_pidgin_dataset

    66
    transformers
    Text To Audio

    magenta-community/magenta-realtime-2-small

    66
    1
    transformers
    Text To Audio

    forkjoin-ai/vibevoice-1.5b

    65
    llama-cpp
    Text To Audio

    zhangj1an/AudioX

    65
    diffusers
    Text To Audio

    rnjema-unima/waxal-mms-tts-lug

    65
    transformers
    Text To Audio

    mariammohamed00/speecht5_finetuned

    65
    1
    transformers
    Text To Audio

    olawale-ahmed/pidgin_speecht5_tts_nigerian-pidgin-1.0

    65
    transformers
    Text To Audio

    AXERA-TECH/kokoro.axera

    63
    Text To Audio

    voces-ai/tts-rap-male-v3

    63
    transformers
    Text To Audio

    JBZhang2342/speecht5_tts

    62
    transformers
    Text To Audio

    mlx-community/SongGeneration-v2-medium-bf16

    62
    mlx
    Text To Audio

    schalor/speecht5_finetuned

    61
    transformers
    9 / 16