    Audio To Audio Models

    Browse AI models for multimodal decomposition and recomposition pipelines, and plug any model into your extractors.

    100 models available

    speechbrain/sepformer-whamr-enhancement

    204
    14
    speechbrain
    Audio To Audio

    lucadellalib/focalcodec_50hz_2k_causal

    187
    torch
    Audio To Audio

    khimaros/Qwen3-TTS-Tokenizer-12Hz-GGUF

    187
    Audio To Audio

    nvidia/bigvgan_base_22khz_80band

    172
    PyTorch
    Audio To Audio

    HiDolen/Mini-BS-RoFormer-18M

    156
    4
    transformers
    Audio To Audio

    AXERA-TECH/Speech-Translation.axera

    156
    1
    Audio To Audio

    Mungert/LFM2.5-Audio-1.5B-GGUF

    151
    liquid-audio
    Audio To Audio

    speechbrain/sepformer-wham

    146
    46
    speechbrain
    Audio To Audio

    weya-ai/hush

    143
    20
    hush
    Audio To Audio

    JorisCos/ConvTasNet_Libri3Mix_sepnoisy_8k

    140
    2
    asteroid
    Audio To Audio

    lucadellalib/focalcodec_50hz_4k_causal

    136
    torch
    Audio To Audio

    lucadellalib/dycast

    135
    4
    torch
    Audio To Audio

    JorisCos/ConvTasNet_Libri3Mix_sepclean_8k

    134
    asteroid
    Audio To Audio

    JorisCos/ConvTasNet_Libri2Mix_sepnoisy_8k

    133
    1
    asteroid
    Audio To Audio

    mispeech/dasheng-denoiser

    132
    3
    transformers
    Audio To Audio

    mlx-community/sam-audio-large

    126
    6
    mlx-audio
    Audio To Audio

    AEmotionStudio/sam-audio-models

    126
    4
    Audio To Audio

    maitrix-org/Voila-autonomous-preview

    122
    17
    transformers
    Audio To Audio

    Ademola265/Qwen3-TTS-Tokenizer-12Hz

    116
    Audio To Audio

    MansfieldPlumbing/Demucs_v4_TRT

    116
    2
    tensorrt
    Audio To Audio

    YatharthS/NovaSR

    115
    83
    Audio To Audio

    line-corporation/open-universe

    113
    3
    Audio To Audio

    speechbrain/sepformer-wham-enhancement

    111
    33
    speechbrain
    Audio To Audio

    kyutai/moshika-rag-candle-bf16

    111
    5
    moshi