NEWVectors or files. Pick a path.Start →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    13,634 models available

    Showing 62416264 of 13,634 models

    Translation

    Helsinki-NLP/opus-mt-en-da

    2K
    3
    transformers
    Feature Extraction

    Synthyra/ESMFold2-Fast

    2K
    transformers
    Text To Speech

    cstr/vibevoice-realtime-0.5b-GGUF

    2K
    3
    ggml
    Feature Extraction

    reasonir/ReasonIR-8B

    2K
    56
    transformers
    Image Feature Extraction

    timm/vit_base_patch16_clip_224.laion2b

    2K
    1
    timm
    Text To Speech

    cstr/kokoro-voices-GGUF

    2K
    1
    ggml
    Audio To Audio

    mispeech/dashengtokenizer

    2K
    12
    transformers
    Zero Shot Image Classification

    laion/CLIP-convnext_base_w_320-laion_aesthetic-s13B-b82K-augreg

    2K
    5
    open_clip
    Text To Speech

    calcuis/chatterbox-gguf

    2K
    54
    Text To Video

    wanabmeya/clip_vision_h.safetensors

    2K
    5
    Image To Image

    InstantX/Qwen-Image-ControlNet-Inpainting

    2K
    115
    diffusers
    Zero Shot Image Classification

    flax-community/clip-rsicd-v2

    2K
    26
    transformers
    Image Segmentation

    openmmlab/upernet-convnext-tiny

    2K
    3
    transformers
    Image To Text

    breezedeus/pix2text-mfr-1.5

    2K
    1
    transformers
    Feature Extraction

    jinaai/jina-embeddings-v5-text-small-retrieval-mlx

    2K
    3
    mlx
    Image Feature Extraction

    theaiinstitute/theia-base-patch16-224-cdiv

    2K
    9
    transformers
    Voice Activity Detection

    BUT-FIT/diarizen-wavlm-large-s80-md-v2

    2K
    17
    transformers
    Image To Text

    PaddlePaddle/PP-OCRv6_small_rec

    2K
    14
    PaddleOCR
    Feature Extraction

    ToolathlonBot/MyAwesomeModel-TestRepo

    2K
    transformers
    Image To Text

    naver-clova-ix/donut-base-finetuned-rvlcdip

    2K
    20
    transformers
    Text To Speech

    OpenMOSS-Team/MOSS-TTSD-v0.5

    2K
    54
    Audio Classification

    griko/gender_cls_svm_ecapa_voxceleb

    2K
    Audio Classification

    7wolf/wav2vec2-base-gender-classification

    2K
    1
    transformers
    Feature Extraction

    mradermacher/Octen-Embedding-0.6B-i1-GGUF

    2K
    transformers
    261 / 569