
    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    550 models available

    Showing 169–192 of 550 models

    Image Text To Text

    google/translategemma-4b-it

    220K
    782
    transformers
    Image Text To Text

    google/paligemma-3b-mix-224

    219K
    100
    transformers
    Image Text To Text

    google/paligemma-3b-ft-cococap-448

    219K
    3
    transformers
    Image Text To Text

    ChantalPellegrini/RaDialog-interactive-radiology-report-generation

    217K
    14
    transformers
    Image Text To Text

    ibm-granite/granite-vision-4.1-4b

    214K
    87
    transformers
    Image Text To Text

    cyankiwi/Qwen3.5-122B-A10B-AWQ-4bit

    210K
    37
    transformers
    Image Text To Text

    nvidia/NVIDIA-Nemotron-Parse-v1.2

    208K
    39
    transformers
    Image Text To Text

    trl-internal-testing/tiny-Qwen3_5MoeForConditionalGeneration-3.6

    208K
    transformers
    Image Text To Text

    unsloth/Qwen3.6-35B-A3B-NVFP4

    207K
    34
    Image Text To Text

    Qwen/Qwen3.5-4B-Base

    206K
    67
    transformers
    Image Text To Text

    palmfuture/Qwen3.6-35B-A3B-GPTQ-Int4

    205K
    20
    transformers
    Image Text To Text

    baidu/Qianfan-OCR

    204K
    1,177
    transformers
    Image Text To Text

    YannQi/R-4B

    204K
    183
    transformers
    Image Text To Text

    lmstudio-community/Qwen3.6-27B-MLX-8bit

    204K
    1
    transformers
    Image Text To Text

    baidu/ERNIE-4.5-VL-28B-A3B-PT

    202K
    103
    transformers
    Image Text To Text

    Infomaniak-AI/vllm-translategemma-4b-it

    201K
    13
    transformers
    Image Text To Text

    Qwen/Qwen3.5-0.8B-Base

    198K
    78
    transformers
    Image Text To Text

    HauhauCS/Gemma4-26B-A4B-Uncensored-HauhauCS-Balanced

    195K
    147
    Image Text To Text

    unsloth/Qwen3.5-9B-MTP-GGUF

    192K
    84
    transformers
    Image Text To Text

    Open-Bee/Bee-8B-RL

    192K
    79
    transformers
    Image Text To Text

    unsloth/Qwen2.5-VL-7B-Instruct-GGUF

    191K
    179
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-4B-Thinking

    191K
    111
    transformers
    Image Text To Text

    RedHatAI/gemma-4-31B-it-NVFP4

    188K
    49
    transformers
    Image Text To Text

    MiniMaxAI/MiniMax-VL-01

    187K
    285