135Kdl/month
1,587likes
Identifier
Model ID
meta-llama/Llama-3.2-11B-Vision-InstructTags
transformerssafetensorsmllamaimage-text-to-textfacebookmetapytorchllamallama-3conversationalendefritpthiestharxiv:2204.05149license:llama3.2text-generation-inferenceendpoints_compatibleregion:us
Use Llama-3.2-11B-Vision-Instruct on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationmeta-llama
TaskImage Text To Text
Librarytransformers
Licensellama3.2
Downloads/mo135K
Likes1,587
View on HuggingFace
See model card, files, and community discussion