5Kdl/month
28likes
Identifier
Model ID
Xenova/vit-gpt2-image-captioningTags
transformers.jsonnxvision-encoder-decoderimage-text-to-textimage-captioningimage-to-textbase_model:nlpconnect/vit-gpt2-image-captioningbase_model:quantized:nlpconnect/vit-gpt2-image-captioningregion:us
Use vit-gpt2-image-captioning on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
OrganizationXenova
TaskImage To Text
Librarytransformers.js
Downloads/mo5K
Likes28
View on HuggingFace
See model card, files, and community discussion