NEWVectors or files. Pick a path.Start →
    Models/Image To Text/noamrot/FuseCap_Image_Captioning
    Image To Texttransformersmit

    FuseCap_Image_Captioning

    by noamrot

    864dl/month
    23likes
    Identifier
    Model ID
    noamrot/FuseCap_Image_Captioning

    Tags

    transformerspytorchblipimage-text-to-textimage-captioningimage-to-textarxiv:2305.17718license:mitregion:us

    Use FuseCap_Image_Captioning on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.

    Open Studio

    How It Runs on Mixpeek

    On Mixpeek, FuseCap_Image_Captioning runs as a managed extractor inside a processing pipeline. Point a bucket of image to text data at it, and Mixpeek handles GPU provisioning, batching, retries, and writing the outputs into a vector store you can query.

    Extractor outputs land in the Mixpeek Vector Store (MVS), where you can combine them with retrieval, reranking, and filter stages to build end-to-end search and agent-perception pipelines, no model-serving infrastructure to maintain.