    Visual Question Answering, transformers, gemma

    ViLaH

    by BhashaAI

    Identifier
    Model ID
    BhashaAI/ViLaH

    Tags

    transformers, safetensors, paligemma, image-text-to-text, visual-question-answering, Bilingual, en, hi, dataset:damerajee/clean_hin_vqa, license:gemma, text-generation-inference, region:us

    Use ViLaH on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
