11dl/month
1likes
Identifier
Model ID
Or4cl3-1/multimodal-fusion-optimizedTags
transformersMultimodal AI ModelmergemergekitlazymergekitOpenAI/CLIPOr4cl3-1/cognitive-agent-xtts-optimizeddocument-question-answeringenbase_model:Or4cl3-1/cognitive-agent-xtts-optimizedbase_model:finetune:Or4cl3-1/cognitive-agent-xtts-optimizedlicense:apache-2.0endpoints_compatibleregion:us
Use multimodal-fusion-optimized on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
OrganizationOr4cl3-1
TaskDocument Question Answering
Librarytransformers
Licenseapache-2.0
Downloads/mo11
Likes1
View on HuggingFace
See model card, files, and community discussion
Related Document Question Answering Models
impira/layoutlm-document-qa
85K
naver-clova-ix/donut-base-finetuned-docvqa
14K
fxmarty/tiny-doc-qa-vision-encoder-decoder
3K
impira/layoutlm-invoices
1K
tiennvcs/layoutlmv2-base-uncased-finetuned-docvqa
603
zpm/Llama-3.1-PersianQA
367
Xenova/donut-base-finetuned-docvqa
204
AntonioTH/Layout-finetuned-fr-model-50instances20-100epochs-5e-05lr
188