SenseNova-U1-8B-MoT

by sensenova

8B any-to-any multimodal model for image understanding, generation, and editing

32.7Kdl/month

281likes

8Bparams

HuggingFace Run on your data, free

Identifiers

Model ID

sensenova/SenseNova-U1-8B-MoT

Feature URI

mixpeek://image_extractor@v1/sensenova_u1_8b_mot_v1

Overview

SenseNova-U1-8B-MoT is an any-to-any multimodal model tagged for feature extraction, image-to-text, text-to-image, image editing, and custom-code inference. That mix matters for agents because perception is often not a single captioning call: an agent may need to inspect an image, generate an explanation, propose an edit, and preserve evidence of what changed.

On Mixpeek, SenseNova U1 fits pipelines that retrieve visual evidence first, then ask a multimodal model to explain or transform that evidence. It is especially relevant for creative QA, ad review, product imagery, and human-in-the-loop visual analysis.

Architecture

8B-class mixture-of-transformers style any-to-any multimodal model. Supports image-to-text, text-to-image, image editing, and feature extraction paths according to the Hugging Face model metadata.

Mixpeek SDK Integration

import { Mixpeek } from "mixpeek";

const mx = new Mixpeek({ apiKey: "API_KEY" });

// Managed: create a collection over a bucket; Mixpeek runs this model's extractor
const collection = await mx.collections.create({
  namespace_id: "my-namespace",
  collection_name: "my-collection",
  source: { type: "bucket", bucket_ids: ["bkt_your_bucket"] },
  feature_extractor: {
    feature_extractor_name: "s3",
    version: "v1",
    parameters: { model_id: "mixpeek://image_extractor@v1/sensenova_u1_8b_mot_v1" },
  },
});