BiRefNet

by ZhengPeng7

High-resolution foreground segmentation for object masks and visual evidence cleanup

824Kdl/month

0.2B classparams

HuggingFace Run on your data, free

Identifiers

Model ID

ZhengPeng7/BiRefNet

Feature URI

mixpeek://image_extractor@v1/zhengpeng7_birefnet_v1

Overview

BiRefNet is the official checkpoint for Bilateral Reference for High-Resolution Dichotomous Image Segmentation. It targets foreground/background masks, salient object segmentation, and related cases where the useful evidence is an object region rather than the whole image.

On Mixpeek, BiRefNet can turn images or sampled video frames into mask metadata. Agents can use those masks to filter frames with clear foreground objects, crop objects before embedding, or remove distracting background before downstream OCR, detection, captioning, or similarity search.

Architecture

Image-segmentation model for high-resolution dichotomous segmentation. The Hugging Face card lists Transformers support through AutoModelForImageSegmentation, MIT licensing, and tags for background removal, mask generation, camouflaged object detection, and salient object detection.

Mixpeek SDK Integration

import { Mixpeek } from "mixpeek";

const mx = new Mixpeek({ apiKey: "API_KEY" });

// Managed: create a collection over a bucket; Mixpeek runs this model's extractor
const collection = await mx.collections.create({
  namespace_id: "my-namespace",
  collection_name: "my-collection",
  source: { type: "bucket", bucket_ids: ["bkt_your_bucket"] },
  feature_extractor: {
    feature_extractor_name: "segmentation",
    version: "v1",
    parameters: { model_id: "ZhengPeng7/BiRefNet" },
  },
});

Capabilities

Foreground/background mask generation
High-resolution dichotomous image segmentation
Background removal and object isolation
Useful pre-processing for embeddings, OCR, and VLM captioning
MIT license

Use Cases on Mixpeek

Index object masks for visual search and filtering

Crop foreground products before embedding or captioning

Clean screenshots and image evidence before OCR

Find frames where the foreground object occupies enough of the scene

Benchmarks

Dataset	Metric	Score	Source
Hugging Face	Monthly downloads	824K	HF model metadata, June 2026
BiRefNet task coverage	Segmentation tags	DIS, camouflaged, salient object	BiRefNet model card