detr-resnet-50

by facebook

End-to-end object detection with Transformers, no anchor boxes needed

2.3M downloads/month

961 likes

42M params

Identifiers

Model ID

facebook/detr-resnet-50

Feature URI

mixpeek://image_extractor@v1/facebook_detr_r50_v1

Overview

DETR (DEtection TRansformer) reimagines object detection as a set prediction problem, using a transformer encoder-decoder architecture to directly output a set of bounding boxes and class labels without the need for hand-designed components like anchor boxes or non-maximum suppression.

On Mixpeek, DETR extracts structured object annotations from video frames and images, producing bounding boxes with class labels that power attribute-based filtering in retrieval pipelines.

Architecture

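A ResNet-50 CNN backbone feeds a transformer with 6 encoder layers and 6 decoder layers. Training uses a bipartite matching loss (Hungarian algorithm) that assigns each ground-truth object to exactly one prediction. The decoder processes 100 object queries in parallel, so each image yields up to 100 candidate detections.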
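To make the matching step concrete, here is a minimal sketch of one-to-one assignment between predictions and ground-truth objects. It is illustrative only: the names `Prediction`, `GroundTruth`, `matchCost`, and `bruteForceMatch` are hypothetical, the cost is simplified to a class term plus an L1 box distance, and an exhaustive search stands in for the Hungarian algorithm, so it only suits tiny inputs.

```typescript
// Boxes are normalized [cx, cy, w, h]. Simplified sketch, not DETR's training code.
type Box = [number, number, number, number];

interface Prediction {
  classProbs: number[]; // probability per class
  box: Box;
}

interface GroundTruth {
  classId: number;
  box: Box;
}

// Simplified pairwise cost: negative class probability plus L1 box distance.
// (Assumption: the weighting and any extra box terms of the real loss are omitted.)
function matchCost(pred: Prediction, gt: GroundTruth): number {
  const classCost = -pred.classProbs[gt.classId];
  const boxCost = pred.box.reduce((sum, v, i) => sum + Math.abs(v - gt.box[i]), 0);
  return classCost + boxCost;
}

// Exhaustive search over assignments of ground-truth objects to distinct
// predictions. Requires preds.length >= gts.length; returns [] otherwise.
function bruteForceMatch(preds: Prediction[], gts: GroundTruth[]): number[] {
  let best: number[] = [];
  let bestCost = Infinity;
  const used = new Array<boolean>(preds.length).fill(false);
  const current: number[] = [];

  function search(gtIndex: number, costSoFar: number): void {
    if (gtIndex === gts.length) {
      if (costSoFar < bestCost) {
        bestCost = costSoFar;
        best = [...current];
      }
      return;
    }
    for (let p = 0; p < preds.length; p++) {
      if (used[p]) continue;
      used[p] = true;
      current.push(p);
      search(gtIndex + 1, costSoFar + matchCost(preds[p], gts[gtIndex]));
      current.pop();
      used[p] = false;
    }
  }

  search(0, 0);
  return best; // best[g] is the index of the prediction matched to ground truth g
}
```

Predictions left unmatched are the ones trained toward the "no object" class, which is why the model can emit a fixed number of queries without producing duplicate boxes.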

Mixpeek SDK Integration

import { Mixpeek } from "mixpeek";

const mx = new Mixpeek({ apiKey: "API_KEY" });

// Managed: create a collection over a bucket; Mixpeek runs this model's extractor
const collection = await mx.collections.create({
  namespace_id: "my-namespace",
  collection_name: "my-collection",
  source: { type: "bucket", bucket_ids: ["bkt_your_bucket"] },
  feature_extractor: {
    feature_extractor_name: "object_detection",
    version: "v1",
    parameters: { model_id: "facebook/detr-resnet-50" },
  },
});

Capabilities

91 COCO class IDs out of the box (80 object categories in use; the remaining IDs are unused placeholders)
Bounding box + class label predictions
Panoptic segmentation via an added mask head
No hand-designed post-processing (NMS-free)
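Because there is no NMS stage, turning raw model outputs into detections comes down to a softmax, a score threshold, and a box-format conversion. The sketch below assumes the usual DETR output layout: one row of class logits per object query with "no object" as the final entry, and boxes as normalized `[cx, cy, w, h]`. The names `Detection` and `postProcess` and the 0.5 default threshold are hypothetical choices for illustration, not part of the Mixpeek SDK.

```typescript
interface Detection {
  labelId: number;
  score: number;
  box: [number, number, number, number]; // [xMin, yMin, xMax, yMax] in pixels
}

// Numerically stable softmax over one row of logits.
function softmax(logits: number[]): number[] {
  const max = Math.max(...logits);
  const exps = logits.map((v) => Math.exp(v - max));
  const total = exps.reduce((a, b) => a + b, 0);
  return exps.map((v) => v / total);
}

function postProcess(
  logits: number[][], // one row per object query; last entry is "no object"
  boxes: number[][], // one [cx, cy, w, h] row per query, normalized to [0, 1]
  imageWidth: number,
  imageHeight: number,
  threshold = 0.5, // assumed default; tune per use case
): Detection[] {
  const detections: Detection[] = [];
  logits.forEach((row, q) => {
    // Drop the trailing "no object" probability before picking the best class.
    const probs = softmax(row).slice(0, -1);
    const score = Math.max(...probs);
    if (score < threshold) return;
    const labelId = probs.indexOf(score);
    const [cx, cy, w, h] = boxes[q];
    detections.push({
      labelId,
      score,
      box: [
        (cx - w / 2) * imageWidth,
        (cy - h / 2) * imageHeight,
        (cx + w / 2) * imageWidth,
        (cy + h / 2) * imageHeight,
      ],
    });
  });
  return detections;
}
```

Queries whose probability mass sits on "no object" fall below the threshold and are dropped, so the number of returned detections is usually far smaller than the 100 queries.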

Use Cases on Mixpeek

Video surveillance: detect people, vehicles, and objects in security footage

Retail analytics: count and classify products on shelves

Content moderation: identify objects for compliance filtering

Autonomous driving data: annotate frames with detected objects

Benchmarks

Dataset	Metric	Score	Source
COCO val2017	AP (box)	42.0	Carion et al., 2020 — Table 1
COCO val2017	AP50	62.4	Carion et al., 2020 — Table 1
COCO val2017	AP (small)	20.5	Carion et al., 2020 — Table 1
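
The box metrics above rest on intersection over union (IoU) between a predicted and a ground-truth box: AP50 counts a prediction as correct at IoU of at least 0.5, while AP (box) averages over IoU thresholds from 0.50 to 0.95. As a rough illustration of that building block only (not the full COCO evaluation), here is a small IoU helper; the names `XYXY` and `iou` are hypothetical.

```typescript
// A box as [xMin, yMin, xMax, yMax] in pixels.
type XYXY = [number, number, number, number];

// Intersection over union of two axis-aligned boxes; 0 when they do not overlap.
function iou(a: XYXY, b: XYXY): number {
  const interW = Math.max(0, Math.min(a[2], b[2]) - Math.max(a[0], b[0]));
  const interH = Math.max(0, Math.min(a[3], b[3]) - Math.max(a[1], b[1]));
  const inter = interW * interH;
  const areaA = (a[2] - a[0]) * (a[3] - a[1]);
  const areaB = (b[2] - b[0]) * (b[3] - b[1]);
  const union = areaA + areaB - inter;
  return union > 0 ? inter / union : 0;
}

// Two 10x10 boxes offset by half their width overlap with IoU = 50 / 150,
// which falls short of the 0.5 threshold used by AP50.
const exampleIou = iou([0, 0, 10, 10], [5, 0, 15, 10]);
const countsAtAp50 = exampleIou >= 0.5;
```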