
    Feature Extractors

    After your data is connected, extractors run in parallel to pull out structured features, embeddings, entities, transcripts, and more.

    102 extractors available

    Facial Recognition

    Detect and identify faces in images with high accuracy

    Image

    Video Embedding

    Generate vector embeddings for video content

    Video

    Web Scraper

    Extract structured data from webpages while maintaining semantic context and relationships

    Text

    Text Embedding

    Extract semantic embeddings from documents, transcripts and text content

    Text

    Image Embedding

    Generate visual embeddings for similarity search and clustering

    Image

    Audio Embedding

    Extract semantic embeddings from audio content for similarity search

    Audio

    Multimodal Extractor

    Unified embeddings for video, audio, image, and text — scene/silence chunking, Whisper transcription, thumbnails, and Gemini vision.

    Multimodal

    Universal Extractor

    All-in-one extractor for image, video, audio, and documents — auto-detects modality and applies the right pipeline.

    Multimodal

    Gemini Multifile Extractor

    Embed all files of an object (images, PDFs, video, audio, text) into a single 3072-D Gemini vector.

    Multimodal

    Document Graph Extractor

    Decompose PDFs into spatial blocks — paragraphs, tables, forms, headers — with layout classification and E5 text embeddings.

    Document

    Passthrough Extractor

    Store and canonicalize objects with zero ML — metadata-only ingestion for bucket/object modeling without embeddings.

    Utility

    Scrolling Text Extractor

    Read scrolling/marquee video text via phase-correlation band detection, panoramic stitching, and VLM OCR.

    Video
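Several of the embedding extractors above mention similarity search. As a rough illustration of what that means once vectors exist, here is a minimal sketch in plain Python. It does not use or represent this platform's API: the function names `cosine_similarity` and `top_k`, the file names, and the tiny 3-dimensional vectors are all hypothetical, chosen only to keep the example readable. Real embeddings would have far more dimensions (for example, the 3072-D vectors mentioned for the Gemini Multifile Extractor).

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query, index, k=2):
    """Return the k (name, score) pairs most similar to the query, best first."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy "index": object name -> embedding vector (made-up values).
index = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "dog.jpg": [0.8, 0.3, 0.1],
    "invoice.pdf": [0.0, 0.2, 0.9],
}

# Hypothetical query embedding; in practice it would come from the same extractor.
results = top_k([1.0, 0.0, 0.0], index, k=2)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

This brute-force scan compares the query against every stored vector, which is fine for a handful of objects. Larger collections typically rely on an approximate nearest-neighbor index instead.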