Get Object
This endpoint retrieves an object by its ID from the specified bucket.
Presigned URLs: Set return_presigned_urls=true query parameter to generate fresh presigned download URLs
for all blobs with S3 storage (default: false). URLs are added to each blob’s properties as
presigned_url and expire after 1 hour.
Document count: document_count (how many documents this object produced, via vector-store
lineage) is computed by default. It fans out to the vector partition, which on a cold/serverless
partition can be slow — pass include_document_count=false to skip it for latency-sensitive,
interactive views (e.g. an object-detail modal) that don’t render the count. Even when requested,
the count is best-effort and bounded by a deadline: it returns null rather than blocking the
response if the partition is cold.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Path Parameters
The unique identifier of the bucket.
The unique identifier of the object.
Query Parameters
Generate fresh presigned download URLs for all blobs with S3 storage
Compute document_count via vector-store lineage (default true). Pass false to skip the vector fan-out for latency-sensitive views that don't render the count.
Response
Successful Response
Response model for bucket objects.
ID of the bucket this object belongs to
Unique identifier for the object
Storage key/path of the object, this will be used to retrieve the object from the storage. It is similar to a file path. If not provided, it will be placed in the root of the bucket.
List of blobs contained in this object
Lineage/source details for this object; used for downstream references.
The current status of the object.
PENDING, QUEUED, IN_PROGRESS, PROCESSING, COMPLETED, COMPLETED_WITH_ERRORS, FAILED, CANCELED, INTERRUPTED, UNKNOWN, SKIPPED, DRAFT, ACTIVE, ARCHIVED, SUSPENDED The error message if the object failed to process.
"Failed to process object: Object not found"
Timestamp when the object was created. Automatically populated by the system.
Timestamp when the object was last updated. Automatically populated by the system.
Number of documents produced from this object across all collections. Populated on GET requests. Null on list responses (expensive query). Use this to check if an object has already been processed.
When and how this object becomes a searchable document. Set on write (create) responses: managed ingestion is visible only after a collection batch processes the object — poll until document_count > 0.

