How This Helps
Export your dataset’s metadata, labels, and media files for use in training pipelines, annotation tools, or downstream analysis. Both full and selective exports are supported.
Prerequisites
- A dataset in READY status.
- A dataset ID (visible in the browser URL when viewing a dataset: https://app.visual-layer.com/dataset/<dataset_id>/data).
- A valid JWT token. See Authentication.
Full Dataset Export
Export all media and metadata for an entire dataset. This is an asynchronous operation: initiate the export, poll for completion, then download the result.
Step 1: Initiate Export
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| file_name | string | Yes | Name for the output ZIP file. |
| export_format | string | Yes | json or parquet. |
| include_images | boolean | No | Set to true to include image files in the export. Default: false. |
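As a rough illustration of how the parameters in the table fit together, the sketch below validates them and serializes them into a query string. The helper name `build_export_query` is hypothetical, and it assumes that the parameters travel as a query string and that booleans are written as lowercase `true`/`false`.

```python
from urllib.parse import urlencode

# Formats listed in the parameters table.
ALLOWED_FORMATS = {"json", "parquet"}


def build_export_query(file_name: str, export_format: str,
                       include_images: bool = False) -> str:
    """Validate the export parameters and return them as a query string.

    Hypothetical helper: sending these as query parameters, and the
    lowercase boolean encoding, are assumptions.
    """
    if not file_name:
        raise ValueError("file_name is required")
    if export_format not in ALLOWED_FORMATS:
        raise ValueError(
            f"export_format must be one of {sorted(ALLOWED_FORMATS)}"
        )
    params = {
        "file_name": file_name,
        "export_format": export_format,
        "include_images": "true" if include_images else "false",
    }
    return urlencode(params)
```

Validating before sending keeps a typo such as `csv` from turning into a failed request.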
Example
Response
The response includes an id; you need it to poll export status.
Step 2: Poll Export Status
Status Response
Poll until status is COMPLETED or FAILED.
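The polling step can be sketched as a small loop that stops once the status is terminal. This is a minimal sketch, not the documented client: `wait_for_export` and its `fetch_status` argument are hypothetical, and `fetch_status` stands in for whatever function calls the status endpoint and returns the parsed JSON response containing a `status` field.

```python
import time
from typing import Callable, Dict

# Terminal values named in the status response description.
TERMINAL_STATUSES = {"COMPLETED", "FAILED"}


def wait_for_export(fetch_status: Callable[[], Dict],
                    interval: float = 5.0,
                    timeout: float = 600.0,
                    sleep: Callable[[float], None] = time.sleep) -> Dict:
    """Call fetch_status() until the export reaches a terminal status.

    fetch_status is a caller-supplied function (hypothetical here) that
    returns the parsed status response as a dict.
    """
    deadline = time.monotonic() + timeout
    while True:
        payload = fetch_status()
        if payload.get("status") in TERMINAL_STATUSES:
            return payload
        if time.monotonic() >= deadline:
            raise TimeoutError("export did not finish before the timeout")
        sleep(interval)
```

Check the returned `status` before downloading, since the loop also returns on FAILED.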
Step 3: Download
Use the -L flag so the download follows S3 redirects.
Selective Export
Export specific media items or entire clusters using POST /api/v1/dataset/{dataset_id}/export_entities_async. This is useful for exporting only the results of a search or filter operation.
Query Parameters
| Parameter | Type | Description |
|---|---|---|
| export_format | string | json or parquet. |
| include_images | boolean | Include image files in the export. |
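To make the endpoint and its query parameters concrete, here is a sketch that assembles the request URL. The function name `selective_export_url` is hypothetical, the base URL is left to the caller because the API host is not stated here, and the lowercase boolean encoding is an assumption.

```python
from urllib.parse import urlencode


def selective_export_url(base_url: str, dataset_id: str,
                         export_format: str,
                         include_images: bool = False) -> str:
    """Build the URL for the selective export endpoint.

    base_url is supplied by the caller; the host to use is an assumption
    that should be checked against your deployment.
    """
    if export_format not in {"json", "parquet"}:
        raise ValueError("export_format must be 'json' or 'parquet'")
    path = f"/api/v1/dataset/{dataset_id}/export_entities_async"
    query = urlencode({
        "export_format": export_format,
        "include_images": str(include_images).lower(),
    })
    return f"{base_url.rstrip('/')}{path}?{query}"
```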
Export by Media IDs
media_ids are returned in the media_id field of any Explore endpoint response (visual search, semantic search, or duplicate retrieval results).
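One way to gather IDs for a selective export is to pull the `media_id` field out of each result item. The sketch below assumes the Explore results have already been parsed into a list of dictionaries; that shape and the helper name `collect_media_ids` are assumptions.

```python
from typing import Iterable, List, Mapping


def collect_media_ids(items: Iterable[Mapping]) -> List[str]:
    """Return the unique media_id values from result items, in order.

    Assumes each item is a dict-like object; items without a media_id
    field are skipped.
    """
    seen = set()
    ordered = []
    for item in items:
        media_id = item.get("media_id")
        if media_id is not None and media_id not in seen:
            seen.add(media_id)
            ordered.append(media_id)
    return ordered
```

Deduplicating matters when results from several searches are merged before export.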
Export by Cluster
cluster_id values are returned in the cluster_id field of Explore endpoint responses and are visible in the Visual Layer UI when browsing clusters.
The response returns an id to use for polling status.
Full Automation Script
The following script handles the complete export workflow: initiate, poll, download, and extract.
Working with Exported Data
After extraction, the archive contains a metadata file (Parquet or JSON) and optionally an images/ folder.
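A quick way to see what an extracted archive contains is to look for metadata files by extension and for an images folder. This is a sketch under assumptions: the exact file names and nesting inside the archive are not specified here, so the hypothetical `inspect_export` helper searches the whole tree.

```python
from pathlib import Path
from typing import Dict

METADATA_SUFFIXES = {".parquet", ".json"}


def inspect_export(extract_dir) -> Dict:
    """Locate metadata files and the images folder in an extracted export.

    File names are not assumed; metadata is matched by extension and the
    images folder by its directory name.
    """
    root = Path(extract_dir)
    metadata_files = sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in METADATA_SUFFIXES
    )
    images_dir = next(
        (p for p in root.rglob("images") if p.is_dir()), None
    )
    return {"metadata_files": metadata_files, "images_dir": images_dir}
```

`images_dir` is `None` when the export was created without images.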
Filter by Uniqueness Score
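A minimal sketch of such a filter, assuming the metadata has been loaded as a list of dictionaries (for example with `json.load`). The key name `uniqueness_score` and the rule that higher scores mean more unique items are assumptions; check the exported file for the actual column name.

```python
from typing import Iterable, List, Mapping


def filter_by_uniqueness(records: Iterable[Mapping], threshold: float,
                         score_key: str = "uniqueness_score") -> List[Mapping]:
    """Keep records whose score is at or above the threshold.

    score_key is a hypothetical column name; records without a score
    are dropped.
    """
    return [
        record for record in records
        if record.get(score_key) is not None
        and record[score_key] >= threshold
    ]
```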
Copy Filtered Images
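Copying a filtered subset can be sketched with the standard library. The helper name `copy_images` is hypothetical, and it assumes you already have a list of image file names relative to the extracted images/ folder.

```python
import shutil
from pathlib import Path
from typing import Iterable, List


def copy_images(file_names: Iterable[str], images_dir, dest_dir) -> List[str]:
    """Copy the named images into dest_dir and return the names copied.

    Names that do not exist under images_dir are skipped rather than
    raising, so a partial export does not abort the run.
    """
    src_root = Path(images_dir)
    dest_root = Path(dest_dir)
    dest_root.mkdir(parents=True, exist_ok=True)
    copied = []
    for name in file_names:
        src = src_root / name
        if src.is_file():
            shutil.copy2(src, dest_root / src.name)
            copied.append(src.name)
    return copied
```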
Response Codes
See Error Handling for the error response format and Python handling patterns.
| HTTP Code | Meaning |
|---|---|
| 200 | Export task ID returned. |
| 401 | Unauthorized — check your JWT token. |
| 404 | Dataset not found. |
| 409 | Dataset is not in READY status. |
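The table above can be turned into a small check that runs before polling begins. This is an illustrative sketch only: `check_export_response` is a hypothetical helper, and raising `RuntimeError` for any status other than 200 is a design choice rather than documented behavior.

```python
# Meanings copied from the response codes table.
EXPORT_RESPONSE_MEANINGS = {
    200: "Export task ID returned.",
    401: "Unauthorized: check your JWT token.",
    404: "Dataset not found.",
    409: "Dataset is not in READY status.",
}


def check_export_response(status_code: int) -> str:
    """Return the meaning for 200; raise RuntimeError for anything else."""
    meaning = EXPORT_RESPONSE_MEANINGS.get(
        status_code, f"Unexpected HTTP status {status_code}."
    )
    if status_code != 200:
        raise RuntimeError(meaning)
    return meaning
```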