Apr 23, 2026

Multiple datasets for Actors

Actors can now output to multiple datasets, each with its own schema and validation rules. Keep your data clean, structured, and purpose-built.

Use case

Website Content Crawler uses multiple datasets to store details about URLs it failed to scrape, making it easier to debug crawl issues without digging through the main output.

How it works

Define datasets in actor.json under storages.datasets
Each key becomes an alias that refers to that dataset within the run
A default alias is required; all others are optional
Each dataset automatically follows the run’s data retention policy
Reference datasets in your output schema via template variables, e.g. {{storages.datasets.categories.apiUrl}}

Access datasets

JS SDK ≥3.7.0: await Actor.openDataset({alias: 'categories'})
Python SDK ≥3.3.0: await Actor.open_dataset(alias='categories')
Without SDK: Read dataset IDs from the ACTOR_STORAGES_JSON environment variable

For more information on multiple datasets support, see the docs .

Patrik Braborec

Product Manager