Apr 23, 2026
Multiple datasets for Actors
New
Actor
Actors can now output to multiple datasets, each with its own schema and validation rules. Keep your data clean, structured, and purpose-built.
Website Content Crawler uses multiple datasets to store details about URLs it failed to scrape, making it easier to debug crawl issues without digging through the main output.
- Define datasets in
actor.jsonunderstorages.datasets - Each key becomes an alias that refers to that dataset within the run
- A
defaultalias is required; all others are optional - Each dataset automatically follows the run’s data retention policy
- Reference datasets in your output schema via template variables, e.g.
{{storages.datasets.categories.apiUrl}}
- JS SDK ≥3.7.0:
await Actor.openDataset({alias: 'categories'}) - Python SDK ≥3.3.0:
await Actor.open_dataset(alias='categories') - Without SDK: Read dataset IDs from the
ACTOR_STORAGES_JSONenvironment variable
For more information on multiple datasets support, see the docs .
