Back to all change logs

Apr 23, 2026

Multiple datasets for Actors

New

Actor

Actors can now output to multiple datasets, each with its own schema and validation rules. Keep your data clean, structured, and purpose-built.

Use case

Website Content Crawler  uses multiple datasets to store details about URLs it failed to scrape, making it easier to debug crawl issues without digging through the main output.

How it works

  • Define datasets in actor.json under storages.datasets
  • Each key becomes an alias that refers to that dataset within the run
  • A default alias is required; all others are optional
  • Each dataset automatically follows the run’s data retention policy
  • Reference datasets in your output schema via template variables, e.g. {{storages.datasets.categories.apiUrl}}

Access datasets

  • JS SDK ≥3.7.0: await Actor.openDataset({alias: 'categories'})
  • Python SDK ≥3.3.0: await Actor.open_dataset(alias='categories')
  • Without SDK: Read dataset IDs from the ACTOR_STORAGES_JSON environment variable

For more information on multiple datasets support, see the docs .

Patrik Braborec

Product Manager