Output to Dataset

Merges outputs from multiple actors into a single dataset. Execute actors in series or parallel, combine data from datasets, key-value stores, webhooks, and export the final output in various formats.

Pricing

from $0.07 / 1,000 results

Rating: 5.0 (1)
Developer: njoylab · Maintained by Community
Actor stats: 0 bookmarked · 8 total users · 1 monthly active user · last modified 7 days ago

Apify actor that merges outputs from multiple actors into a single dataset. Execute actors in series or parallel, combine data from datasets, key-value stores, and webhooks, and get ready-made JSON/CSV/XLSX download links straight from the run logs.

📋 Table of Contents

- ✨ Features
- 🚀 Quick Start
- 📝 Input Configuration
- 📦 Complete Example
- 📤 Output
- 💡 Use Cases
- 📄 License

✨ Features

| Feature | Description |
|---------|-------------|
| 🔗 Multiple Data Sources | Fetch data from existing datasets, key-value stores, actor runs, or webhook URLs |
| ⚙ Actor Execution | Run multiple actors and collect their outputs in parallel or series |
| 🔀 Merge Strategies | Append all data or deduplicate based on specified fields |
| 🔧 Transformations | Filter, remap, pick, or enrich records before merging |
| 📥 Instant Downloads | Every run logs a dataset console link plus JSON, CSV, and XLSX download URLs |

🚀 Quick Start

Run this actor to merge data from multiple sources into a single dataset:

```json
{
  "actorRuns": [
    {
      "actorId": "apify/web-scraper",
      "input": { "startUrls": [{ "url": "https://example.com" }] }
    }
  ],
  "mergeStrategy": "append"
}
```

After the run completes, check the logs for direct download links in JSON, CSV, and XLSX formats.


📝 Input Configuration

Sources

Merge data from existing Apify resources. Specify an array of sources with their type and identifier:

| Type | Description | Required Fields |
|------|-------------|-----------------|
| `dataset` | Read items from an existing dataset | `id` - Dataset ID |
| `keyValueStore` | Read a record from a key-value store | `id` - Store ID, `key` - Record name |
| `actorRun` | Read output from a previous actor run | `id` - Run ID |
| `webhook` | Fetch data from an external URL | `id` - Webhook URL |

Example:

```json
{
  "sources": [
    { "type": "dataset", "id": "datasetId123" },
    { "type": "keyValueStore", "id": "storeId456", "key": "OUTPUT" },
    { "type": "actorRun", "id": "runId789" },
    { "type": "webhook", "id": "https://api.example.com/data" }
  ]
}
```
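The required-fields rules in the table above can be sketched as a small validation helper. This is illustrative only; the actor's actual input validation may differ, and the function name is ours:

```javascript
// Validate a source entry against the required fields listed above.
// Throws on a malformed entry; accepted types come from the table.
function validateSource(source) {
    const types = ['dataset', 'keyValueStore', 'actorRun', 'webhook'];
    if (!types.includes(source.type)) {
        throw new Error(`Unknown source type: ${source.type}`);
    }
    if (!source.id) {
        throw new Error(`Source of type "${source.type}" is missing "id"`);
    }
    // Key-value store sources additionally need the record name.
    if (source.type === 'keyValueStore' && !source.key) {
        throw new Error('keyValueStore source is missing "key"');
    }
    return source;
}
```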

Actor Runs

Execute actors before merging their outputs. Each actor run configuration supports:

| Field | Type | Description |
|-------|------|-------------|
| `actorId` | string | Required. Actor ID or `username/actor-name` |
| `input` | object | Input object to pass to the actor |
| `outputType` | string | `"dataset"` (default) or `"keyValueStore"` |
| `outputKey` | string | Record name when using the `keyValueStore` output type |

Example:

```json
{
  "actorRuns": [
    {
      "actorId": "apify/web-scraper",
      "input": { "startUrls": [{ "url": "https://example.com" }] },
      "outputType": "dataset"
    },
    {
      "actorId": "apify/google-search-scraper",
      "input": { "queries": "apify" },
      "outputType": "keyValueStore",
      "outputKey": "OUTPUT"
    }
  ]
}
```

💡 Tip: Use `outputType: "dataset"` (default) when the actor pushes items to its dataset. Use `outputType: "keyValueStore"` when the actor saves data via `Actor.setValue()`.


Execution Mode

Controls how actors are executed:

| Mode | Description |
|------|-------------|
| `parallel` | Default. Run all actors simultaneously for faster results |
| `series` | Run actors one after another (useful when order matters or for rate limiting) |
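For illustration, the difference between the two modes can be sketched with plain promises. This is a minimal sketch, not the actor's actual implementation; each actor run is modeled as an async function returning its output:

```javascript
// Run a list of async tasks in the configured execution mode.
// 'parallel' starts everything at once; 'series' awaits each task
// before starting the next, preserving order and throttling load.
async function runAll(tasks, executionMode) {
    if (executionMode === 'parallel') {
        return Promise.all(tasks.map((task) => task()));
    }
    const results = [];
    for (const task of tasks) {
        results.push(await task());
    }
    return results;
}
```

Either way, the results array keeps the same order as the input tasks; only the start times differ.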

Merge Strategy

Determines how data is combined:

| Strategy | Description |
|----------|-------------|
| `append` | Default. Combine all items, keeping duplicates |
| `deduplicate` | Remove duplicate items based on `deduplicateBy` fields |

Deduplication example:

```json
{
  "mergeStrategy": "deduplicate",
  "deduplicateBy": ["url", "title"]
}
```
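One way to implement this strategy is a composite-key filter; a minimal sketch, assuming the first occurrence of each key combination wins (the actor's exact tie-breaking rule is an assumption here):

```javascript
// Drop items whose deduplicateBy field values have been seen before.
function deduplicate(items, deduplicateBy) {
    const seen = new Set();
    return items.filter((item) => {
        // Build a composite key from the configured fields.
        const key = JSON.stringify(deduplicateBy.map((field) => item[field]));
        if (seen.has(key)) return false;
        seen.add(key);
        return true;
    });
}
```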

Transformations

Apply transformations to each item before merging. Transformations run in the order provided.

| Type | Description | Parameters |
|------|-------------|------------|
| `filter` | Keep only items matching a condition | `field`, `operator`, `value` |
| `mapFields` | Copy data from one field path to another | `mapping`, `removeOriginal` |
| `pickFields` | Keep only specified fields | `fields`, `dropUndefined` |
| `setField` | Set a static value on a field | `field`, `value`, `overwrite` |

Filter operators: `equals`, `notEquals`, `contains`, `greaterThan`, `lessThan`, `exists`
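The operators can be read as a single predicate over one item. The sketch below is illustrative; how the actor treats missing fields or non-string values for `contains` is an assumption:

```javascript
// Evaluate one filter condition against an item.
function matches(item, { field, operator, value }) {
    const actual = item[field];
    switch (operator) {
        case 'equals': return actual === value;
        case 'notEquals': return actual !== value;
        // Assumed: 'contains' applies to string fields only.
        case 'contains': return typeof actual === 'string' && actual.includes(value);
        case 'greaterThan': return actual > value;
        case 'lessThan': return actual < value;
        // Assumed: null counts as absent.
        case 'exists': return actual !== undefined && actual !== null;
        default: throw new Error(`Unknown operator: ${operator}`);
    }
}
```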

Example:

```json
{
  "transformations": [
    {
      "type": "filter",
      "field": "price",
      "operator": "lessThan",
      "value": 50
    },
    {
      "type": "mapFields",
      "mapping": {
        "title": "product.name",
        "price": "product.price"
      },
      "removeOriginal": true
    },
    {
      "type": "pickFields",
      "fields": ["product.name", "product.price", "url"]
    },
    {
      "type": "setField",
      "field": "currency",
      "value": "USD",
      "overwrite": false
    }
  ]
}
```
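The `mapFields` step in this example writes flat fields into nested dot-paths. A minimal sketch of that behavior, assuming mapping keys are source fields and values are target paths (both the direction and the helper names are our assumptions):

```javascript
// Write a value into an object at a dot-separated path,
// creating intermediate objects as needed.
function setByPath(obj, path, value) {
    const parts = path.split('.');
    let cursor = obj;
    for (const part of parts.slice(0, -1)) {
        cursor[part] = cursor[part] ?? {};
        cursor = cursor[part];
    }
    cursor[parts[parts.length - 1]] = value;
}

// Apply a mapFields transformation to a single item.
function mapFields(item, { mapping, removeOriginal }) {
    const out = { ...item };
    for (const [from, to] of Object.entries(mapping)) {
        setByPath(out, to, out[from]);
        if (removeOriginal) delete out[from];
    }
    return out;
}
```

With the mapping from the example above, `{ title: "X", price: 5, url: "u" }` becomes `{ url: "u", product: { name: "X", price: 5 } }`.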

📦 Complete Example

This example runs two actors in parallel, merges their outputs with an existing dataset, and deduplicates by URL:

```json
{
  "actorRuns": [
    {
      "actorId": "apify/web-scraper",
      "input": {
        "startUrls": [{ "url": "https://apify.com/store" }],
        "pageFunction": "async function pageFunction(context) { return context.request; }"
      }
    },
    {
      "actorId": "apify/google-search-scraper",
      "input": { "queries": "web scraping" }
    }
  ],
  "sources": [
    { "type": "dataset", "id": "existingDatasetId" }
  ],
  "executionMode": "parallel",
  "mergeStrategy": "deduplicate",
  "deduplicateBy": ["url"]
}
```

📤 Output

All merged records are saved to the actor's default dataset. After each run, the logs display:

  1. Console link - Direct link to view the dataset in Apify Console
  2. Download URLs - Ready-to-use links for JSON, CSV, and XLSX exports

You can also export the dataset manually from the Apify Console in any supported format.
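The logged download URLs can also be built by hand from the run's default dataset ID via Apify's public dataset-items endpoint. A sketch, assuming the `format` query values match the export formats above:

```javascript
// Build export links for a dataset using the Apify API
// dataset-items endpoint (GET /v2/datasets/{datasetId}/items).
function buildDownloadLinks(datasetId) {
    const base = `https://api.apify.com/v2/datasets/${datasetId}/items`;
    return {
        json: `${base}?format=json`,
        csv: `${base}?format=csv`,
        xlsx: `${base}?format=xlsx`,
    };
}
```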


💡 Use Cases

1. Merge Multiple Scraping Runs

Run the same scraper with different inputs and merge all results:

```json
{
  "actorRuns": [
    { "actorId": "my-scraper", "input": { "category": "electronics" } },
    { "actorId": "my-scraper", "input": { "category": "books" } },
    { "actorId": "my-scraper", "input": { "category": "clothing" } }
  ],
  "executionMode": "parallel",
  "mergeStrategy": "append"
}
```

2. Combine Historical Data

Merge data from multiple previous actor runs:

```json
{
  "sources": [
    { "type": "actorRun", "id": "run1" },
    { "type": "actorRun", "id": "run2" },
    { "type": "actorRun", "id": "run3" }
  ],
  "mergeStrategy": "deduplicate",
  "deduplicateBy": ["id"]
}
```

3. Aggregate Multiple Datasets

Combine existing datasets into one unified dataset:

```json
{
  "sources": [
    { "type": "dataset", "id": "dataset1" },
    { "type": "dataset", "id": "dataset2" },
    { "type": "dataset", "id": "dataset3" }
  ]
}
```

📄 License

This project is licensed under the ISC License.