Apify Dataset QA Gate
Pricing
Pay per usage
Apify Dataset QA Gate
Pass, warn, or stop Apify datasets before CRM import, enrichment, client delivery, or webhook automation.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
jiaxun mao
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
3 days ago
Last modified
Categories
Share
Pass, warn, or stop Apify datasets before CRM import, enrichment, client delivery, or webhook automation.
This Actor is for teams that run scrapers repeatedly and need a quality gate before bad data flows into expensive or visible downstream steps.
Use it after a scraper and before:
- CRM import
- enrichment APIs
- Google Sheets exports
- client lead-list delivery
- n8n, Make, Zapier, or Apify webhook workflows
Why use a QA gate?
Deduplication tools clean rows. Enrichment tools add data. This Actor answers the earlier question:
Should this dataset continue through the workflow at all?
It returns:
qaStatus:PASS,WARN, orFAILautomationAction:continue,review, orstop- failed quality checks with actual vs expected values
- CRM-ready record count and percentage
- duplicate count and percentage
- field coverage for company, domain, email, phone, location, and category
- sample messy rows for review
- sample clean rows that can continue downstream
- recommendations for cleanup, enrichment, or scoring
Input options
Use either:
records: paste raw records as JSON.sourceDatasetId: select an existing Apify Dataset ID.
Example:
{"sourceDatasetId": "YOUR_DATASET_ID","requiredFields": ["companyName", "domain", "email", "phone", "location"],"passThresholds": {"minCrmReadyPercent": 80,"maxDuplicatePercent": 10,"minRequiredFieldCoveragePercent": 70},"maxRecords": 10000,"sampleSize": 25}
Automation workflow
- Run a lead, directory, product, review, or listing scraper.
- Send the scraper Dataset ID into this Actor.
- If
automationActioniscontinue, send clean rows to CRM, Sheets, or enrichment. - If
automationActionisreview, route the dataset to manual review. - If
automationActionisstop, block the workflow before wasting enrichment credits or importing bad data.
Lead workflow
For lead lists, run this Actor first. If the dataset passes, run Lead Intelligence Scorer to deduplicate, score, and prioritize the leads.
Recommended chain:
scraper -> QA Gate -> Lead Intelligence Scorer -> CRM/export
Best fit
- agencies validating client lead-list deliverables
- operators running scheduled Apify scrapers
- founders sending scraper output into Sheets or a CRM
- automation builders who need a simple pass/fail signal