Workday Jobs Scraper & B2B Lead Generator
Pricing
from $4.90 / 1,000 results
Workday Jobs Scraper & B2B Lead Generator
Scrape Workday career portals instantly. Auto-filter retail/blue-collar roles to extract HQ decision-makers. Get clean, English-only, LLM-ready job listings for B2B sales automation.
Pricing
from $4.90 / 1,000 results
Rating
0.0
(0)
Developer
Aether
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
a month ago
Last modified
Categories
Share
Workday HQ Intent Signals Scraper
Stop paying for 10,000 retail and warehouse listings. Get only the HQ & IT roles that actually buy B2B software.
This Apify Actor scrapes corporate Workday career portals and delivers clean, LLM-ready JSON containing only corporate, technology, and decision-maker job openings. No cashiers. No warehouse operators. No interns. Just high-intent B2B buying signals.
๐ฏ Why This Exists
Most job scrapers give you everything โ 90% of which is worthless for B2B SaaS prospecting. This Actor is purpose-built for SDRs and revenue teams:
| What Others Deliver | What We Deliver |
|---|---|
| 10,000+ retail/blue-collar jobs | Only HQ, IT, and corporate roles |
| Bloated HTML descriptions | Clean 400-char plain-text summaries |
| No requisition IDs | requisition_id extracted for hyper-personalized outreach |
| No intent classification | intent_category โ Data & AI, Revenue, Engineering, Product, Marketing, General |
| One-size-fits-all | Configurable filters: max age, HQ-only, English-only |
๐ง How It Works
- API Interception via Playwright โ intercepts the internal Workday search JSON API (no fragile DOM scraping).
- Smart Filtering โ excludes
cashier|warehouse|driver|intern|store|retail|clerk|assistant|worker|operator|mechanic|technician|student. - Requisition ID Extraction โ pulls the
R234160-style ID from Workday's API response. - Intent Classification โ categorizes each role into Data & AI, Revenue, Engineering, Product, Marketing, or General.
- Clean Output โ strips HTML to 400-char plain-text summaries, ready to paste into ChatGPT or your outreach tool.
๐ฆ Output Schema
Each record pushed to the Apify dataset:
{"company_name": "sysco","requisition_id": "R234160","job_title": "Senior Data Engineer","location": "Houston, TX","days_active": 3,"intent_category": "Data & AI","job_summary_clean": "We are looking for a Senior Data Engineer to design and build scalable data pipelines...","apply_url": "https://sysco.wd5.myworkdayjobs.com/syscocareers/job/Houston-TX/Senior-Data-Engineer_R234160"}
Clean, flat, no nested objects. Perfect for LLM ingestion or direct upload to your CRM.
โ๏ธ Input Configuration
| Field | Type | Default | Description |
|---|---|---|---|
company_workday_urls | Array<String> | (required) | Workday career portal URLs to scrape |
max_days_old | Integer | 7 | Skip jobs older than N days |
hq_roles_only | Boolean | true | Filter out retail, warehouse, intern, and blue-collar roles |
english_only | Boolean | true | Include only English-language job postings |
Example input:
{"company_workday_urls": ["https://sysco.wd5.myworkdayjobs.com/syscocareers","https://coca-cola.wd5.myworkdayjobs.com/cocacola"],"max_days_old": 14,"hq_roles_only": true,"english_only": true}
๐ Use Cases
- SDR Prospecting โ Find companies hiring for Data, AI, or Revenue roles โ trigger personalized outreach.
- Intent Monitoring โ Run weekly to detect spikes in engineering or product hiring (expansion signal).
- Competitive Intel โ Track which competitors are building out specific teams.
- Account Prioritization โ Rank accounts by recency and volume of corporate openings.
๐ณ Running Locally
# Install dependenciesnpm install# Run with tsx (dev mode)npm start# Build for productionnpm run build
For Apify platform deployment, the Dockerfile uses apify/actor-node-playwright-chrome:20 โ all browsers and dependencies are pre-installed.
๐ก๏ธ Built for Scale
- Playwright + Chromium for robust SPA rendering and API interception
- Automatic pagination through Workday's internal search API
- Apify proxy integration to avoid IP rate-limiting
- TypeScript โ fully typed, no guesswork
Stop scraping noise. Start scraping signals.