Pricing

from $1.80 / 1,000 job-results

Public ATS Hiring Signal Scraper

Scrape public Greenhouse, Lever, Ashby, SmartRecruiters, Workable and Recruitee job boards into clean, CSV-ready hiring-signal data - no login or cookies required.

Pricing

from $1.80 / 1,000 job-results

Rating

0.0

(0)

Developer

Delowar Munna

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

✨ Why this scraper

Hiring-intelligence focused — every row carries derived signals (department, remote type, seniority, role category, detected technologies, salary when visible, and a transparent hiring-signal score) useful for sales triggers, recruiting, and market research.
Six ATS providers, one schema — Greenhouse, Lever, Ashby, SmartRecruiters, Workable, Recruitee, all normalized into the same flat row.
Two input modes — paste board/career URLs (provider auto-detected), or give known boards by provider + slug.
Flat, CSV-friendly output — no nested objects; drops straight into Sheets/Excel/CRMs.
Pay-Per-Event — one flat job-result event per saved unique job. Duplicates, filtered, and invalid rows are never charged.
No AI, no enrichment costs — rule-based derivations only, from visible scraped fields.

🚀 Quick start — sample inputs

Example 1 — board/career URLs (provider auto-detected)

{
    "startUrls": [
        { "url": "https://boards.greenhouse.io/airbnb" },
        { "url": "https://jobs.lever.co/leverdemo" },
        { "url": "https://jobs.ashbyhq.com/Ashby" }
    ],
    "maxResults": 500,
    "includeKeywords": ["engineer", "data", "sales"],
    "excludeKeywords": ["internship"],
    "includeDescription": true,
    "dedupe": true,
    "proxyConfiguration": { "useApifyProxy": true }
}

Example 2 — known boards by provider + slug, remote-only, with your own proxy

{
    "companySlugs": [
        { "provider": "greenhouse", "slug": "airbnb" },
        { "provider": "smartrecruiters", "slug": "SmartRecruiters" },
        { "provider": "recruitee", "slug": "sympower" }
    ],
    "providers": ["greenhouse", "smartrecruiters", "recruitee"],
    "maxResults": 1000,
    "maxResultsPerSource": 200,
    "remoteOnly": true,
    "departments": ["engineering", "data"],
    "includeDescription": true,
    "proxyConfiguration": {
        "useApifyProxy": false,
        "proxyUrls": ["http://user:pass@proxy.iproyal.com:12321"]
    }
}

Provide at least one of startUrls or companySlugs. If you provide both, the actor runs both and deduplicates across the whole run.

The actor blocks Apify Residential proxy; if you need residential routing, supply your own provider via proxyConfiguration.proxyUrls. See 🚦 Proxy policy below.

📦 Output

The dataset has one view: Jobs & hiring signals — a flat 33-column table.

Public ATS Hiring Signal Scraper — all-fields table view

Output fields (33)

job_id, job_key, provider, company_name, company_slug, job_title, department, team, location, locations_text, country, remote_type, employment_type, seniority, role_category, salary_min, salary_max, salary_currency, salary_period, job_description, description_length, detected_technologies, signal_score, signal_label, signal_tags, posted_date, posted_date_available, apply_url, job_url, source_url, source_type, raw_source_fields_json, scraped_at.

Sample record — Jobs & hiring signals

(Real run output; job_description truncated here for readability.)

{
    "job_id": "7957137",
    "job_key": "greenhouse:7957137",
    "provider": "greenhouse",
    "company_name": "Airbnb",
    "company_slug": "airbnb",
    "job_title": "Growth Strategy & Operations Lead",
    "department": "Sales",
    "team": "",
    "location": "Paris, France",
    "locations_text": "Paris, France",
    "country": "France",
    "remote_type": "unknown",
    "employment_type": "contract",
    "seniority": "lead",
    "role_category": "sales",
    "salary_min": 98000,
    "salary_max": 123000,
    "salary_currency": "EUR",
    "salary_period": "year",
    "job_description": "Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals...",
    "description_length": 6097,
    "detected_technologies": "python; sql",
    "signal_score": 95,
    "signal_label": "high",
    "signal_tags": "many_open_roles; large_hiring_batch; revenue_role; senior_role; tech_stack_visible; salary_visible; detailed_job_description; recent_posting; growth_language",
    "posted_date": "2026-05-27",
    "posted_date_available": true,
    "apply_url": "https://careers.airbnb.com/positions/7957137?gh_jid=7957137",
    "job_url": "https://careers.airbnb.com/positions/7957137?gh_jid=7957137",
    "source_url": "https://boards.greenhouse.io/airbnb",
    "source_type": "start_url",
    "raw_source_fields_json": "",
    "scraped_at": "2026-06-02T06:38:35.705Z"
}

🎯 Hiring-signal score

Transparent rule-based score (0–100) computed only from visible scraped fields — no AI, no external enrichment.

Signal	Points
Company has 5+ open jobs in this run	+20
Company has 15+ open jobs in this run	+15
Role is engineering, data, sales, or product	+10
Seniority is senior, lead, or executive	+10
Remote or hybrid	+10
At least one technology detected	+10
Salary / compensation visible	+10
Description longer than 800 characters	+10
Posted within the last 30 days	+5
Growth language in title (growth, expansion, scale, …)	+5

Score is capped at 100. Labels: low (0–39) · medium (40–69) · high (70–100).

signal_tags is a semicolon-separated list explaining the score — e.g. many_open_roles, large_hiring_batch, revenue_role, technical_role, senior_role, remote_hiring, tech_stack_visible, salary_visible, detailed_job_description, recent_posting, growth_language. The score is a transparent sorting aid, not a prediction.

💰 Pricing

Pay-Per-Event. One flat event per saved row (final per-event price is configured on the Apify console):

Event	Charged when
`job-result`	Once per unique job row that passed all filters and was successfully written to the dataset.

So your bill is simply results_saved × price_per_event. The actor honors the user-configured per-run spending cap (Apify eventChargeLimitReached): it caps how many results it collects up-front to what the limit can pay for, and stops cleanly the moment the cap is reached.

Not charged: duplicates, filtered-out rows, invalid rows (missing title and URL/ID), failed sources, or provider-discovery attempts.

🚦 Proxy policy

Use Apify Datacenter proxy or no proxy for normal runs — both work reliably for public ATS endpoints at this actor's conservative concurrency.

Apify Residential proxy is not supported. The actor will fail at startup if proxyConfiguration.apifyProxyGroups includes RESIDENTIAL. Reason: in pay-per-event actors, residential bandwidth (~/GB) is billed to the developer, not the run user, so a single bandwidth-heavy run could exceed the per-result event revenue.

If you genuinely need residential routing, supply your own residential provider via the proxy editor's Custom proxy URLs field — that traffic goes through your provider, not Apify, and is unaffected:

http://user:pass@proxy.iproyal.com:12321
http://user:pass@proxy.brightdata.com:22225
http://user:pass@proxy.oxylabs.io:7777

📊 Run summary

After each run, a RUN_SUMMARY entry is written to the key-value store:

{
    "inputs_total": 3,
    "successful_inputs": 3,
    "failed_inputs": 0,
    "unsupported_inputs": 0,
    "providers_detected": { "greenhouse": 1, "lever": 1, "recruitee": 1 },
    "raw_results_found": 626,
    "results_saved": 500,
    "duplicates_removed": 12,
    "filtered_out": 48,
    "invalid_rows": 0,
    "charged_events": 500,
    "blocked_requests": 0,
    "retry_count": 0,
    "detail_pages_visited": 0,
    "proxy_mode": "apify_datacenter",
    "runtime_seconds": 6,
    "scraped_at": "2026-06-02T02:05:14.061Z"
}

charged_events equals the number of successfully saved unique rows.

⚙️ Filters

Filter	Effect
`includeKeywords` / `excludeKeywords`	Case-insensitive match on title, department, team, location, and description. Exclusion wins.
`locationQuery`	Case-insensitive substring on location, locations text, and country.
`remoteOnly`	Keep only jobs classified as `remote`.
`departments`	Match against department, team, and role category.
`postedAfter` (`YYYY-MM-DD`)	Minimum posted date; rows with no reliable date are kept (`posted_date_available=false`).
`dedupe`	Remove duplicates across sources and repeated URLs (recommended ON).

Filters are applied before any dataset push or event charge.

🚧 Limitations (V1)

Public board data only — no login, cookies, or member-only content.
Some providers (SmartRecruiters, Workable) expose the description only on a per-job detail call; turning includeDescription off skips those calls (faster, fewer fields).
Salary is populated only when a provider exposes explicit compensation — Greenhouse pay-transparency ranges, Ashby's structured salary component, Lever, and Recruitee. It is never inferred from free text.
Employment type comes from the provider's own field where available (Lever, Ashby, SmartRecruiters, Workable, Recruitee). For Greenhouse (which has no employment-type field), it is inferred conservatively from the title and a few high-confidence description phrases, and is left empty when not certain.
Posted date is normalized to YYYY-MM-DD only where the provider exposes a reliable date; otherwise posted_date_available=false.
No recruiter/contact extraction, email enrichment, company-website crawling, LinkedIn enrichment, or AI scoring.
Company career pages are resolved only when they embed a supported ATS board link.

❓ FAQ

Do I need an account or cookies for any board? No. The actor only uses public ATS endpoints. Boards that require login are marked unsupported and skipped.

Which providers are supported? Greenhouse, Lever, Ashby, SmartRecruiters, Workable, and Recruitee.

Can I just paste a company careers page URL? Yes — put it in startUrls. If it embeds a supported ATS board, the actor detects the provider and slug automatically. Otherwise it's recorded as unsupported and the run continues.

Why are some rows missing salary or posted date? Those fields only appear when the provider exposes them. Salary is never guessed; posted date is kept only when reliable.

Can I export to CSV? Yes — every field is flat. Use Apify's CSV / Excel export, or call the dataset API with format=csv.

Will I get blocked? The actor uses conservative concurrency, realistic headers, session rotation, and retry/backoff. Default Apify Proxy is sufficient for typical runs; supply your own proxy for very large runs.

🛠️ Technical notes

Stack: Node.js 22 · Apify SDK 3 · Crawlee HttpCrawler (JSON-first) · native fetch/got-scraping. No browser.
Endpoints: each provider's public job-board JSON API (Greenhouse boards API, Lever postings, Ashby posting-api, SmartRecruiters postings, Workable accounts API, Recruitee offers).
Concurrency: min=1, max=10 (conservative; tune after real runs).
Memory: 1 GB min · 2 GB default · 4 GB max.
Proxy: Apify Proxy enabled by default; custom configs accepted; Apify Residential rejected at startup.
Modular adapters: each provider lives in its own src/providers/*.js file with a uniform interface, ready to lift into single-provider clone actors.

Public ATS Hiring Signals Monitor

sgforce/public-ats-hiring-signals-monitor

Monitor public Greenhouse, Lever, Ashby, Workable, and Recruitee job boards. Export clean hiring-signal rows for sales, recruiting, market research, and API workflows.

Francesco Scilipoti

Ashby Hiring Intelligence Scraper

coregent/ashby-hiring-intelligence-scraper

Scrape public Ashby job boards by board name or URL into clean, CSV-ready hiring-signal data - titles, locations, departments, compensation, descriptions, remote flags, and signal tags. No login or cookies required.

Delowar Munna

ATS Hiring Signal Report

taroyamada/ats-hiring-signal-report

Turn public Greenhouse, Lever, and Ashby job boards into decision-ready hiring signal reports with role priorities, keyword matches, region signals, warnings, and next actions. No user API key required.

naoki anzai

ATS Hiring-Signal Scraper (Greenhouse, Lever, Ashby)

datahoeven/ats-hiring-signal-scraper

Scrape jobs and hiring signals from Greenhouse, Lever and Ashby public JSON APIs into one uniform, deduplicated schema.

Daan Hoeven

ATS Hiring Signal Scraper

taroyamada/ats-hiring-signal-intelligence

Monitor Greenhouse, Lever, and Ashby public job boards for new roles, departments, regions, remote hiring, matched keywords, hiring signal scores, PPE charged events, and no-charge invalid/no-public-board rows.

naoki anzai

ATS Job Scraper & Monitor — Greenhouse, Lever, Ashby +3

vertaizen/ats-hiring-signals

ATS job scraper for Greenhouse, Lever, Ashby, SmartRecruiters, Recruitee & Workable. Point it at any company list and get clean job postings. Monitor mode returns only NEW jobs — hiring signals for sales prospecting & recruiting intel. Pay per result, no proxies, MCP-ready.

Diego Moragues

ATS Jobs Scraper — Greenhouse, Lever, Ashby, Recruitee

agency-shift/ats-jobs-scraper

Scrape job postings from ATS platforms (Greenhouse, Lever, Ashby, Recruitee). Track hiring signals and company growth. For recruitment intelligence.

Valdeir Lima

Jobs Scraper — Greenhouse, Lever, Workday, Ashby & Remote

sashaebashu/job-postings-scraper

Scrape jobs from Greenhouse, Lever, Workday, Ashby, Recruitee, SmartRecruiters + remote boards. Titles, departments, locations, dates. Hiring-signal data.