Hiring.Cafe Scraper — 2.8M AI-Enriched Jobs from 46 ATS

Pricing

from $1.80 / 1,000 results

Scrape Hiring.Cafe (hiring.cafe) — AI-enriched job aggregator with 2.8M+ listings from 46 ATS platforms. Structured salary, company, and remote-work data with incremental tracking for recurring job monitoring.

Developer

Black Falcon Data

Maintained by Community

What does Hiring.Cafe Scraper do?

Hiring.Cafe Scraper extracts structured job data from hiring.cafe — including salary data, apply URLs, company metadata, full descriptions, and location data. It supports keyword search, location filters, and controllable result limits, so you can run the same query consistently over time. The actor also offers detail enrichment (full descriptions and company metadata) where the source provides them.

New to Apify? Sign up free and use the included $5 monthly platform credit to test this actor.

Key features

  • ♻️ Incremental mode — recurring runs emit only NEW, UPDATED, and REAPPEARED records by default; UNCHANGED and EXPIRED are opt-in. The first run builds the baseline; subsequent runs emit and charge only for the diff. Pair with notifications for daily "new jobs" alerts to your hiring team. In the pricing examples on this page, this saves roughly 68–92% per run at 5–30% churn.
  • 🔔 Notifications — Telegram, Slack, Discord, WhatsApp Cloud API, and generic webhooks are supported out of the box. Pair with incremental mode and notifyOnlyChanges for daily "new Hiring.Cafe jobs" pings to your hiring channel.
  • 📋 Detail enrichment — two-stage mode: list, then enrich each job with the full description + detail-page fields (apply counts, education, etc.). One toggle, no extra orchestration.
  • 🔀 Source-board provenance — every listing carries the original source-board URL — full audit trail across hiring.cafe's aggregated sources, so you can de-duplicate against direct-source feeds you already run.
  • 📦 Compact mode — AI-agent and MCP-friendly compact payloads with core fields only — pipe straight into your ATS, salary-benchmarking tool, or LLM context without parsing extras.
  • ✂️ Description truncation — cap description length with descriptionMaxLength to control LLM prompt cost and dataset size — set 0 for full descriptions, or any char-limit to trim.
  • 📌 Change classification — each record carries a changeType of NEW / UPDATED / UNCHANGED / REAPPEARED / EXPIRED. Default emits NEW + UPDATED + REAPPEARED; opt into the others with emitUnchanged / emitExpired. Repost detection flags previously-expired listings that come back.
  • 📤 Export anywhere — Download the dataset as JSON, CSV, or Excel from the Apify Console, or stream live via the Apify API and integrations (Make, Zapier, Google Sheets, n8n, …).
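
The change classification described in the list above can be illustrated with a small diff between two snapshots. This is a minimal sketch, not the actor's actual implementation: the `classify` and `content_hash` helpers, the state layout (a dict of jobId to content hash plus a set of expired IDs), and the choice to hash title and description are all assumptions made for illustration.

```python
import hashlib

def content_hash(job):
    """Hash the fields treated as 'tracked content' (an assumption)."""
    tracked = f"{job.get('title')}|{job.get('description')}"
    return hashlib.sha256(tracked.encode("utf-8")).hexdigest()

def classify(previous, expired_ids, current_jobs):
    """Return {jobId: changeType} for one run.

    previous:     {jobId: content hash} of jobs active in the last run
    expired_ids:  set of jobIds that had already expired before this run
    current_jobs: list of job dicts from the current result window
    """
    result = {}
    seen = set()
    for job in current_jobs:
        job_id = job["jobId"]
        seen.add(job_id)
        digest = content_hash(job)
        if job_id in previous:
            result[job_id] = "UNCHANGED" if previous[job_id] == digest else "UPDATED"
        elif job_id in expired_ids:
            result[job_id] = "REAPPEARED"
        else:
            result[job_id] = "NEW"
    # Jobs that were active last time but are missing from this run.
    for job_id in previous:
        if job_id not in seen:
            result[job_id] = "EXPIRED"
    return result
```

With the default settings described above, a consumer would then keep only the NEW, UPDATED, and REAPPEARED entries.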

What data can you extract from hiring.cafe?

Each result includes core listing fields (jobId, title, location, workplaceType, commitment, seniorityLevel, jobCategory, salaryMin, and more), detail fields when enrichment is enabled (roleType, roleActivities, and description), apply information (applyUrl), and company metadata (company, positionEmployerType, companyName, and companyWebsite). In standard mode, every field is always present; unavailable data points are returned as null, never omitted. In compact mode, only core fields are returned.

Enable detail enrichment in the input to get richer fields such as full descriptions and company metadata where the source provides them.
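
Because standard mode always includes every key and uses null for missing values, consumer code can index fields directly but still has to guard against None. A minimal sketch, assuming records are loaded as Python dicts; `salary_midpoint` is a hypothetical helper, not part of the actor:

```python
def salary_midpoint(record):
    """Return the midpoint of salaryMin/salaryMax, or None if neither is set."""
    low, high = record["salaryMin"], record["salaryMax"]
    if low is None and high is None:
        return None
    if low is None or high is None:
        # Only one bound is known; use it as the best available estimate.
        return float(low if low is not None else high)
    return (low + high) / 2

record = {"salaryMin": 108000, "salaryMax": 180000}
print(salary_midpoint(record))  # 144000.0
```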

Input

The main inputs are a search keyword, an optional location filter, and a result limit. Additional filters and options are available in the input schema.

Key parameters:

  • query — Job search keywords. Leave blank to browse all jobs.
  • country — Country market to search. Hiring.Cafe currently exposes a verified country-level search market for the United States; more countries will be added here after their location slugs are verified. (default: "US")
  • location — City, state, or region.
  • maxResults — Maximum total results (0 = unlimited). Memory scales automatically: 256 MB for up to 1,000 results, 512 MB for up to 2,000, and 1024 MB above that. (default: 25)
  • includeDetails — Fetch full job details. (default: true)
  • descriptionMaxLength — Truncate description to N chars. 0 = no truncation. (default: 0)
  • compact — Core fields only (for AI-agent/MCP workflows). (default: false)
  • incrementalMode — Compare against previous run state. (default: false)
  • stateKey — Optional stable identifier for the tracked search universe. Leave empty to auto-derive a stable identifier from your search inputs — different keyword/location/filter combinations get isolated state automatically.
  • skipReposts — In incremental mode, skip jobs that appear to be reposts of previously-seen expired jobs (same content hash). (default: false)
  • emitUnchanged — In incremental mode, include unchanged jobs in the dataset. Useful when you want a full snapshot with changeType on every record. (default: false)
  • emitExpired — In incremental mode, include EXPIRED tombstone records for jobs that were active in previous state but are absent from the current result window. (default: false)
  • ...and 11 more parameters
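
The parameters listed above can be assembled programmatically. The sketch below is illustrative only: `build_input` and `expected_memory_mb` are hypothetical helpers, the defaults are copied from the list above, and treating maxResults = 0 (unlimited) as the top memory tier is an assumption.

```python
def build_input(query="", location=None, max_results=25, **options):
    """Assemble an actor input dict using the documented defaults."""
    if max_results < 0:
        raise ValueError("maxResults must be 0 (unlimited) or a positive number")
    run_input = {
        "query": query,
        "country": "US",
        "maxResults": max_results,
        "includeDetails": True,
        "descriptionMaxLength": 0,
        "compact": False,
        "incrementalMode": False,
    }
    if location:
        run_input["location"] = location
    # Any other documented parameter (stateKey, emitExpired, ...) overrides defaults.
    run_input.update(options)
    return run_input

def expected_memory_mb(max_results):
    """Memory tier from the maxResults description.

    Assumption: 0 (unlimited) falls into the top tier.
    """
    if max_results == 0 or max_results > 2000:
        return 1024
    return 256 if max_results <= 1000 else 512
```

For example, `build_input("software engineer", max_results=200, incrementalMode=True, stateKey="software-engineer-tracker")` yields an input equivalent to the incremental tracking example on this page, plus the explicit defaults.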

Input examples

Basic search — Keyword-driven search with a result cap.

→ Full payload per result — all standard fields populated where the source provides them.

{
  "query": "software engineer",
  "maxResults": 50
}

Incremental tracking — Only emit jobs that changed since the previous run with this stateKey.

→ First run builds the baseline state. Subsequent runs emit only records that are new or whose tracked content changed. Set emitUnchanged: true to include unchanged records as well.

{
  "query": "software engineer",
  "maxResults": 200,
  "incrementalMode": true,
  "stateKey": "software-engineer-tracker"
}

Compact output for AI agents — Return only core fields for AI-agent and MCP workflows.

→ Small payload with the most important fields — ideal for piping into LLMs without token overhead.

{
  "query": "software engineer",
  "maxResults": 50,
  "compact": true
}
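
If you already hold full records and want a similarly small payload on the client side, a projection like the following can approximate it. This is a sketch under assumptions: `CORE_FIELDS` only contains the fields this page names as core (the actor's real compact payload may include others), and `to_compact` and `truncate_description` are hypothetical helpers that mimic the documented behaviour of compact and descriptionMaxLength.

```python
# Assumed core field set, taken from the fields this page names as core.
CORE_FIELDS = (
    "jobId", "title", "location", "workplaceType",
    "commitment", "seniorityLevel", "jobCategory", "salaryMin",
)

def truncate_description(text, max_length=0):
    """Trim text to max_length characters; 0 means no truncation."""
    if text is None or max_length <= 0:
        return text
    return text[:max_length]

def to_compact(record):
    """Keep only the assumed core fields; missing keys become None."""
    return {key: record.get(key) for key in CORE_FIELDS}
```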

Output

Each run produces a dataset of structured job records. Results can be downloaded as JSON, CSV, or Excel from the Dataset tab in Apify Console.

Example job record

{
  "jobId": "ef0d6041374f0ea3b7136b71a69ad6277a13df47bb504ac39e3d819745a1ad3f",
  "title": "Software Developer",
  "company": "AM Pierce & Associates",
  "location": "Patuxent River, Maryland, United States",
  "workplaceType": "Onsite",
  "commitment": "Full Time",
  "seniorityLevel": "Senior Level",
  "roleType": "Individual Contributor",
  "roleActivities": [
    "Oversee tasks",
    "Coordinate efforts",
    "Represent NAVAIR"
  ],
  "jobCategory": "Software Development",
  "description": "<p style=\"text-align: justify; line-height: normal; background: white;\"><span style=\"color: #000000;\"><strong><span style=\"font-size: 12pt; font-family: 'Verdana Pro', sans-serif;\">Who We Are:</span><...",
  "salaryMin": 108000,
  "salaryMax": 180000,
  "salaryCurrency": "USD",
  "salaryFrequency": "Yearly",
  "isCompensationTransparent": true,
  "requirementsSummary": "Senior software developer with DoD clearance, 8+ years of experience in engineering/software, plus 3+ years in acquisition software engineering; strong Interfacing and system integration skills.",
  "technicalTools": [
    "Software Development",
    "Open Architecture",
    "Systems Integration"
  ],
  "minYearsExperience": 8,
  "minManagementYears": null,
  "degreeRequirement": "Bachelors",
  "degreeFieldsOfStudy": [
    "Computer Science",
    "Software Engineering",
    "Electrical Engineering"
  ],
  "licensesOrCertifications": [
    "dod secret clearance"
  ],
  "languageRequirements": [
    "English"
  ],
  "securityClearance": "Secret",
  "driverLicenseRequired": false,
  "retirement401kMatching": true,
  "retirementPlan": true,
  "tuitionReimbursement": false,
  "generousParentalLeave": false,
  "generousPaidTimeOff": false,
  "fourDayWorkWeek": false,
  "visaSponsorship": false,
  "relocationAssistance": false,
  "fairChance": true,
  "militaryVeterans": true,
  "physicalLaborIntensity": "Low",
  "physicalPosition": "Sitting",
  "workplaceEnvironment": "Office",
  "computerUsage": "High",
  "cognitiveDemand": "High",
  "oralCommunicationLevel": "Medium",
  "overtimeRequired": false,
  "onCallRequirement": null,
  "airTravelRequirement": null,
  "landTravelRequirement": "Minimal",
  "morningShiftWork": null,
  "eveningShiftWork": null,
  "overnightWork": null,
  "weekendAvailabilityRequired": false,
  "holidayAvailabilityRequired": false,
  "positionEmployerType": "External Position",
  "workplaceCountries": [
    "US"
  ],
  "workplaceContinents": [
    "North America"
  ],
  "workplaceStates": [
    "Maryland, US"
  ],
  "workplaceCities": [
    "Patuxent River, Maryland, US"
  ],
  "workplaceCounties": [
    "Saint Marys County, Maryland, US"
  ],
  "isWorkplaceWorldwideOk": false,
  "latitude": 38.2709555,
  "longitude": -76.4369803,
  "companyName": "AM Pierce and Associates",
  "companyWebsite": "ampierce.com",
  "companySector": "Information Technology",
  "companyIndustries": [
    "Aerospace and Defense",
    "Information Technology"
  ],
  "companyActivities": [
    "Engineering",
    "Research"
  ],
  "companyTagline": "A woman-owned small business providing Engineering & Research, Cyber, C5ISR, Program & Acquisition Management services and solutions to government and industry.",
  "companyEmployeeCount": 120,
  "companyHqCountry": "US",
  "companyYearFounded": 2007,
  "companyOrganizationType": "Private",
  "companyParent": null,
  "companySubsidiaries": [
    "Applied Technologies Group"
  ],
  "companyStockExchange": null,
  "companyStockSymbol": null,
  "companyFundingType": null,
  "companyFundingYear": null,
  "companyFundingAmount": null,
  "companyFundingInvestors": null,
  "applyUrl": "https://ampierce.applicantpro.com/jobs/4034720.html",
  "portalUrl": "https://hiring.cafe/viewjob/gxxqfjhrbsdfualt",
  "sourceAts": "applicantpro",
  "postedDate": "2026-03-26T00:00:00.000Z",
  "scrapedAt": "2026-04-05T13:50:10.906Z",
  "source": "hiring.cafe",
  "changeType": null
}
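
The postedDate and scrapedAt timestamps in the record above are ISO-8601 strings in UTC, so the age of a posting at scrape time can be derived with the standard library. A minimal sketch; `parse_iso` and `posting_age_days` are hypothetical helper names:

```python
from datetime import datetime

def parse_iso(value):
    """Parse an ISO-8601 timestamp with a trailing 'Z' as UTC."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))

def posting_age_days(record):
    """Whole days between postedDate and scrapedAt, or None if either is null."""
    if record["postedDate"] is None or record["scrapedAt"] is None:
        return None
    return (parse_iso(record["scrapedAt"]) - parse_iso(record["postedDate"])).days

record = {
    "postedDate": "2026-03-26T00:00:00.000Z",
    "scrapedAt": "2026-04-05T13:50:10.906Z",
}
print(posting_age_days(record))  # 10
```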

Incremental fields

When incrementalMode: true, each record also carries:

  • changeType — one of NEW, UPDATED, UNCHANGED, REAPPEARED, EXPIRED. Default output covers NEW / UPDATED / REAPPEARED; set emitUnchanged: true or emitExpired: true to opt into the others.
  • firstSeenAt, lastSeenAt — ISO-8601 timestamps tracking the listing across runs.
  • isRepost, repostOfId, repostDetectedAt — populated when a new listing matches the tracked content of a previously expired one. Set skipReposts: true to drop detected reposts from the output.
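
Downstream code usually branches on these incremental fields. A short sketch of one way to do that, assuming records are plain dicts; `group_by_change` and `alert_candidates` are hypothetical helpers, and the "UNTRACKED" bucket for a null changeType is an arbitrary label chosen here:

```python
from collections import defaultdict

def group_by_change(records):
    """Bucket records by changeType; a null changeType goes under 'UNTRACKED'."""
    groups = defaultdict(list)
    for record in records:
        groups[record.get("changeType") or "UNTRACKED"].append(record)
    return dict(groups)

def alert_candidates(records):
    """Records worth a 'new job' alert: NEW or REAPPEARED and not a detected repost."""
    return [
        r for r in records
        if r.get("changeType") in ("NEW", "REAPPEARED") and not r.get("isRepost")
    ]
```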

How to scrape hiring.cafe

  1. Go to Hiring.Cafe Scraper in Apify Console.
  2. Enter a search keyword and optional location filter.
  3. Set maxResults to control how many results you need.
  4. Keep includeDetails enabled (it is on by default) if you need full descriptions and company data.
  5. Click Start and wait for the run to finish.
  6. Export the dataset as JSON, CSV, or Excel.

Use cases

  • Extract job data from hiring.cafe for market research and competitive analysis.
  • Track salary trends across regions and categories over time.
  • Monitor new and changed listings on scheduled runs without processing the full dataset every time.
  • Auto-apply or feed apply URLs into your ATS / hiring pipeline.
  • Research company hiring patterns, employer profiles, and industry distribution.
  • Use structured location data for regional analysis, mapping, and geo-targeting.
  • Feed structured data into AI agents, MCP tools, and automated pipelines using compact mode.
  • Export clean, structured data to dashboards, spreadsheets, or data warehouses.

How much does it cost to scrape hiring.cafe?

Hiring.Cafe Scraper uses pay-per-event pricing. You pay a small fee when the run starts and then for each result that is actually produced.

  • Run start: $0.005 per run
  • Per result: $0.0018 per job record

Example costs:

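  • 10 results: $0.02 ($0.005 + 10 × $0.0018 = $0.023)
  • 100 results: $0.18 ($0.005 + 100 × $0.0018 = $0.185)
  • 500 results: $0.91 ($0.005 + 500 × $0.0018 = $0.905)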
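
The pricing above is a fixed start fee plus a linear per-result fee, so the exact cost of a run is easy to compute. A minimal sketch using the two prices listed above; `run_cost` is a hypothetical helper, and `Decimal` is used so the cent values are not distorted by floating-point rounding:

```python
from decimal import Decimal

RUN_START = Decimal("0.005")    # per run, from the price list above
PER_RESULT = Decimal("0.0018")  # per job record, from the price list above

def run_cost(results):
    """Exact cost in USD of one run that emits `results` records."""
    return RUN_START + PER_RESULT * results

for n in (10, 100, 500):
    print(n, run_cost(n))
```

The exact values are $0.023, $0.185, and $0.905; the figures in the list are these amounts rounded to cents.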

Example: recurring monitoring savings

These examples compare full re-scrapes with incremental runs at different churn rates. Churn is the share of listings that are new or whose tracked content changed since the previous run. Actual churn depends on your query breadth, source activity, and polling frequency — the scenarios below are examples, not predictions.

Example setup: 100 results per run, daily polling (30 runs/month). Event-pricing examples scale linearly with result count.

| Churn rate | Full re-scrape run cost | Incremental run cost | Savings vs full re-scrape | Monthly cost after baseline |
|---|---|---|---|---|
| 5% (stable niche query) | $0.18 | $0.01 | $0.17 (92%) | $0.42 |
| 15% (moderate broad query) | $0.18 | $0.03 | $0.15 (83%) | $0.96 |
| 30% (high-volume aggregator) | $0.18 | $0.06 | $0.13 (68%) | $1.77 |

Full re-scrape monthly cost at daily polling: $5.55. First month with incremental costs $0.59 / $1.11 / $1.90 for the 5% / 15% / 30% scenarios because the first run builds baseline state at full cost before incremental savings apply.
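
The monthly figures above follow from the per-run prices. The sketch below reproduces them under the stated example setup (100 results per run, 30 runs per month); `monthly_costs` is a hypothetical helper, and it assumes an incremental run emits exactly the churned share of the result window.

```python
from decimal import Decimal, ROUND_HALF_UP

RUN_START = Decimal("0.005")    # per run
PER_RESULT = Decimal("0.0018")  # per job record

def run_cost(results):
    return RUN_START + PER_RESULT * results

def monthly_costs(results_per_run, churn_percent, runs=30):
    """Return (steady-state month, first month) in USD, rounded to cents."""
    full = run_cost(results_per_run)
    changed = results_per_run * churn_percent // 100  # records emitted per incremental run
    incremental = run_cost(changed)
    steady = incremental * runs                  # every run is incremental
    first = full + incremental * (runs - 1)      # first run builds the baseline at full cost
    cent = Decimal("0.01")
    return (steady.quantize(cent, ROUND_HALF_UP),
            first.quantize(cent, ROUND_HALF_UP))

for churn in (5, 15, 30):
    print(churn, monthly_costs(100, churn))
```

For comparison, a full re-scrape every day costs `run_cost(100) * 30`, which is the $5.55 quoted above.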

FAQ

How many results can I get from hiring.cafe?

The number of results depends on the search query and available listings on hiring.cafe. Use the maxResults parameter to control how many results are returned per run.

Does Hiring.Cafe Scraper support recurring monitoring?

Yes. Enable incremental mode to only receive new or changed listings on subsequent runs. This is ideal for scheduled monitoring where you want to track changes over time without re-processing the full dataset.

Can I integrate Hiring.Cafe Scraper with other apps?

Yes. Hiring.Cafe Scraper works with Apify's integrations to connect with tools like Zapier, Make, Google Sheets, Slack, and more. You can also use webhooks to trigger actions when a run completes.
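
If you consume results in your own webhook handler rather than through the built-in notifications, you may want to turn records into a chat message. The sketch below builds a JSON body with a single "text" field, which is the basic shape Slack incoming webhooks accept; `slack_payload` is a hypothetical helper, and posting the body to your webhook URL is left out.

```python
import json

def slack_payload(records, limit=5):
    """Build a webhook body summarising NEW records (at most `limit` lines)."""
    new_jobs = [r for r in records if r.get("changeType") == "NEW"]
    lines = [f"{len(new_jobs)} new job(s) on hiring.cafe"]
    for job in new_jobs[:limit]:
        lines.append(f"- {job['title']} at {job['company']}: {job['applyUrl']}")
    return json.dumps({"text": "\n".join(lines)})
```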

Can I use Hiring.Cafe Scraper with the Apify API?

Yes. You can start runs, manage inputs, and retrieve results programmatically through the Apify API. Client libraries are available for JavaScript, Python, and other languages.
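
A programmatic run might look like the sketch below. It assumes the `apify-client` Python package, whose `ApifyClient` exposes `actor(...).call(run_input=...)` and `dataset(...).iterate_items()`; `run_and_collect` is a hypothetical wrapper, and the actor ID and API token are placeholders you need to take from the actor's page and your Apify account.

```python
def run_and_collect(client, actor_id, run_input):
    """Start a run, wait for it to finish, and return its dataset items.

    `client` is expected to be an apify_client.ApifyClient instance;
    `actor_id` is a placeholder for this actor's ID.
    """
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())

# Usage (requires `pip install apify-client` and network access):
# from apify_client import ApifyClient
# client = ApifyClient("<YOUR_API_TOKEN>")
# jobs = run_and_collect(client, "<ACTOR_ID>", {"query": "software engineer", "maxResults": 50})
```

Passing the client in as an argument keeps the helper easy to test with a stub object.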

Can I use Hiring.Cafe Scraper through an MCP Server?

Yes. Apify provides an MCP Server that lets AI assistants and agents call this actor directly. Use compact mode and descriptionMaxLength to keep payloads manageable for LLM context windows.

This actor extracts publicly available data from hiring.cafe. Web scraping of public information is generally considered legal, but you should always review the target site's terms of service and ensure your use case complies with applicable laws and regulations, including GDPR where relevant.

Your feedback

If you have questions, need a feature, or found a bug, please open an issue on the actor's page in Apify Console. Your feedback helps us improve.

Getting started with Apify

New to Apify? Create a free account with $5 credit — no credit card required.

  1. Sign up — $5 platform credit included
  2. Open this actor and configure your input
  3. Click Start — export results as JSON, CSV, or Excel

Need more later? See Apify pricing.