Workday Job Scraper avatar
Workday Job Scraper

Pricing

Pay per usage

Go to Apify Store
Workday Job Scraper

Workday Job Scraper

Meet the Workday Job Scraper, a lightweight actor for extracting job data from any company's Workday portal. Fast and reliable. To ensure stable performance and avoid blocks across multiple domains, using residential proxies is highly recommended for best results.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

2

Monthly active users

8 days ago

Last modified

Share

Description

This Apify actor efficiently scrapes job listings from Workday-powered career portals. It extracts comprehensive job data including titles, locations, descriptions, and application links from Workday's job search APIs. The actor handles pagination automatically and provides flexible configuration for targeted job searches.

Features

  • Comprehensive Data Extraction: Captures all key job details from Workday APIs
  • Automatic Pagination: Handles multi-page results seamlessly
  • Flexible Filtering: Supports keyword, location, and category-based searches
  • Proxy Support: Built-in proxy rotation for reliable scraping
  • Dataset Integration: Saves results directly to Apify datasets
  • Error Handling: Robust retry mechanisms and session management

Input

Required Parameters

  • startUrl (string): The Workday API endpoint URL to start scraping from (e.g., https://company.wd1.myworkdayjobs.com/wday/cxs/company/site/jobs)

Optional Parameters

  • keyword (string): Job search keywords (e.g., "software engineer", "data analyst")
  • location (string): Geographic location filter
  • category (string): Job category filter
  • results_wanted (integer): Maximum number of jobs to collect (default: 100)
  • max_pages (integer): Maximum pages to scrape (default: 999)
  • collectDetails (boolean): Whether to collect full job data or just URLs (default: true)
  • proxyConfiguration: Proxy settings for enhanced reliability

Output

The actor outputs structured job data to an Apify dataset with the following schema:

{
"title": "Job Title",
"company": "Company Name",
"category": "Job Category",
"location": "Job Location",
"date_posted": "Posting Date",
"job_type": "Employment Type",
"description_html": "<p>Full job description in HTML</p>",
"description_text": "Plain text job description",
"job_url": "https://company.wd1.myworkdayjobs.com/job/..."
}

Usage

Basic Usage

  1. Set the Start URL: Provide the Workday API endpoint for the company you want to scrape
  2. Configure Filters: Add keywords, location, or category to narrow results
  3. Run the Actor: Execute on Apify platform or locally
  4. Access Results: Download the dataset in JSON, CSV, or other formats

Example Input

{
"startUrl": "https://workday.wd5.myworkdayjobs.com/wday/cxs/workday/Workday/jobs",
"keyword": "software engineer",
"location": "San Francisco",
"results_wanted": 50
}

Advanced Configuration

For large-scale scraping:

  • Use residential proxies for better success rates
  • Set reasonable results_wanted limits to avoid timeouts
  • Enable collectDetails for complete job information

Configuration

Proxy Settings

{
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Performance Tuning

  • Concurrency: Adjust based on target site rate limits
  • Delays: Built-in delays prevent blocking
  • Retries: Automatic retry on failures

Limits & Considerations

  • Rate Limiting: Respects target site limits with built-in delays
  • Data Freshness: Results reflect current job postings
  • API Changes: Workday may update their API structure
  • Geographic Restrictions: Some jobs may have location restrictions

Cost Optimization

  • Use targeted searches to reduce runtime
  • Set appropriate results_wanted limits
  • Choose cost-effective proxy options

Support

For issues or questions:

  • Check the input parameters match your target Workday site
  • Verify the API endpoint URL is correct
  • Ensure proxy configuration is enabled for production use