Dice Job Scraper avatar
Dice Job Scraper

Pricing

Pay per usage

Go to Apify Store
Dice Job Scraper

Dice Job Scraper

This lightweight and fast actor effortlessly scrapes job listings from Dice.com. Designed for simplicity, it extracts only the most essential data fields, giving you a clean and focused dataset. For a smooth and reliable run, using Apify Residential Proxies is strongly recommended.

Pricing

Pay per usage

Rating

5.0

(1)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

1

Bookmarked

26

Total users

16

Monthly active users

0.63 hours

Issues response

20 hours ago

Last modified

Share

Dice.com Job Scraper - Comprehensive Job Listings Extractor

Apify Actor

A powerful and efficient web scraper designed to extract detailed job listings from Dice.com. This actor provides comprehensive job data including titles, companies, locations, salaries, and full descriptions, making it ideal for job market analysis, recruitment, and research purposes.

✨ Features

  • Comprehensive Data Extraction: Captures all key job details including title, company, location, posting dates, work settings, employment types, and salaries
  • Flexible Search Parameters: Supports keyword-based searches, location filtering, and date-based job posting filters
  • Pagination Handling: Automatically navigates through multiple pages to collect the desired number of results
  • Structured Output: Saves data in clean, structured JSON format ready for analysis or integration
  • Proxy Support: Built-in proxy configuration for reliable and uninterrupted scraping
  • High Performance: Optimized for speed and efficiency while respecting website limits

📥 Input Parameters

Configure your job search with the following input fields:

FieldTypeDescriptionDefault
keywordStringJob title, skill, or keyword to search for (e.g., "Software Engineer", "Data Analyst")
locationStringGeographic location for job search (e.g., "Austin, TX", "Remote", "United States")
posted_dateStringFilter jobs by posting date (options: "1d", "3d", "7d", "14d", "30d", "all")all
results_wantedNumberMaximum number of job listings to scrape (1-1000)100
proxyConfigurationObjectProxy settings for the scraping process{ "useApifyProxy": true }

Input Configuration Example

{
"keyword": "Machine Learning Engineer",
"location": "San Francisco, CA",
"posted_date": "7d",
"results_wanted": 200,
"proxyConfiguration": {
"useApifyProxy": true,
"groups": ["RESIDENTIAL"]
}
}

📤 Output Data Structure

Each scraped job listing is stored as a structured JSON object with the following fields:

FieldTypeDescription
titleStringJob position title
companyStringHiring company name
locationStringJob location or "Remote"
postedStringOriginal posting date
updatedStringLast update date
workSettingStringWork arrangement (Remote, Hybrid, On-site)
employmentTypeStringEmployment type (Full-Time, Contract, etc.)
salaryStringSalary range or information
description_htmlStringFull job description in HTML format
description_textStringPlain text version of job description
urlStringDirect link to job posting on Dice.com
dice_idStringUnique Dice job identifier
position_idStringPosition identifier
sourceStringData source (always "dice.com")

Sample Output Record

{
"title": "Senior Python Developer",
"company": "Innovative Tech Corp",
"location": "Austin, TX",
"posted": "3 days ago",
"updated": "1 day ago",
"workSetting": "Hybrid",
"employmentType": "Full-Time",
"salary": "$120,000 - $160,000 per year",
"description_html": "<p>Join our dynamic team...</p>",
"description_text": "Join our dynamic team as a Senior Python Developer...",
"url": "https://www.dice.com/job-detail/example-job-id",
"dice_id": "ABC123XYZ",
"position_id": "POS789",
"source": "dice.com"
}

🚀 Usage Guide

Running on Apify Platform

  1. Navigate to the Apify Console
  2. Search for "Dice Job Scraper" or access the actor directly
  3. Click the "Run" button
  4. Configure your input parameters in the form
  5. Start the actor run
  6. Monitor progress and download results from the Dataset tab when complete

Local Execution

For local deployment:

  1. Ensure your development environment is set up
  2. Install project dependencies
  3. Execute the scraper with your configured parameters

⚙️ Configuration Options

Proxy Configuration

This scraper uses residential proxies by default for optimal performance and reduced blocking risk. You can customize proxy settings through the proxyConfiguration input field.

Recommended Settings:

  • Use residential proxy groups for best results
  • Enable Apify Proxy for automatic IP rotation

Performance Tuning

  • Adjust results_wanted based on your data needs (higher values may take longer)
  • Use specific posted_date filters to limit result scope and improve speed
  • Combine keywords and locations for more targeted searches

Search Tips

  • Use specific keywords like "Senior Software Engineer" instead of generic terms
  • Include location for better relevance (e.g., "New York, NY" or "Remote")
  • Recent postings (1-7 days) often yield fresher, more active opportunities

📊 Use Cases

  • Job Market Research: Analyze salary trends and job availability
  • Recruitment Automation: Build candidate databases from job postings
  • Career Planning: Track job opportunities in specific fields
  • Competitive Analysis: Monitor hiring patterns of companies
  • Data Analytics: Extract structured data for business intelligence

🔒 Data Privacy & Ethics

This scraper respects Dice.com's terms of service and implements responsible scraping practices:

  • Respects rate limits and implements appropriate delays
  • Uses legitimate proxy configurations
  • Extracts only publicly available job listing information

📞 Support & Issues

If you encounter any problems or have questions:

  • Check the Apify Community Forum for common solutions
  • Review the input configuration examples above
  • Ensure your search parameters are valid and specific

📈 Performance Metrics

  • Average Speed: Processes 50-100 job listings per minute
  • Success Rate: >95% data extraction accuracy
  • Scalability: Handles up to 1000+ results per run
  • Reliability: Built-in error handling and retry mechanisms

Built for reliability, speed, and comprehensive job data extraction from Dice.com