Dice Job Scraper
Pricing
Pay per usage
Dice Job Scraper
This lightweight and fast actor effortlessly scrapes job listings from Dice.com. Designed for simplicity, it extracts only the most essential data fields, giving you a clean and focused dataset. For a smooth and reliable run, using Apify Residential Proxies is strongly recommended.
Pricing
Pay per usage
Rating
5.0
(1)
Developer

Shahid Irfan
Actor stats
1
Bookmarked
26
Total users
16
Monthly active users
0.63 hours
Issues response
20 hours ago
Last modified
Categories
Share
Dice.com Job Scraper - Comprehensive Job Listings Extractor
A powerful and efficient web scraper designed to extract detailed job listings from Dice.com. This actor provides comprehensive job data including titles, companies, locations, salaries, and full descriptions, making it ideal for job market analysis, recruitment, and research purposes.
✨ Features
- Comprehensive Data Extraction: Captures all key job details including title, company, location, posting dates, work settings, employment types, and salaries
- Flexible Search Parameters: Supports keyword-based searches, location filtering, and date-based job posting filters
- Pagination Handling: Automatically navigates through multiple pages to collect the desired number of results
- Structured Output: Saves data in clean, structured JSON format ready for analysis or integration
- Proxy Support: Built-in proxy configuration for reliable and uninterrupted scraping
- High Performance: Optimized for speed and efficiency while respecting website limits
📥 Input Parameters
Configure your job search with the following input fields:
| Field | Type | Description | Default |
|---|---|---|---|
keyword | String | Job title, skill, or keyword to search for (e.g., "Software Engineer", "Data Analyst") | |
location | String | Geographic location for job search (e.g., "Austin, TX", "Remote", "United States") | |
posted_date | String | Filter jobs by posting date (options: "1d", "3d", "7d", "14d", "30d", "all") | all |
results_wanted | Number | Maximum number of job listings to scrape (1-1000) | 100 |
proxyConfiguration | Object | Proxy settings for the scraping process | { "useApifyProxy": true } |
Input Configuration Example
{"keyword": "Machine Learning Engineer","location": "San Francisco, CA","posted_date": "7d","results_wanted": 200,"proxyConfiguration": {"useApifyProxy": true,"groups": ["RESIDENTIAL"]}}
📤 Output Data Structure
Each scraped job listing is stored as a structured JSON object with the following fields:
| Field | Type | Description |
|---|---|---|
title | String | Job position title |
company | String | Hiring company name |
location | String | Job location or "Remote" |
posted | String | Original posting date |
updated | String | Last update date |
workSetting | String | Work arrangement (Remote, Hybrid, On-site) |
employmentType | String | Employment type (Full-Time, Contract, etc.) |
salary | String | Salary range or information |
description_html | String | Full job description in HTML format |
description_text | String | Plain text version of job description |
url | String | Direct link to job posting on Dice.com |
dice_id | String | Unique Dice job identifier |
position_id | String | Position identifier |
source | String | Data source (always "dice.com") |
Sample Output Record
{"title": "Senior Python Developer","company": "Innovative Tech Corp","location": "Austin, TX","posted": "3 days ago","updated": "1 day ago","workSetting": "Hybrid","employmentType": "Full-Time","salary": "$120,000 - $160,000 per year","description_html": "<p>Join our dynamic team...</p>","description_text": "Join our dynamic team as a Senior Python Developer...","url": "https://www.dice.com/job-detail/example-job-id","dice_id": "ABC123XYZ","position_id": "POS789","source": "dice.com"}
🚀 Usage Guide
Running on Apify Platform
- Navigate to the Apify Console
- Search for "Dice Job Scraper" or access the actor directly
- Click the "Run" button
- Configure your input parameters in the form
- Start the actor run
- Monitor progress and download results from the Dataset tab when complete
Local Execution
For local deployment:
- Ensure your development environment is set up
- Install project dependencies
- Execute the scraper with your configured parameters
⚙️ Configuration Options
Proxy Configuration
This scraper uses residential proxies by default for optimal performance and reduced blocking risk. You can customize proxy settings through the proxyConfiguration input field.
Recommended Settings:
- Use residential proxy groups for best results
- Enable Apify Proxy for automatic IP rotation
Performance Tuning
- Adjust
results_wantedbased on your data needs (higher values may take longer) - Use specific
posted_datefilters to limit result scope and improve speed - Combine keywords and locations for more targeted searches
Search Tips
- Use specific keywords like "Senior Software Engineer" instead of generic terms
- Include location for better relevance (e.g., "New York, NY" or "Remote")
- Recent postings (1-7 days) often yield fresher, more active opportunities
📊 Use Cases
- Job Market Research: Analyze salary trends and job availability
- Recruitment Automation: Build candidate databases from job postings
- Career Planning: Track job opportunities in specific fields
- Competitive Analysis: Monitor hiring patterns of companies
- Data Analytics: Extract structured data for business intelligence
🔒 Data Privacy & Ethics
This scraper respects Dice.com's terms of service and implements responsible scraping practices:
- Respects rate limits and implements appropriate delays
- Uses legitimate proxy configurations
- Extracts only publicly available job listing information
📞 Support & Issues
If you encounter any problems or have questions:
- Check the Apify Community Forum for common solutions
- Review the input configuration examples above
- Ensure your search parameters are valid and specific
📈 Performance Metrics
- Average Speed: Processes 50-100 job listings per minute
- Success Rate: >95% data extraction accuracy
- Scalability: Handles up to 1000+ results per run
- Reliability: Built-in error handling and retry mechanisms
Built for reliability, speed, and comprehensive job data extraction from Dice.com