πŸš€ Startup.jobs Scraper avatar
πŸš€ Startup.jobs Scraper

Pricing

Pay per usage

Go to Apify Store
πŸš€ Startup.jobs Scraper

πŸš€ Startup.jobs Scraper

Extract job listings from Startup.jobs, the leading platform for startup careers. This lightweight actor is designed for speed and efficiency. For seamless data extraction and to avoid blocks, using residential proxies is highly recommended.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

Startup Jobs Scraper

Extract remote job listings from Startup.jobs - the leading job board for startup and tech company positions worldwide. Get comprehensive job data including titles, companies, locations, salaries, and full descriptions.


What This Scraper Does

Startup Jobs Scraper automatically extracts job listings from Startup.jobs, providing structured data ready for analysis, integration, or personal job tracking.

How It Works (Cheap Hybrid Strategy)

  • 1) Playwright listing: Opens the results page once and extracts job detail URLs (handles dynamic rendering / Cloudflare).
  • Only URLs matching the job pattern like https://startup.jobs/<slug>-<id> are collected (company pages like /company/... are ignored).
  • The listing uses max_pages pagination and stops early once results_wanted URLs are collected.
  • 2) HTTP + Cheerio details (cheap): Fetches each job detail page with got-scraping and parses JSON-LD/HTML using Cheerio.
  • 3) Playwright fallback (only when blocked): If a detail page is Cloudflare-blocked over HTTP, loads it in Playwright and parses the HTML.

Key Capabilities

  • Remote Job Focus - Specialized for remote and distributed work opportunities
  • Comprehensive Data - Extracts titles, companies, locations, job types, salaries, and descriptions
  • Flexible Search - Filter by keywords, location, and job categories
  • Scalable Collection - Gather from 10 to 500+ listings per run
  • Production Ready - Handles rate limiting and anti-bot measures automatically

Use Cases

IndustryApplication
RecruitmentBuild candidate sourcing databases for startup roles
Job SeekersTrack and monitor new remote opportunities automatically
Market ResearchAnalyze startup hiring trends and salary benchmarks
HR AnalyticsStudy demand for skills across the startup ecosystem
Career PlatformsIntegrate startup job data into your application

Input Configuration

Basic Example - Remote Software Jobs

{
"keyword": "software engineer",
"results_wanted": 50
}

Advanced Example - Full Configuration

{
"keyword": "product manager",
"location": "Remote",
"collectDetails": true,
"results_wanted": 100,
"max_pages": 5,
"maxConcurrency": 2,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Custom URL Example

{
"startUrl": "https://startup.jobs/remote-jobs?w=remote&q=data+scientist",
"results_wanted": 30,
"collectDetails": true
}

Input Parameters

ParameterTypeRequiredDefaultDescription
startUrlStringNo-Custom Startup.jobs URL. Overrides other search parameters when set.
keywordStringNo""Search terms for job titles or skills (e.g., "react developer", "marketing manager").
locationStringNo"Remote"Location filter. Use "Remote" for remote-only positions.
collectDetailsBooleanNotrueExtract full job descriptions by visiting detail pages. Set to false for faster runs with basic data only.
results_wantedIntegerNo25Maximum jobs to extract (1-500).
max_pagesIntegerNo3Maximum listing pages to process (1-20).
maxConcurrencyIntegerNo2Concurrent browser sessions (1-5). Lower values are more reliable.
proxyConfigurationObjectNoApify ResidentialProxy settings for the scraper. Residential proxies recommended.

Output Data

Each job listing contains the following fields:

Sample Output

{
"id": "7565554",
"title": "Senior Software Engineer",
"company": "TechStartup Inc.",
"location": "Remote",
"job_type": "Full-time",
"salary": "$120,000 - $180,000",
"description_text": "We are looking for a senior software engineer to join our growing team...",
"description_html": "<div>We are looking for a senior software engineer...</div>",
"company_logo": "https://startup.jobs/cdn/logos/company.png",
"posted_at": "2 days ago",
"apply_link": "https://startup.jobs/apply/7565554",
"url": "https://startup.jobs/senior-software-engineer-techstartup-7565554",
"source": "html",
"fetched_at": "2024-01-15T10:30:00.000Z"
}

Output Fields Reference

FieldTypeDescription
idStringUnique job identifier
titleStringJob position title
companyStringHiring company name
locationStringWork location (typically "Remote")
job_typeStringEmployment type (Full-time, Part-time, Contract)
salaryStringCompensation details when available
description_textStringClean text job description
description_htmlStringFull HTML description with formatting
company_logoStringURL to company logo image
posted_atStringWhen the job was posted
apply_linkStringDirect application URL
urlStringJob detail page URL
sourceStringData extraction method
fetched_atStringTimestamp of data extraction

Performance and Cost

Use CaseJobsDetailsPagesConcurrencyEstimated Time
Quick Test10Yes12~1 minute
Standard Run50Yes32~3 minutes
Large Collection200Yes103~8 minutes
Speed Optimized100No53~2 minutes

Tips for Best Results

  1. Start with a test run using 10-25 jobs to verify configuration
  2. Enable Apify Proxy with residential IPs for reliable results
  3. Use lower concurrency (2) for more consistent data extraction
  4. Set collectDetails: false for faster runs when only basic info is needed

Integration Options

Export Formats

Download your data in multiple formats directly from Apify:

  • JSON - Structured data for applications and APIs
  • CSV - Spreadsheet-compatible format
  • Excel - Direct import to Microsoft Excel
  • XML - For legacy system integration

API Access

Access results programmatically via the Apify API:

GET https://api.apify.com/v2/datasets/{datasetId}/items

Webhooks and Scheduling

  • Webhooks - Get notified when runs complete
  • Scheduling - Automate daily, weekly, or custom schedules
  • Integration - Connect with Zapier, Make, or custom workflows

Troubleshooting

No Results Found

  • Verify the keyword matches actual job listings on Startup.jobs
  • Try broader search terms
  • Check if the website is accessible in your region

Timeout Errors

  • Reduce results_wanted and max_pages values
  • Lower maxConcurrency to 1 or 2
  • Ensure proxy configuration is enabled

Incomplete Data

  • Enable collectDetails: true for full job descriptions
  • Some jobs may have limited information posted
  • Older listings may have expired or been removed

Blocked Requests

  • Enable Apify Proxy with residential IP groups
  • Reduce concurrency to minimize detection
  • Consider running during off-peak hours
  • If Cloudflare blocks HTTP, Playwright fallback will automatically bootstrap cookies; for best reliability, use residential proxies.

This scraper extracts publicly available job listing data from Startup.jobs. Users are responsible for:

  • Complying with Startup.jobs terms of service
  • Respecting rate limits and fair use policies
  • Following applicable data protection regulations
  • Using extracted data responsibly and ethically

Support

For questions, issues, or feature requests:

  • Review the input configuration examples above
  • Check the troubleshooting section for common issues
  • Test with smaller result sets before scaling up

Keywords: startup jobs, remote jobs, tech jobs, startup careers, software engineer jobs, product manager jobs, remote work, tech hiring, job scraping, career data, employment listings, startup recruitment, tech talent