JobTeaser Job Scraper avatar
JobTeaser Job Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
JobTeaser Job Scraper

JobTeaser Job Scraper

Discover the JobTeaser Job Scraper, your efficient tool for extracting job postings and internship offers from JobTeaser. This lightweight actor is designed for quick and easy scraping. For optimal performance and to avoid blocks, using residential proxies is highly recommended.

Pricing

from $3.00 / 1,000 results

Rating

5.0

(2)

Developer

Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

5

Total users

1

Monthly active users

6 days ago

Last modified

Share

Fast job listing extraction from JobTeaser - Uses sitemap discovery and JSON-LD parsing for maximum speed and reliability.

Apify Actor Job Scraper Data Extraction

Overview

The JobTeaser Job Scraper is a high-performance Apify actor that extracts job listings from JobTeaser. It uses a sitemap-first approach for URL discovery, bypassing anti-bot measures, and prioritizes JSON-LD structured data for fast, reliable extraction.

Key Features:

  • Fast Sitemap Mode - Discover job URLs from XML sitemap (bypasses Cloudflare)
  • 🎯 Smart Filtering - Filter by keyword, location, date, and language
  • 📊 Clean JSON-LD Data - Extract structured data from schema.org JobPosting
  • 🌍 Multi-Language - Support for 15 languages (EN, FR, DE, ES, etc.)
  • 🔄 Fallback Parsing - HTML parsing when JSON-LD unavailable

How It Works

flowchart LR
A[Sitemap XML] --> B[Filter by Keyword/Date]
B --> C[Queue Detail URLs]
C --> D[Extract JSON-LD]
D --> E[Apply Location Filter]
E --> F[Save to Dataset]
  1. Sitemap Discovery - Fetches job URLs from JobTeaser's sitemap (hosted on CDN, no blocking)
  2. Pre-Filtering - Filters URLs by keyword (in URL slug) and date (via lastmod)
  3. Detail Extraction - Visits each job page to extract JSON-LD structured data
  4. Post-Filtering - Applies location filter based on extracted job data
  5. Output - Saves clean JSON data to Apify dataset

Input Parameters

ParameterTypeDefaultDescription
keywordstring""Filter jobs by keyword (e.g., 'admin', 'developer', 'marketing')
locationstring""Filter by location (e.g., 'Paris', 'Germany', 'Remote')
languageenum"en"Job listing language (en, fr, de, es, it, nl, pl, pt, sv, etc.)
posted_withinenum"any"Date filter: "any", "24h", "7d", "30d"
results_wantedinteger50Maximum number of jobs to collect
useSitemapbooleantrueUse fast sitemap mode (recommended)
collectDetailsbooleantrueFetch full job descriptions from detail pages
proxyConfigurationobjectResidentialProxy settings

Output Schema

Each job is returned as a structured JSON object:

{
"title": "Software Engineer",
"company": "Tech Company",
"location": "Paris, France",
"date_posted": "2026-01-03",
"valid_through": "2026-02-03",
"job_type": "Full-time",
"job_category": "Technology",
"salary": "EUR 55000 - 75000",
"description_html": "<p>We are looking for...</p>",
"description_text": "We are looking for...",
"job_url": "https://www.jobteaser.com/en/job-offers/...",
"source": "json-ld"
}

Field Details

  • title - Job position title
  • company - Hiring organization name
  • location - Job location (city, country)
  • date_posted - Publication date (ISO format)
  • valid_through - Job expiry date
  • job_type - Employment type (Full-time, Part-time, Internship, etc.)
  • job_category - Industry or category
  • salary - Compensation information (when available)
  • description_html - Full HTML job description
  • description_text - Plain text description
  • job_url - Direct link to the job posting
  • source - Extraction method (json-ld or html)

Usage Examples

Find Admin Jobs in France

{
"keyword": "admin",
"location": "France",
"language": "fr",
"results_wanted": 50
}

Recent Developer Jobs (Last 7 Days)

{
"keyword": "developer",
"posted_within": "7d",
"results_wanted": 100
}

German Marketing Jobs

{
"keyword": "marketing",
"language": "de",
"location": "Germany",
"results_wanted": 30
}

All Jobs Without Filtering

{
"results_wanted": 200,
"posted_within": "any"
}

Performance

ModeSpeedReliability
Sitemap + JSON-LD (default)~30-60 jobs/min⭐⭐⭐⭐⭐
Pagination + HTML~15-25 jobs/min⭐⭐⭐

The sitemap mode bypasses Cloudflare protection entirely by fetching job URLs from JobTeaser's CDN-hosted sitemap, then visiting detail pages for JSON-LD extraction.

Filtering Logic

Keyword Filtering

  • Applied during sitemap parsing
  • Matches keyword(s) in the job URL slug
  • URL format: /job-offers/UUID-company-job-title-slug
  • Example: keyword "admin" matches URLs containing "admin"

Location Filtering

  • Applied after fetching job detail pages
  • Matches against location field from JSON-LD
  • Case-insensitive partial matching
  • Example: "Paris" matches "Paris, France"

Date Filtering

  • Sitemap mode: Uses lastmod from sitemap XML
  • Pagination mode: Uses datePosted from JSON-LD
  • Options: Last 24 hours, 7 days, 30 days, or any time

Best Practices

  1. Use Sitemap Mode (default) - Fastest and most reliable
  2. Be Specific with Keywords - Reduces unnecessary requests
  3. Use Residential Proxies - Better success rate for detail pages
  4. Start Small - Test with results_wanted: 10 first

Troubleshooting

No Results Found

  • Try a broader keyword or remove keyword filter
  • Check if the language has job listings
  • Disable sitemap mode to use pagination

Slow Performance

  • Ensure sitemap mode is enabled
  • Reduce results_wanted for faster completion
  • Use datacenter proxies for sitemap (faster than residential)

Technical Notes

  • Sitemap URLs: https://assets-cf.jobteaser.com/sitemaps/job_ads_sitemap*.xml
  • Languages Supported: en, fr, de, es, it, nl, pl, pt, sv, da, fi, no, cs, ca, gl
  • Data Format: JSON-LD using schema.org JobPosting schema
  • No Browser Required: Pure HTTP + Cheerio (fast and cost-effective)

Ready to extract job data? Start scraping JobTeaser listings with this fast, reliable actor.