Pricing

from $0.59 / 1,000 jobs

Welcome to the Jungle Jobs: JSON/RAG Scraper

Scrape Welcome to the Jungle job listings with salary and location data across 50+ countries. Raw JSON or RAG-ready chunks.

Pricing

from $0.59 / 1,000 jobs

Rating

0.0

(0)

Developer

GetAScraper

Actor stats

Bookmarked

Total users

Monthly active users

9 days ago

Last modified

🐒 WTTJ Jobs Scraper: Raw JSON or RAG-Ready Chunks

Extract job listings from Welcome to the Jungle with two output modes: raw JSON or RAG-ready chunks. Get salary ranges, company data, locations, contract type, and remote policy across France, UK, US, Germany, and 50+ countries. Drop raw output into spreadsheets or pipelines. Use RAG-ready chunks directly in Qdrant, Pinecone, Weaviate, LangChain, or LlamaIndex for AI-powered job matching and resume screening.

🔍 What does Welcome to the Jungle Scraper do?

This Actor queries WTTJ's public Algolia search API to extract structured job data. It supports two output modes:

Raw JSON - Full job listings with title, company, salary, locations, contract type, remote policy, summary, and sectors. Ready for spreadsheets, CRMs, or data warehouses.
RAG-Ready - Job descriptions tokenized into fixed-size chunks using tiktoken cl100k_base. Configurable chunk size (default 512 tokens) with overlap. Drop straight into any vector database for LLM-based job matching, resume screening, or market intelligence.

The scraper covers 10,000+ companies across Europe and the US. No login, no API key, no proxy required.

💡 Why use Welcome to the Jungle Scraper for RAG?

Dual output modes. Raw JSON for traditional pipelines, RAG-ready chunks for AI agents and vector databases. One Actor handles both workflows.
RAG-ready chunks. Job descriptions pre-split for LLM ingestion. No LaTeX stripping, no HTML parsing, no custom chunking logic on your side. Works with Qdrant, Pinecone, Weaviate, pgvector, and Chroma.
Framework-ready. Raw output drops into Google Sheets, BigQuery, n8n, or any CRM. RAG chunks drop into LangChain, LlamaIndex, Haystack, or custom LLM pipelines.
Europe plus US coverage. France, UK, Germany, Spain, and 50+ countries in one query. Compare salary ranges across markets.
Rich job metadata. Salary ranges (EUR, GBP, USD), remote policy, contract type, experience level, company size, and hiring velocity.
No auth required. Uses WTTJ's public Algolia API. No cookies, no API key, no account.

🚀 How to use Welcome to the Jungle Scraper

Open the Actor in Apify Console.
Set a search query (e.g., data engineer, product manager, sales).
Choose output mode: raw for standard JSON, rag for chunked text.
Optional: Add filters for country, city, contract type, remote policy.
Set max results (default 100, max 10,000).
Click Start. Download as JSON, CSV, or Excel.

📥 Input

Field	Type	Description
`query`	string	Job title, skill, or keyword to search (required).
`locale`	string	Language: `en` or `fr`. Default: `en`.
`countryCodes`	array	Filter by country codes (e.g., `FR`, `GB`, `US`).
`locations`	array	Filter by city or region (e.g., `Paris`, `London`).
`contractTypes`	array	Filter by contract type (`FULL_TIME`, `PART_TIME`, `INTERNSHIP`, `FREELANCE`).
`remotePolicies`	array	Filter by remote policy (`full`, `partial`, `punctual`, `no`).
`categories`	array	Filter by job category (`tech`, `data`, `sales`, `marketing`).
`maxItems`	integer	Maximum jobs to return. Default: 100. Max: 10,000.
`outputMode`	string	`raw` (full JSON) or `rag` (chunked text). Default: `raw`.
`chunkSize`	integer	Target tokens per chunk for RAG mode. Default: 512.
`chunkOverlap`	integer	Token overlap between chunks for RAG mode. Default: 50.

📤 Output

Raw Mode Example

{
    "jobId": "2d4fbe25-352d-47a4-8280-bd6d4642cfb1",
    "title": "Data analytics engineer",
    "company": {
        "name": "Visian",
        "slug": "visian",
        "size": "50-249 employees",
        "employeeCount": 200,
        "description": "Consulting et execution technique pour projets data."
    },
    "locations": [
        { "city": "Courbevoie", "country": "France", "countryCode": "FR", "region": "Ile-de-France" }
    ],
    "primaryLocation": "Courbevoie, France",
    "contractType": "full_time",
    "remotePolicy": "partial",
    "salary": {
        "hasSalary": true,
        "min": 42000,
        "max": 55000,
        "yearlyMinimum": 42000,
        "currency": "EUR",
        "period": "yearly"
    },
    "publishedAt": "2026-06-04T09:39:54Z",
    "summary": "Rejoignez Visian, une societe de conseil specialisee en innovation...",
    "sectors": ["Artificial Intelligence / Machine Learning", "IT / Digital"],
    "url": "https://www.welcometothejungle.com/en/jobs/data-engineer-visian-xxx",
    "scrapedAt": "2026-06-06T10:00:00.000Z"
}

RAG Mode Example

{
    "jobId": "2d4fbe25-352d-47a4-8280-bd6d4642cfb1",
    "title": "Data analytics engineer",
    "company": { "name": "Visian", "slug": "visian" },
    "primaryLocation": "Courbevoie, France",
    "chunks": [
        { "idx": 0, "text": "Job Title: Data analytics engineer\nCompany: Visian\nLocation: Courbevoie, France\n\nRejoignez Visian...", "tokens": 256 },
        { "idx": 1, "text": "...SQL, Databricks, Git et Power BI...", "tokens": 128 }
    ],
    "scrapedAt": "2026-06-06T10:00:00.000Z"
}

📊 Data Table

Field	Description
`jobId`	WTTJ unique job identifier.
`title`	Job title as posted.
`company.name`	Hiring company name.
`company.slug`	Company URL slug.
`company.size`	Company size category (e.g., "50-249 employees").
`company.employeeCount`	Total employees when available.
`locations`	Array of location objects with city, country, countryCode, region.
`primaryLocation`	Primary location as formatted string.
`contractType`	Contract type enum (full_time, part_time, internship, freelance).
`remotePolicy`	Remote work policy (full, partial, punctual, no, unknown).
`salary`	Salary object with min, max, currency, period when disclosed.
`publishedAt`	ISO date of job posting.
`summary`	Short job summary or description.
`sectors`	Array of job category sectors.
`url`	Canonical job URL on welcometothejungle.com.
`chunks`	RAG mode only: array of `{ idx, text, tokens }` chunks.
`scrapedAt`	ISO timestamp of scrape.

💰 Pricing

$3.99 per 1,000 job listings (pay-per-result).

Volume	Estimated cost
100 jobs	$0.40
1,000 jobs	$3.99
10,000 jobs	$39.90

No subscription. No minimum. You pay only for successful records.

⭐ Enjoying WTTJ Jobs Scraper?

⭐ ⭐ ⭐ ⭐ ⭐
Getting job data pre-chunked for a vector database instead of writing your own tokenizer and cleanup logic?
A 5-star rating takes 10 seconds and helps other recruiters and RAG pipeline builders find it. Your feedback also tells us what to build next.

★ Rate this Actor on Apify

🛠️ Tips

Use RAG mode for AI agents. Feed chunks directly into a vector database for job matching, resume screening, or market intelligence. Compatible with LangChain, LlamaIndex, Qdrant, Pinecone, and Weaviate.
Filter by country first. WTTJ has different job densities per market. France and UK have the most listings.
Combine with salary filters. Many jobs disclose salary ranges. Filter salary.hasSalary: true downstream for benchmarking.
Schedule weekly for market tracking. Compare job counts and salary ranges over time to detect hiring trend shifts.

❓ FAQ, Disclaimers, and Support

Is scraping Welcome to the Jungle legal? This Actor uses WTTJ's public Algolia search API, which requires no authentication. It collects publicly visible job listing data for research, market analysis, and recruiting intelligence. Users are responsible for ensuring their use complies with applicable laws and WTTJ's terms of service.

Why are some salary fields empty? Many jobs on WTTJ do not disclose salary. The Actor only saves salary values that the source exposes. Filter salary.hasSalary: true downstream if salary data is critical.

What is RAG mode? RAG (Retrieval-Augmented Generation) mode splits job descriptions into fixed-token chunks using tiktoken cl100k_base encoding. These chunks are ready to embed and store in a vector database for LLM-based job matching or resume screening.

How is this different from other WTTJ scrapers? This is the only WTTJ scraper with dual output modes: raw JSON for traditional pipelines and RAG-ready chunks for AI agents. No other scraper offers tiktoken-based chunking optimized for LLM ingestion.

Support: Open an issue on the Actor's Issues tab in Apify Console for bug reports, feature requests, or custom-run quotes.

Built with Apify + Crawlee + TypeScript. Part of the actorstack portfolio.

🔗 Other actors

NoFluffJobs Scraper: Tech Jobs & Salaries ↗ - extracts tech job listings and salaries from NoFluffJobs.
CWJobs Scraper: UK Tech Jobs, Salaries & Geo ↗ - collects UK tech job listings, salaries, and location data.
S1 Jobs Scraper: Scotland Jobs, Salaries & Employers ↗ - pulls Scotland job listings, salaries, and employer data.
AllJobs Scraper: Israel Job Listings ↗ - gathers job listings from Israel's AllJobs board.
AMS Austria Jobs Scraper: Bulk Austrian Job Leads ↗ - extracts job listings and employer contacts from Austria's AMS portal.

Welcome to the Jungle Jobs Scraper

benthepythondev/welcome-to-the-jungle-jobs-scraper

Scrape Welcome to the Jungle jobs with title, company, location, remote policy, salary, contract, description, application URL and company intelligence.

Ben

Welcome to the Jungle Job Scraper

codingfrontend/jungle-job-scraper

Scrape job listings from Welcome to the Jungle with salaries, company information, benefits, and full job descriptions. Perfect for recruitment analysis.

Coding Frontned

Welcome to the Jungle Jobs Scraper

crawlerbros/welcome-to-the-jungle-scraper

Scrape job listings from Welcome to the Jungle - the leading French job board with 100K+ listings. Search by keyword, location, contract type, and remote preference. No account required.

Crawler Bros

Welcome to the Jungle Jobs Scraper

crawlergang/welcome-to-the-jungle-scraper

Scrape job listings from Welcome to the Jungle - the leading French job board with 100K+ listings. Search by keyword, location, contract type, and remote preference. No account required.

Crawler Gang

5.0

Welcome To The Jungle Scraper

silentflow/welcome-to-the-jungle-scraper

Scraper for Welcome to the Jungle — Europe's #1 job board. Extract job postings and company profiles with salary ranges, contract types, remote options, and 50+ structured fields. Supports keyword search, location, and 7 markets.

SilentFlow

Welcome to the Jungle Jobs Scraper

maximedupre/welcometothejungle-jobs-extractor

Scrape Welcome to the Jungle job listings from keywords, WTTJ search URLs, and job filters. Export titles, companies, locations, salaries, remote policy, contract type, source URLs, and optional job details.

Maxime Dupré

Welcome to the Jungle Scraper

sourabhbgp/welcome-to-the-jungle-scraper

Scrape Welcome to the Jungle job listings by keyword, country, contract type and remote policy, plus full company profiles. Get salary, sectors, benefits, key missions, the role summary and candidate profile text for every job. Export to JSON, CSV or Excel.

Sourabh Kumar

Welcome to the Jungle Scraper

unfenced-group/welcometothejungle-scraper

Extract job listings from Welcome to the Jungle — France, UK, US, and 3 more markets. Salary data on 92% of listings, no proxy needed. $0.99/1,000 results. Keyword search, remote & contract filters.

Unfenced Group

5.0

Welcome to the Jungle Scraper - WTTJ Jobs Fast Export

clearpath/welcome-to-the-jungle-jobs-api

Extract job listings from Welcome to the Jungle (WTTJ) with filters for location, salary, remote work, contract type, experience level, and industry. Get full job details including company info, application URLs, required skills, and benefits. Export to JSON, CSV, or Excel.

ClearPath

407

Jungle Job Scraper 🌴

easyapi/jungle-job-scraper

Efficiently scrape job listings from Welcome to the Jungle with comprehensive details including salaries, company information, and benefits. Perfect for recruitment analysis, market research, and job market monitoring. 🌴✨