# Cadremploi.fr Jobs Scraper (`shahidirfan/cadremploi-fr-jobs-scraper`) Actor

Harvest job opportunities from Cadremploi.fr with precision. Scrape job titles, companies, salaries, locations & descriptions instantly. Built for recruiters, market researchers & talent teams. Real-time data extraction with pagination support. Start analyzing French job market today.

- **URL**: https://apify.com/shahidirfan/cadremploi-fr-jobs-scraper.md
- **Developed by:** [Shahid Irfan](https://apify.com/shahidirfan) (community)
- **Categories:** Jobs, Automation, Developer tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Cadremploi.fr Jobs Scraper

Extract fresh job listings from Cadremploi.fr with flexible filters and clean, analysis-ready output. Collect key fields such as title, company, location, contract, salary, and offer URL for market research, recruiting intelligence, and job monitoring workflows.

### Features

- **Cadremploi job extraction** — Collect listings from Cadremploi.fr at scale
- **HTTP-only extraction path** — Uses Cadremploi APIs through got-scraping (no browser startup delay)
- **URL or keyword workflow** — Start from a listing URL or use keyword and location filters
- **Rich offer detail coverage** — Capture full descriptions, taxonomy fields, exact dates, and application URLs
- **Pagination controls** — Limit extraction with maximum pages and maximum results
- **Clean dataset output** — Empty/null/invalid scalar values are normalized for consistent records
- **Automation-ready output** — Data can be exported in JSON, CSV, Excel, XML, and consumed in integrations

### Use Cases

#### Talent Market Research
Track active hiring demand by role, location, and contract type. Build recurring snapshots to compare market movement over time.

#### Recruitment Intelligence
Monitor competitor hiring activity and identify recurring position profiles. Use structured data to support sourcing strategy and pipeline planning.

#### Job Aggregation Pipelines
Feed listings into internal systems, dashboards, or search tools. Use consistent output fields to simplify downstream processing.

#### Geo-Focused Opportunity Tracking
Collect roles for targeted cities or areas to analyze local opportunity concentration. Useful for relocation studies and regional job trend reporting.

### Input Parameters

| Parameter | Type | Required | Default | Description |
|-----------|------|----------|---------|-------------|
| `url` | String | No | `"https://www.cadremploi.fr/emploi/liste_offres"` | Cadremploi listing URL (can include filters) |
| `keyword` | String | No | `"infirmier"` | Keyword filter that takes priority over URL keyword |
| `location` | String | No | `"Paris"` | Location text used to apply location filtering |
| `results_wanted` | Integer | No | `20` | Maximum number of jobs to save |
| `max_pages` | Integer | No | `10` | Maximum number of result pages to process |
| `proxyConfiguration` | Object | No | Disabled (`useApifyProxy: false`) | Optional proxy setup (enable only when needed) |

### Output Data

Each item in the dataset contains:

| Field | Type | Description |
|-------|------|-------------|
| `id` | String | Cadremploi offer ID |
| `title` | String | Job title |
| `url` | String | Absolute job offer URL |
| `snippet` | String | Short listing summary |
| `description_text` | String | Combined detailed offer description text |
| `description_html` | String | HTML-formatted detailed description |
| `contract` | String | Contract type |
| `contract_code` | Number | Contract code from source taxonomy |
| `work_type` | String | Work type label when available |
| `location` | String | Full job location label |
| `city` | String | City extracted from detail payload |
| `postal_code` | String | Postal code |
| `region` | String | Region label |
| `country` | String | Country code |
| `salary` | String | Salary summary from listing |
| `salary_min` | Number | Minimum salary |
| `salary_max` | Number | Maximum salary |
| `salary_currency` | String | Salary currency |
| `published_at` | String | Relative publication text |
| `published_date` | String | Exact publication date |
| `expires_at` | String | Offer expiration date |
| `company_name` | String | Company or recruiter name |
| `company_description` | String | Company description |
| `company_type` | String | Company type when available |
| `company_logo` | String | Company logo URL |
| `company_logo_large` | String | Large company logo URL |
| `apply_type` | String | Application channel |
| `apply_url` | String | Application destination URL |
| `is_external_offer` | Boolean | External source flag |
| `has_already_applied` | Boolean | Application state when available |
| `offer_badges` | Array | Offer badges shown on listing/detail |
| `recruiter_badges` | Array | Recruiter badges shown on listing/detail |
| `job_functions` | Array | Function labels linked to the offer |
| `job_function_codes` | Array | Function codes |
| `industries` | Array | Industry labels linked to the offer |
| `industry_codes` | Array | Industry codes |
| `experience_level` | String | Experience level label |
| `accepts_junior` | Boolean | Junior profile accepted flag |
| `occupation_classification_slug` | String | Occupation classification slug |
| `occupation_classification_label` | String | Occupation classification label |
| `status_by_date` | String | Offer status by date |
| `source` | String | Source identifier |
| `search_type` | String | Search mode metadata |
| `search_id` | String | Search identifier metadata |
| `total_results` | Number | Total results reported by source |
| `source_page` | Number | Source page index |

### Usage Examples

#### Basic Run

```json
{
  "url": "https://www.cadremploi.fr/emploi/liste_offres",
  "results_wanted": 20
}
````

#### Keyword and Location

```json
{
  "keyword": "data analyst",
  "location": "Lyon",
  "results_wanted": 50,
  "max_pages": 10
}
```

#### Filtered URL Input

```json
{
  "url": "https://www.cadremploi.fr/emploi/liste_offres?dep=75&tyc=1",
  "results_wanted": 100,
  "max_pages": 20
}
```

### Sample Output

```json
{
  "id": "156535427459969552",
  "title": "Referent Qualite et Certifications H/F",
  "url": "https://www.cadremploi.fr/emploi/detail_offre?offreId=156535427459969552",
  "snippet": "Suite au depart d'un collaborateur experimente...",
  "description_text": "Faites le choix d'une entreprise en mouvement...",
  "description_html": "<p>Faites le choix d'une entreprise en mouvement...</p>",
  "contract": "CDI",
  "location": "Roissy-en-France",
  "city": "Roissy-en-France",
  "salary_min": 45000,
  "salary_max": 52000,
  "salary_currency": "EUR",
  "published_at": "Publiee il y a quelques minutes",
  "published_date": "2026-04-26T10:14:00+02:00",
  "company_name": "PLEIADE CONSULTING",
  "apply_url": "https://www.cadremploi.fr/...",
  "job_functions": ["Qualité", "Opérations"],
  "apply_type": "formulaire",
  "source_page": 1
}
```

### Tips for Best Results

#### Prefer Real Cadremploi Listing URLs

Use valid listing pages from Cadremploi.fr to ensure robust extraction.

#### Start Small, Then Scale

Begin with `results_wanted: 20` to validate filters, then increase limits for production runs.

#### Use Location Text Carefully

Use clear city or area names (for example `Paris`, `Lyon`, `Marseille`) for more consistent location filtering.

#### Keep Pagination Balanced

Set `max_pages` high enough for your goal, but avoid overly large values for fast operational runs.

#### Proxy Configuration

Proxy is optional and disabled by default for speed. Enable it only when needed (for example if your network gets blocked):

```json
{
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
```

### Integrations

Connect output to:

- **Google Sheets** — Build sortable hiring trackers
- **Airtable** — Create searchable recruitment datasets
- **Make** — Automate no-code workflows
- **Zapier** — Trigger alerts and follow-up actions
- **Webhooks** — Send data to custom services

#### Export Formats

- **JSON** — API and engineering workflows
- **CSV** — Spreadsheet analytics
- **Excel** — Business reporting
- **XML** — System integration pipelines

### Frequently Asked Questions

#### How many jobs can I collect?

You can collect as many as needed by increasing `results_wanted` and `max_pages`, within source availability.

#### Which input has priority, URL or keyword/location?

User-provided keyword and location are prioritized when supplied. If not supplied, URL filters are used.

#### Why are some fields missing in certain records?

Some listings do not expose every field from the source APIs. The actor applies fallbacks and normalization to avoid null/undefined values whenever possible.

# Actor input Schema

## `url` (type: `string`):

Cadremploi listing URL, for example https://www.cadremploi.fr/emploi/liste\_offres or a filtered listing URL.

## `keyword` (type: `string`):

Optional keyword. When provided, it overrides keyword values from the URL.

## `location` (type: `string`):

Optional location text such as Paris, Lyon, or Marseille.

## `results_wanted` (type: `integer`):

Maximum number of jobs to save.

## `max_pages` (type: `integer`):

Safety cap for API pages to fetch.

## `proxyConfiguration` (type: `object`):

Optional. Leave disabled for maximum speed, or enable Apify Proxy when your network gets blocked.

## Actor input object example

```json
{
  "url": "https://www.cadremploi.fr/emploi/liste_offres",
  "keyword": "infirmier",
  "location": "Paris",
  "results_wanted": 20,
  "max_pages": 10,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
```

# Actor output Schema

## `overview` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "url": "https://www.cadremploi.fr/emploi/liste_offres",
    "keyword": "infirmier",
    "location": "Paris",
    "results_wanted": 20,
    "max_pages": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("shahidirfan/cadremploi-fr-jobs-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "url": "https://www.cadremploi.fr/emploi/liste_offres",
    "keyword": "infirmier",
    "location": "Paris",
    "results_wanted": 20,
    "max_pages": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("shahidirfan/cadremploi-fr-jobs-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "url": "https://www.cadremploi.fr/emploi/liste_offres",
  "keyword": "infirmier",
  "location": "Paris",
  "results_wanted": 20,
  "max_pages": 10
}' |
apify call shahidirfan/cadremploi-fr-jobs-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=shahidirfan/cadremploi-fr-jobs-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Cadremploi.fr Jobs Scraper",
        "description": "Harvest job opportunities from Cadremploi.fr with precision. Scrape job titles, companies, salaries, locations & descriptions instantly. Built for recruiters, market researchers & talent teams. Real-time data extraction with pagination support. Start analyzing French job market today.",
        "version": "0.0",
        "x-build-id": "gcO529Z8teCj20CzU"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/shahidirfan~cadremploi-fr-jobs-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-shahidirfan-cadremploi-fr-jobs-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/shahidirfan~cadremploi-fr-jobs-scraper/runs": {
            "post": {
                "operationId": "runs-sync-shahidirfan-cadremploi-fr-jobs-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/shahidirfan~cadremploi-fr-jobs-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-shahidirfan-cadremploi-fr-jobs-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "url": {
                        "title": "Start URL",
                        "type": "string",
                        "description": "Cadremploi listing URL, for example https://www.cadremploi.fr/emploi/liste_offres or a filtered listing URL."
                    },
                    "keyword": {
                        "title": "Keyword",
                        "type": "string",
                        "description": "Optional keyword. When provided, it overrides keyword values from the URL."
                    },
                    "location": {
                        "title": "Location",
                        "type": "string",
                        "description": "Optional location text such as Paris, Lyon, or Marseille."
                    },
                    "results_wanted": {
                        "title": "Maximum number of jobs",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Maximum number of jobs to save.",
                        "default": 20
                    },
                    "max_pages": {
                        "title": "Maximum pages",
                        "minimum": 1,
                        "type": "integer",
                        "description": "Safety cap for API pages to fetch.",
                        "default": 10
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Optional. Leave disabled for maximum speed, or enable Apify Proxy when your network gets blocked.",
                        "default": {
                            "useApifyProxy": false
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
