Recherche Entreprises Scraper avatar

Recherche Entreprises Scraper

Under maintenance

Pricing

$10.00/month + usage

Go to Apify Store
Recherche Entreprises Scraper

Recherche Entreprises Scraper

Under maintenance

Extract comprehensive French company data from data-gouv.fr. Search companies using filters (activity, creation date, revenue, location) with automatic pagination. Enriches data with additional information from annuaire-entreprises.data.gouv.fr including legal details, directors, and financial data.

Pricing

$10.00/month + usage

Rating

0.0

(0)

Developer

Corentin Robert

Corentin Robert

Maintained by Community

Actor stats

0

Bookmarked

12

Total users

3

Monthly active users

4 days ago

Last modified

Share

Recherche Entreprises — Search & Enrich French Companies

Search and enrich French company data from the official recherche-entreprises.api.gouv.fr API. No API key required. No scraping — 100% legal, direct API calls.


What it does

Two modes in one Actor:

ModeInputWhat happens
Search URLOne or more URLs from Pappers.fr or recherche-entreprises.data.gouv.frFilters are extracted from each URL. The Actor runs a paginated search for each URL and returns all matching companies. With multiple URLs, results are deduplicated by SIREN.
SIREN listList of 9-digit SIREN numbersThe Actor fetches full company data for each SIREN from the API. SIRENs with fewer digits (e.g. 58801879) are auto-padded with leading zeros. Duplicates are removed.

Input

Mode: Search URL

Paste one or more URLs — one per line:

{
"mode": "searchUrl",
"searchUrls": [
"https://www.pappers.fr/recherche?ville=74160&en_activite=true",
"https://recherche-entreprises.data.gouv.fr/search?departement=75&activite_principale=62.01Z"
]
}

Results are deduplicated by SIREN across all URLs.

URL parameters extracted:

SourceParameters
recherche-entreprises.data.gouv.frq, activite_principale, etat_administratif, departement, region, code_postal, commune, categorie_entreprise, est_association, est_entrepreneur_individuel, est_organisme_formation, est_ess
Pappers.frSame, plus: villecommune, en_activite=trueetat_administratif=A

Unsupported params (e.g. age_dirigeant_min) are ignored.

Mode: SIREN list

{
"mode": "sirens",
"sirens": ["732829320", "552032534", "380129866"]
}

Aliases: mode: "enrich" also works.

  • One SIREN per line (9 digits)
  • 8-digit numbers (e.g. 58801879) are padded to 058801879
  • Automatic deduplication

Output

Each company is one row in the dataset. Fields:

CategoryFields
Identitysiren, nom_complet, nom_raison_sociale, date_creation, etat_administratif
Activityactivite_principale, libelle_activite_principale, categorie_entreprise
Addressadresse, code_postal, ville, departement, departement_nom, region, region_nom, latitude, longitude
Legalforme_juridique, nature_juridique, tva_intracommunautaire, siret_siege
Employeeseffectif_salarie, caractere_employeur
Directorsdirigeants (text), dirigeant_1dirigeant_5, role_1role_5, dirigeant_N_date_naissance, dirigeant_N_nationalite
Financialschiffre_affaires, resultat_net, annee_finances
Establishmentsnombre_etablissements, nombre_etablissements_ouverts
Extrasconvention_collective, est_entrepreneur_individuel, est_organisme_formation, donnees_diffusibles
  • Commissaires aux comptes are excluded from the directors list.
  • Empty fields are omitted from the output (no "").
  • donnees_diffusibles: false when data contains [NON-DIFFUSIBLE].
  • Booleans: est_entrepreneur_individuel and est_organisme_formation are real booleans.

When enrichment fails for a SIREN, the row includes _error with the error message.


Data source

All data comes from the official French government API recherche-entreprises.api.gouv.fr. Public data, updated daily by INSEE and commercial court registries. No API key required.


Local development

npm install
# Edit input.json with your input
apify run

Example inputs: input-search-url.json, input-enrich.json.


Support

Questions or issues: corentin@outreacher.fr