French Legal Notices Scraper (Mentions Légales) avatar

French Legal Notices Scraper (Mentions Légales)

Pricing

from $5.00 / 1,000 results

Go to Apify Store
French Legal Notices Scraper (Mentions Légales)

French Legal Notices Scraper (Mentions Légales)

Extract SIRET, SIREN, VAT number, company name, manager, emails & phones from the "mentions légales" pages of French websites. Feed it domains, get a CRM-ready B2B database. Luhn-validated identifiers.

Pricing

from $5.00 / 1,000 results

Rating

0.0

(0)

Developer

D N

D N

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

21 hours ago

Last modified

Share

French Legal Notices Scraper (Mentions Légales) 🇫🇷

Extract SIRET, SIREN, VAT number, company name, legal form, manager name, emails, phone numbers and registered address from any French website — automatically.

Every French website is legally required to publish a "mentions légales" page containing the company's official identification data. This Actor finds that page for you and turns it into clean, structured B2B data.

Give it a list of domains. Get back a ready-to-use company database.

💡 What can you do with it?

  • B2B lead enrichment — turn a list of prospect websites into a CRM-ready file with SIRET, decision-maker name, email and phone.
  • Sales prospecting in France — qualify local businesses (plumbers, restaurants, agencies…) at scale.
  • KYB / compliance checks — verify that a website belongs to a real registered company (SIREN validated with the Luhn algorithm).
  • Market research — map legal forms, locations and hosting providers across an industry.
  • Agency workflows — enrich Google Maps scraping results with official company identifiers.
  • Enrich further — pipe the extracted SIRETs into the French Company Data Enricher to get directors' names, headcount and financials.

⚙️ How it works

  1. For each input domain, the Actor loads the homepage and looks for a link to the legal notice page (mentions légales, legal notice, informations légales…).
  2. If no link is found, it probes 14 common French URL patterns (/mentions-legales, /mentions_legales.html, /pages/mentions-legales, …) with a soft-404 guard.
  3. It parses the page and extracts structured data with France-specific logic:
    • SIRET (14 digits) and SIREN (9 digits), both validated with the Luhn checksum
    • VAT number (TVA intracommunautaire), RCS city and number, NAF/APE code
    • Company name, legal form (SARL, SAS, SASU, EURL, SCI…), share capital
    • Manager (gérant, président) and publication director
    • Emails (including de-obfuscation of contact [at] domain [dot] fr), French phone numbers
    • Registered address, postal code, city, and hosting provider
  4. One dataset item is produced per domain, with a clear status: found, legal_notice_not_found, or unreachable.

No headless browser: the Actor uses fast, lightweight HTTP crawling, so runs are quick and cheap.

📥 Input

{
"startUrls": [
{ "url": "example-company.fr" },
{ "url": "https://www.another-company.fr" }
],
"maxConcurrency": 10,
"proxyConfiguration": { "useApifyProxy": true }
}

Plain domains work — no need to find the legal notice URL yourself.

📤 Output example

{
"domain": "exemple-plomberie.fr",
"status": "found",
"legalNoticeUrl": "https://exemple-plomberie.fr/mentions-legales",
"companyName": "PLOMBERIE MARTIN",
"legalForm": "SARL",
"siren": "732829320",
"sirenLuhnValid": true,
"siret": "73282932000074",
"siretLuhnValid": true,
"tvaIntra": "FR40732829320",
"rcsCity": "Lille Métropole",
"rcsNumber": "732829320",
"naf": "4322A",
"capital": "10 000 €",
"managerName": "Jean Martin",
"publicationDirector": "Jean Martin",
"emails": ["contact@exemple-plomberie.fr"],
"phones": ["0320123456"],
"address": "12 rue des Artisans, 59000 Lille",
"postalCode": "59000",
"city": "Lille",
"hostingProvider": "OVH",
"scrapedAt": "2026-07-03T10:00:00.000Z"
}

Export to CSV, Excel, JSON or connect directly to Make, Zapier, Google Sheets and 5,000+ tools via Apify integrations.

❓ FAQ

Is this legal? The Actor only reads information that French law (LCEN, art. 6-III) requires companies to publish publicly on their own websites. You remain responsible for how you use the data, in particular for GDPR compliance of your outreach.

What about JavaScript-heavy websites? This version uses fast HTTP crawling. Sites that render their entire content with client-side JavaScript (rare for legal notice pages) may return legal_notice_not_found.

How accurate is the extraction? SIREN/SIRET numbers are checksum-validated (Luhn), so false positives are flagged. Free-text fields (company name, manager) use multiple extraction strategies but depend on how each site wrote its legal notice.

Can I scrape thousands of domains? Yes — that's the point. Concurrency is configurable and one result costs a fraction of a cent.


Keywords: mentions légales scraper, SIRET scraper, SIREN extractor, French company data, B2B leads France, KBIS alternative, TVA intracommunautaire, French business scraper, extraction mentions légales, enrichissement de leads.