French Legal Notices Scraper (Mentions Légales)
Pricing
from $5.00 / 1,000 results
Extract SIRET, SIREN, VAT number, company name, manager, emails & phones from the "mentions légales" pages of French websites. Feed it domains, get a CRM-ready B2B database. Luhn-validated identifiers.
Developer
D N
French Legal Notices Scraper (Mentions Légales) 🇫🇷
Extract SIRET, SIREN, VAT number, company name, legal form, manager name, emails, phone numbers and registered address from any French website — automatically.
French law requires websites operated by businesses to publish a "mentions légales" page containing the company's official identification data. This Actor finds that page for you and turns it into clean, structured B2B data.
Give it a list of domains. Get back a ready-to-use company database.
💡 What can you do with it?
- B2B lead enrichment: turn a list of prospect websites into a CRM-ready file with SIRET, decision-maker name, email and phone.
- Sales prospecting in France: qualify local businesses (plumbers, restaurants, agencies…) at scale.
- KYB / compliance checks: check that a website publishes a well-formed company identifier (SIREN validated with the Luhn algorithm). The checksum confirms that the number is plausible, not that the company is currently registered, so treat it as a first filter.
- Market research: map legal forms, locations and hosting providers across an industry.
- Agency workflows: enrich Google Maps scraping results with official company identifiers.
- Enrich further: pipe the extracted SIRETs into the French Company Data Enricher to get directors' names, headcount and financials.
⚙️ How it works
- For each input domain, the Actor loads the homepage and looks for a link to the legal notice page (mentions légales, legal notice, informations légales…).
- If no link is found, it probes 14 common French URL patterns (`/mentions-legales`, `/mentions_legales.html`, `/pages/mentions-legales`, …) with a soft-404 guard.
- It parses the page and extracts structured data with France-specific logic:
  - SIRET (14 digits) and SIREN (9 digits), both validated with the Luhn checksum
  - VAT number (TVA intracommunautaire), RCS city and number, NAF/APE code
  - Company name, legal form (SARL, SAS, SASU, EURL, SCI…), share capital
  - Manager (gérant, président) and publication director
  - Emails (including de-obfuscation of `contact [at] domain [dot] fr`), French phone numbers
  - Registered address, postal code, city, and hosting provider
- One dataset item is produced per domain, with a clear `status`: `found`, `legal_notice_not_found`, or `unreachable`.
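The email de-obfuscation step mentioned above can be pictured with a small Python sketch. This is not the Actor's actual code: the function name `extract_emails` and the regular expressions are assumptions made for illustration, and a real implementation probably handles more variants.

```python
import re

# Hypothetical patterns: the Actor's real rules are not documented here.
# They match "[at]", "(at)", "{at}", "[arobase]" and "[dot]", "(dot)", "[point]".
_AT = re.compile(r"\s*[\[\(\{]\s*(?:at|arobase)\s*[\]\)\}]\s*", re.IGNORECASE)
_DOT = re.compile(r"\s*[\[\(\{]\s*(?:dot|point)\s*[\]\)\}]\s*", re.IGNORECASE)
_EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(text: str) -> list[str]:
    """Return lower-cased, de-duplicated emails in order of appearance."""
    # Rewrite the obfuscated separators first, then run a plain email regex.
    cleaned = _DOT.sub(".", _AT.sub("@", text))
    found: list[str] = []
    for match in _EMAIL.findall(cleaned):
        email = match.lower()
        if email not in found:
            found.append(email)
    return found


print(extract_emails("Contact : contact [at] domain [dot] fr"))
```

Rewriting the separators before matching keeps the email pattern itself simple, at the cost of occasionally rewriting a bracketed "at" or "dot" that was not part of an address.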
No headless browser: the Actor uses fast, lightweight HTTP crawling, so runs are quick and cheap.
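To make the fallback probing and the soft-404 guard more concrete, here is a hedged Python sketch. Only the three URL patterns named above are used, since the full list of 14 is not given. The marker strings and the helper names `candidate_urls` and `should_reject` are illustrative assumptions, not the Actor's real heuristics.

```python
# Only the three patterns named in this README; the remaining ones are not listed.
LEGAL_PATHS = ["/mentions-legales", "/mentions_legales.html", "/pages/mentions-legales"]

# Hypothetical phrases that suggest an error page served with HTTP 200.
NOT_FOUND_MARKERS = ("page introuvable", "page non trouvée", "erreur 404", "404 not found")


def candidate_urls(domain: str) -> list[str]:
    """Build the URLs to probe when the homepage has no legal notice link."""
    return [f"https://{domain}{path}" for path in LEGAL_PATHS]


def should_reject(status_code: int, body: str, homepage_body: str) -> bool:
    """Return True when a probed URL should not be treated as a legal notice page."""
    if status_code != 200:
        return True  # a real error status
    text = body.lower()
    if any(marker in text for marker in NOT_FOUND_MARKERS):
        return True  # soft 404: HTTP 200 but the content is an error page
    # Soft 404: the site silently served the homepage for an unknown path.
    return body.strip() == homepage_body.strip()
```

A caller would fetch each candidate in order with any HTTP client and keep the first response for which `should_reject` returns `False`.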
📥 Input
{
  "startUrls": [
    { "url": "example-company.fr" },
    { "url": "https://www.another-company.fr" }
  ],
  "maxConcurrency": 10,
  "proxyConfiguration": { "useApifyProxy": true }
}
Plain domains work — no need to find the legal notice URL yourself.
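If you prepare the input programmatically, a sketch like the following shows one way to turn plain domains and full URLs into the same shape. How the Actor normalizes its input internally is not documented here, so `normalize_start_url` and `build_input` are hypothetical helpers, and defaulting to `https://` is an assumption.

```python
from urllib.parse import urlsplit


def normalize_start_url(raw: str) -> str:
    """Turn 'example-company.fr' or a full URL into a homepage URL."""
    candidate = raw.strip()
    if "://" not in candidate:
        candidate = "https://" + candidate  # assumption: default to HTTPS
    parts = urlsplit(candidate)
    if not parts.hostname:
        raise ValueError(f"not a usable domain: {raw!r}")
    return f"{parts.scheme}://{parts.netloc.lower()}/"


def build_input(domains: list[str], max_concurrency: int = 10) -> dict:
    """Assemble an input object with the fields shown in the example above."""
    return {
        "startUrls": [{"url": normalize_start_url(d)} for d in domains],
        "maxConcurrency": max_concurrency,
        "proxyConfiguration": {"useApifyProxy": True},
    }


print(build_input(["example-company.fr", "https://www.another-company.fr"]))
```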
📤 Output example
{
  "domain": "exemple-plomberie.fr",
  "status": "found",
  "legalNoticeUrl": "https://exemple-plomberie.fr/mentions-legales",
  "companyName": "PLOMBERIE MARTIN",
  "legalForm": "SARL",
  "siren": "732829320",
  "sirenLuhnValid": true,
  "siret": "73282932000074",
  "siretLuhnValid": true,
  "tvaIntra": "FR44732829320",
  "rcsCity": "Lille Métropole",
  "rcsNumber": "732829320",
  "naf": "4322A",
  "capital": "10 000 €",
  "managerName": "Jean Martin",
  "publicationDirector": "Jean Martin",
  "emails": ["contact@exemple-plomberie.fr"],
  "phones": ["0320123456"],
  "address": "12 rue des Artisans, 59000 Lille",
  "postalCode": "59000",
  "city": "Lille",
  "hostingProvider": "OVH",
  "scrapedAt": "2026-07-03T10:00:00.000Z"
}
Export to CSV, Excel, JSON or connect directly to Make, Zapier, Google Sheets and 5,000+ tools via Apify integrations.
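If you post-process the JSON export yourself, the sketch below shows one possible way to keep only usable rows and flatten them for a CRM import. The field names come from the output example above. The column selection, the `;` separator for multi-valued fields and the function name `to_crm_csv` are assumptions for illustration.

```python
import csv
import io

# Columns chosen for illustration; any field from the output example could be added.
FIELDS = ["domain", "companyName", "siret", "managerName", "emails", "phones"]


def to_crm_csv(items: list[dict]) -> str:
    """Keep items with status 'found' and a Luhn-valid SIRET, return CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for item in items:
        if item.get("status") != "found" or not item.get("siretLuhnValid"):
            continue  # skip unreachable domains and doubtful identifiers
        writer.writerow({
            "domain": item.get("domain", ""),
            "companyName": item.get("companyName", ""),
            "siret": item.get("siret", ""),
            "managerName": item.get("managerName", ""),
            # Multi-valued fields are joined so each company stays on one row.
            "emails": ";".join(item.get("emails", [])),
            "phones": ";".join(item.get("phones", [])),
        })
    return buffer.getvalue()
```

Filtering on `siretLuhnValid` is a conservative choice: it drops rows where a 14-digit number was found but failed the checksum.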
❓ FAQ
Is this legal?
The Actor only reads information that French law (LCEN, art. 6-III) requires companies to publish publicly on their own websites. You remain responsible for how you use the data, in particular for GDPR compliance of your outreach.
What about JavaScript-heavy websites?
This version uses fast HTTP crawling. Sites that render their entire content with client-side JavaScript (rare for legal notice pages) may return legal_notice_not_found.
How accurate is the extraction?
SIREN and SIRET numbers are checked against the Luhn checksum, and the result is reported in `sirenLuhnValid` and `siretLuhnValid`, so candidates that fail the check can be filtered out. Free-text fields (company name, manager) use multiple extraction strategies but depend on how each site wrote its legal notice.
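For reference, the Luhn check itself is short. The sketch below is a standard implementation of the algorithm, not the Actor's source code, and the helper name `validate_identifier` is an assumption. It is exercised with the SIREN and SIRET from the output example.

```python
def luhn_valid(number: str) -> bool:
    """Standard Luhn check: double every second digit from the right."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    if not digits:
        return False
    total = 0
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9  # same as summing the two digits of the product
        total += digit
    return total % 10 == 0


def validate_identifier(value: str, length: int) -> bool:
    """Check length (9 for SIREN, 14 for SIRET) and the Luhn checksum."""
    compact = "".join(value.split())  # tolerate '732 829 320' style spacing
    return len(compact) == length and compact.isdigit() and luhn_valid(compact)


print(validate_identifier("732 829 320", 9))      # SIREN from the example
print(validate_identifier("73282932000074", 14))  # SIRET from the example
```

A passing checksum only shows that the number is well formed. It does not confirm that the company exists or is still active.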
Can I scrape thousands of domains?
Yes, that is what the Actor is built for. Concurrency is configurable and, with pricing from $5.00 per 1,000 results, one result costs roughly half a cent.
Keywords: mentions légales scraper, SIRET scraper, SIREN extractor, French company data, B2B leads France, KBIS alternative, TVA intracommunautaire, French business scraper, extraction mentions légales, enrichissement de leads.