Experts Du Patrimoine Scraper avatar

Experts Du Patrimoine Scraper

Pricing

from $1.99 / actor start

Go to Apify Store
Experts Du Patrimoine Scraper

Experts Du Patrimoine Scraper

Scrape the ~387 published wealth management expert profiles from experts-du-patrimoine.fr. Three modes: (1) all — fetch all profiles via the WordPress sitemap, (2) crawl — auto-discover from category/service pages, (3) urls — enrich a specific list of profile URLs.

Pricing

from $1.99 / actor start

Rating

0.0

(0)

Developer

Corentin Robert

Corentin Robert

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

17 days ago

Last modified

Share

Scrape wealth management expert profiles from experts-du-patrimoine.fr — the French directory of patrimony management professionals (CGP, CIF, Family Office, Courtier, Immobilier, etc.).

Built with CheerioCrawler (no browser, no Puppeteer). Fast, lightweight, and cloud-ready on Apify.


Features

  • Two scraping modes: direct URL list or full category/service crawl with automatic pagination
  • Extracts contacts (email, phone, website, social links), identity data (SIREN, ORIAS, AMF), expertise areas, products, and more
  • Phone numbers normalized to E.164 format (+33…)
  • Email-to-website fallback for profiles without an explicit website link
  • Deduplication via Crawlee's built-in request queue
  • Configurable concurrency and delay to respect server load

Modes

Fetches the WordPress Yoast sitemap (/fiche-societe-sitemap.xml) which contains the complete and up-to-date list of every published expert profile URL. No pagination, no listing pages — direct access to all ~387 profiles at once.

{
"mode": "all"
}

This is the most reliable approach: the sitemap (fiche-societe-sitemap.xml) is updated daily by WordPress/Yoast and lists every expert that has a published profile page. Under the hood it simply reads:

https://www.experts-du-patrimoine.fr/fiche-societe-sitemap.xml
<loc>https://.../societes/expert-1/</loc>
<loc>https://.../societes/expert-2/</loc>
→ ... (~387 profile URLs)

Then scrapes each profile directly, in parallel.

You will get ~387 complete expert profiles, each with name, email, phone, website, address, SIREN, ORIAS, categories, expertise areas, and more.

crawl

Provide one or more category or service listing page URLs. Useful to target a specific segment (e.g. only CIF advisors, or only family offices).

{
"mode": "crawl",
"categoryUrls": [
"https://www.experts-du-patrimoine.fr/categorie/family-office-et-mfo/",
"https://www.experts-du-patrimoine.fr/les-services/optimisation-fiscale/",
"https://www.experts-du-patrimoine.fr/categorie/cif/"
]
}

Supported listing URL patterns:

  • /categorie/{slug}/ — by professional category
  • /les-services/{slug}/ — by client need / service type

Each category may overlap with others (the same expert can appear under multiple categories). Crawlee's built-in deduplication ensures each profile is scraped only once regardless of the starting URLs.

urls

Provide specific expert profile URLs to scrape directly (no discovery).

{
"mode": "urls",
"startUrls": [
"https://www.experts-du-patrimoine.fr/societes/osl-conseil/",
"https://www.experts-du-patrimoine.fr/societes/edouard-7-gestion-privee/"
]
}

Both /societes/{slug}/ and /expert/{slug}/ URL patterns are accepted.


Input parameters

ParameterTypeDefaultDescription
modestringallall, crawl, or urls
startUrlsarray[][mode=urls] Expert profile URLs
categoryUrlsarray3 defaults[mode=crawl] Listing page URLs
maxRequestsPerCrawlinteger0Max pages to request total (0 = unlimited)
maxConcurrencyinteger5Parallel requests (1–10)
delayBetweenRequestsinteger1Seconds between detail page requests

Output fields

Each dataset item represents one expert profile:

FieldTypeDescription
urlstringProfile URL
namestring | nullExpert / company name
slugstring | nullURL slug
emailstring | nullEmail address (when publicly listed)
phonestring | nullPhone number
phoneE164string | nullPhone in E.164 format (+33…)
websitestring | nullWebsite URL (or inferred from email domain)
socialLinksobject | null{ linkedin, twitter, youtube, … }
addressstring | nullStreet address
citystring | nullCity
zipcodestring | nullPostal code
countrystring | nullCountry
addressFullstring | nullFull formatted address
sirenstring | nullSIREN number
oriasNumberstring | nullORIAS registration number
amfApprovalstring | nullAMF approval number (e.g. GP-11000041)
descriptionstring | nullProfile presentation text
categoriesstring[]Business categories (CGP, CIF, Family office…)
registrationCategoriesstring[]ORIAS registration codes (COA, COBSP, CIF…)
staffSizestring | nullNumber of staff (e.g. "2 à 10 collaborateurs")
experiencestring | nullYears of experience range
assetsUnderManagementstring | nullAUM range (e.g. "Plus de 500 millions")
clientTypesstring[]Target client types
remunerationstring | nullRemuneration type (e.g. Honoraires)
expertiseAreasstring[]Service expertise areas
productsstring[]Financial products offered
regionsstring[]Geographic regions served
ratingnumber | nullAverage rating (0–5)

Example output

{
"url": "https://www.experts-du-patrimoine.fr/societes/osl-conseil/",
"name": "OSL CONSEIL",
"slug": "osl-conseil",
"email": "contact@oslconseil.fr",
"phone": null,
"phoneE164": null,
"website": "https://oslconseil.fr/",
"socialLinks": { "linkedin": "https://www.linkedin.com/company/osl-conseil" },
"address": "13 boulevard Richemond",
"city": "Caen",
"zipcode": null,
"country": null,
"addressFull": "13 boulevard Richemond, Caen",
"siren": "814116828",
"oriasNumber": "15006611",
"amfApproval": null,
"description": "OSL Conseil vous accompagne dans la gestion, le développement…",
"categories": ["CIF"],
"registrationCategories": ["CIF", "COA", "COBSP"],
"staffSize": "0 à 1 collaborateurs",
"experience": "5 à 10 ans",
"assetsUnderManagement": "Entre 10 et 50 millions",
"clientTypes": ["Dirigeants", "Pro. libérales", "Retraités", "Salariés"],
"remuneration": "Honoraires",
"expertiseAreas": ["Optimisation fiscale", "Transmission du patrimoine"],
"products": ["Plan d'Épargne Retraite", "Retraite"],
"regions": [],
"rating": 0
}

Local testing

  1. Install dependencies: npm install
  2. Edit input.json with your test parameters
  3. Run locally: apify run
  4. Results are stored in storage/datasets/default/
npm install
apify run

Note: Local results are stored only on your filesystem in storage/. To see results in the Apify Console, deploy with apify push and run from the platform.


Deployment

apify login
apify push

Notes

  • Email availability: Not all profiles expose a public email address. Profiles without a mailto: link will have email: null.
  • Director name: Not available in the public HTML of profile pages.
  • Rate limiting: Default settings (concurrency 5, delay 1s) are conservative. Increase maxConcurrency and set delayBetweenRequests: 0 for faster runs on large datasets.
  • Deduplication: The same expert may appear under multiple categories. Crawlee's request queue deduplicates URLs automatically, so each profile is scraped only once.