PagesJaunes FR $0.8/1K Comprehensive Business Data avatar

PagesJaunes FR $0.8/1K Comprehensive Business Data

Pricing

from $0.80 / 1,000 results

Go to Apify Store
PagesJaunes FR $0.8/1K Comprehensive Business Data

PagesJaunes FR $0.8/1K Comprehensive Business Data

From $0.8/1K. Scrapes business listings from PagesJaunes.fr (French Yellow Pages). Extract comprehensive business information, including contact details, ratings, reviews, opening hours, payment methods, and category hierarchy across all of France.

Pricing

from $0.80 / 1,000 results

Rating

0.0

(0)

Developer

AbotAPI

AbotAPI

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 days ago

Last modified

Share

PagesJaunes FR Business Scraper

Scrapes business listings from PagesJaunes.fr (French Yellow Pages). Extract comprehensive business information including contact details, ratings, reviews, opening hours, payment methods, and category hierarchy across all of France.

Features

Core Capabilities

  • Comprehensive Data Extraction: Business names, phones, full addresses, ratings, reviews, opening hours, descriptions, payment methods, services, products, website + social profiles, and category breadcrumb
  • Latitude / Longitude: Attached automatically for every well-matched address — geo-targeting and map plotting ready
  • Two Input Modes: Search by keyword + location, or paste one or more PagesJaunes search URLs
  • Auto Page Forwarding: Each URL is crawled forward from its starting page up to your page limit — no need to list every page manually
  • Pagination Support: Automatically handles multi-page results, with global deduplication across all queries
  • Detail Page Enrichment: Optional deep scraping for full customer reviews, structured opening hours, and rich business profiles
  • Restaurant Specialty: Automatic price range and cuisine extraction for dining venues
  • Proxy Support: Built-in proxy configuration for reliable, uninterrupted scraping

Perfect for Lead Generation

  • Direct Contact Information: Phone numbers and business websites ready for immediate outreach
  • Social Profile Discovery: Facebook, Instagram, X, LinkedIn, YouTube, and TikTok URLs when the business publishes them
  • Location Intelligence: Complete structured addresses plus latitude/longitude coordinates for geo-targeting, mapping, and territory assignment
  • Qualified Leads: Filter by rating, review count, and opening hours to find actively-trading, well-reviewed businesses
  • Category Targeting: Rich breadcrumb hierarchy lets you segment leads by trade, specialty, and sub-category
  • Service & Product Matching: Structured lists of services and products let you match leads to specific buyer needs
  • Verification Signals: Payment methods accepted, opening hours declared, and review counts help validate real, active businesses
  • Market Coverage: 500+ pages of results for popular categories in major cities — build comprehensive regional lead lists in minutes

AI-Ready Data Structure

  • Structured JSON Output: Clean, consistent data format ready for AI processing, matching, and enrichment pipelines
  • Schema.org Typing: Each record is tagged with its schema.org business type (Restaurant, LocalBusiness, etc.) for downstream classification
  • Rich Context: Business descriptions and customer reviews with timestamps — ideal for sentiment analysis, summarization, and RAG
  • Searchable Metadata: Category breadcrumbs, opening hours, and location fields ready for AI-powered search, recommendation, and matching
  • Training Data: Comprehensive business profiles suited to training local-business recommendation, review-summarization, and quality-scoring models

Input Parameters

ParameterTypeDescriptionDefault
searchTermsstringKeyword — e.g. plombier, restaurant, dentiste. Ignored if startUrls is provided.plombier
locationstringCity / department / region — e.g. Paris, Lyon, 75. Ignored if startUrls is provided.Paris
startUrlsarrayPaste one or more PagesJaunes search URLs. Each is auto-crawled forward from its starting page.[]
maxPagesintegerMaximum pages per query (20 results per page) — 1..1002
scrapeDetailsbooleanFetch each business's detail page for reviews, opening hours, full descriptiontrue
maxConcurrencyintegerConcurrent detail requests — 1..205
proxyobjectRecommended. Default proxy configuration is provided.managed

Using Start URLs

Paste any PagesJaunes search URL and the actor crawls forward from where you pointed it:

{
"startUrls": [
{"url": "https://www.pagesjaunes.fr/annuaire/chercherlespros?quoiqui=plombier&ou=Paris"},
{"url": "https://www.pagesjaunes.fr/annuaire/chercherlespros?quoiqui=restaurant&ou=Lyon&page=3"}
],
"maxPages": 5
}
  • URL #1 → crawls pages 1–5 of plumbers in Paris.
  • URL #2 → crawls pages 3–7 of restaurants in Lyon (auto-forwards from the starting page).
  • Businesses that appear in multiple queries are pushed only once (global dedup by client ID).

Output Format

Search-Only Data

Each business record includes:

{
"clientId": "XXXXXXXX",
"name": "Sample Business Name",
"phone": "01 XX XX XX XX",
"address": "XX rue Example 75001 Paris",
"activities": ["plombiers"],
"rating": 4.8,
"reviewCount": 42,
"latitude": 48.852000,
"longitude": 2.350000,
"detailUrl": "https://www.pagesjaunes.fr/pros/XXXXXXXX",
"searchTerms": "plombier",
"searchLocation": "Paris"
}

Enriched Data (when scrapeDetails: true)

Additional fields from the business detail page:

{
"businessType": "LocalBusiness",
"fullAddress": {
"street": "XX rue Example",
"locality": "Paris",
"postalCode": "75001",
"country": null
},
"description": "Business description provided by the owner...",
"website": "https://www.example-business.fr",
"facebook": "https://www.facebook.com/samplepage",
"instagram": "https://www.instagram.com/samplehandle",
"twitter": "https://x.com/samplehandle",
"linkedin": "https://www.linkedin.com/company/sample",
"imageUrl": "https://www.pagesjaunes.fr/media/...",
"photos": [
"https://www.pagesjaunes.fr/media/.../photo1.jpg",
"https://www.pagesjaunes.fr/media/.../photo2.jpg"
],
"openingHours": [
"Mo 08:00-20:00",
"Tu 08:00-20:00",
"We 08:00-20:00",
"Th 08:00-20:00",
"Fr 08:00-20:00",
"Sa 09:00-18:00"
],
"paymentAccepted": "CB,Visa,Espèces,Chèque",
"breadcrumb": [
"Plombiers",
"Plombiers à Paris (75)",
"Dépannage plomberie",
"Dépannage plomberie à Paris (75)"
],
"services": ["Service A", "Service B", "Service C"],
"products": ["Product A", "Product B"],
"reviews": [
{
"author": "Customer Name",
"rating": 5.0,
"text": "Customer review text...",
"date": "2025-06-11"
}
]
}

Restaurant-Only Fields

When scraping restaurants, records include two additional fields:

{
"priceRange": "21-30 euros",
"servesCuisine": ["restaurant français"]
}

Example Categories

Popular keyword examples that work out of the box:

  • Trades & Services: plombier, électricien, serrurier, menuisier, peintre, couvreur
  • Food & Hospitality: restaurant, boulangerie, pâtisserie, bar, café, hôtel
  • Health & Beauty: dentiste, médecin, pharmacie, coiffeur, opticien, ophtalmologue
  • Automotive: garage automobile, carrosserie, contrôle technique, station service
  • Retail & Shopping: fleuriste, boucherie, librairie, bijouterie, magasin de meubles
  • Professional Services: avocat, notaire, comptable, agence immobilière, architecte

Location accepts free text: city names (Paris, Lyon, Marseille), department numbers (75, 13, 69), or regions (Île-de-France, Provence-Alpes-Côte d'Azur).