Paginegialle Search Scraper avatar

Paginegialle Search Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
Paginegialle Search Scraper

Paginegialle Search Scraper

Scrape business listings from Pagine Gialle — Italy's leading online directory. Collect names, addresses, contacts, ratings, opening hours, and 70+ fields per business. Perfect for lead generation, market research, and local SEO analysis.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

Stealth mode

Stealth mode

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

Pagine Gialle Search Scraper: Extract Italian Business Listings at Scale


What Is Pagine Gialle?

Pagine Gialle (paginegialle.it) is Italy's equivalent of the Yellow Pages — one of the country's most comprehensive online business directories. It indexes millions of Italian businesses across categories such as restaurants, legal services, healthcare, retail, and more, organized by region, province, and city.

For businesses and analysts needing Italian market data, manually browsing Pagine Gialle is impractical at scale. The Pagine Gialle Search Scraper automates extraction from category and search result pages, delivering structured, analysis-ready records.


Overview

This scraper targets Pagine Gialle business listing pages and extracts rich profiles for each result. It is suited for:

  • Sales & lead generation teams building prospect lists by city or category
  • Market researchers mapping business density across Italian regions
  • Local SEO agencies auditing competitor listings
  • Developers populating databases or CRMs with verified Italian business data

The scraper handles pagination automatically and supports bulk URL inputs, making it efficient for large-scale collection across multiple categories or cities.


Input Format

{
"urls": [
"https://www.paginegialle.it/lazio/roma/ristoranti/p-3.html"
],
"max_items_per_url": 200,
"ignore_url_failures": true
}
FieldTypeDescription
urlsarrayOne or more Pagine Gialle category/search result page URLs. Supports bulk input. Example: paginegialle.it/{region}/{city}/{category}/p-{page}.html
max_items_per_urlintegerMaximum number of business records to collect per URL. Default: 20. Set higher (e.g., 200) for large category pages.
ignore_url_failuresbooleanIf true, the scraper skips failed URLs and continues the run rather than stopping. Recommended for bulk jobs. Default: true.

Tip: To collect across multiple pages, add each paginated URL separately (e.g., /p-1.html, /p-2.html). To collect across cities, add one URL per city/category combination.


Output Format

Sample output

{
"parent_office_id": "c170c123-7f02-42a7-88ba-78f96f59b102",
"office_id": "c170c123-7f02-42a7-88ba-78f96f59b102",
"commission_id": "B16408713",
"opec_code": "H055D066",
"postal_code": "00136",
"city": "Roma",
"region": "Lazio",
"province": "RM",
"address": "Via Domenico Millelire, 20",
"latitude": 41.9053,
"longitude": 12.44652,
"business_name": "CASAMADRE",
"company_name": "Da Zacca ar 20",
"vat_number": "14448361007",
"category_id": "007585100",
"category_name": "Ristoranti",
"category_ids": [
"009152600",
"007585100"
],
"emails": [
"trattoriadazacca@gmail.com"
],
"phones": [
"06 83770169",
"345 5017051"
],
"whatsapp_phones": [
"345 5017051"
],
"logo_url": "https://img.italiaonline.it/0WO5p000004cC53GAE_1.png",
"logo_format": "png",
"images": [
{
"type": "gal",
"alt": "trippa",
"width": "800",
"height": "800",
"path": "https://img.italiaonline.it/0WO5p000004cC53GAE_2.png",
"fmt": "png",
"weight": ""
}
],
"media_types": [
1,
2,
4
],
"review_score": 0.9482,
"average_rating": 4.6,
"total_reviews": 94,
"short_description": "Un tuffo nei sapori genuini di Roma, con l'energia e la passione di chi ama davvero quello che fa.",
"description": "Un tuffo nei sapori genuini di Roma, con l'energia e la passione di chi ama davvero quello che fa.",
"has_timeline": true,
"is_open_now_enabled": true,
"is_product_visible": true,
"has_thefork_booking": true,
"slug": "trattoria-da-zacca-roma",
"distance": 0,
"schema_type": "Restaurant",
"opening_hours_raw": "{\"6\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"7\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"1\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"2\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"3\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"4\":[\"19:00 - 23:00\"],\"5\":[\"12:30 - 15:00\",\"19:00 - 23:00\"]}",
"score": 10500.0,
"contract_type": 15,
"quotation_count": 0,
"credit_count": 0,
"visit_count": 1.0,
"level": 0,
"answer_count": 0,
"commercial_weight": 414.0,
"result_type": "a",
"logo_weight": "",
"geo_value": 0.0,
"visibility_flags": [
"VIS",
"SGBAS",
"SBE",
"GODB",
"REPL",
"CRMCO"
],
"location_name": "Roma",
"location_code": 70464,
"current_day": "29/05/2026",
"current_time": "13:01",
"vote_commercial_enabled": true,
"vote_category_enabled": true,
"vote_observer_enabled": false,
"average_rating_main": 0.0,
"total_reviews_main": 0,
"average_rating_others": 4.6,
"total_reviews_others": 94,
"vote_slug": "trattoria-da-zacca-roma",
"price_list": {
"nome_listino": "Prezzi",
"ds_listino": "Menù del ristorante",
"fl_menu_thefork": true,
"gruppi_listino": [
{
"elementi": [
{
"prezzo": "3.00",
"ds_elemento": "Cocacola zero"
},
{
"prezzo": "3.00",
"ds_elemento": "Fanta"
},
{
"prezzo": "2.00",
"ds_elemento": "Acqua naturale/frizzante"
}
],
"ds_gruppo": "Specialità"
}
]
},
"quote_info": {
"email": "trattoriadazacca@gmail.com"
},
"simple_office_flag": "1",
"thefork_data": {
"ds_url_widget": "https://module.thefork.com/it_IT/module/377873-da01e",
"fl_attivazione": "1"
},
"links_data": {
"multilinks": [
{
"lbl": "Instagram",
"url": "https://www.instagram.com/trattoriadazacca"
},
{
"lbl": "Facebook",
"url": "https://www.facebook.com/trattoriadazacca"
},
{
"lbl": "TheFork",
"url": "https://thefork.it/ristorante/trattoria-da-zacca-ar-20-r377873"
}
]
},
"position": 0,
"keyword_evidence": [
"tavoli all'aperto"
],
"keyword_evidence_processed": [
{
"lbl": "tavoli all'aperto",
"tot_chars": 17
}
],
"preload_enabled": true,
"extra_data": {
"disp_map": true,
"raggr": false,
"t": "1",
"i": 1,
"site_link": {
"url": "https://www.instagram.com/trattoriadazacca",
"pag": "multilink/https://www.instagram.com/trattoriadazacca",
"origin": "multilink"
},
"has_comments": true,
"shiny_user": "c170c123-7f02-42a7-88ba-78f96f59b102",
"urlms": "https://www.paginegialle.it/trattoria-da-zacca-roma",
"sedi": {}
},
"cta_data": {
"chiama_ora": true,
"contattaci": true,
"prenota_tavolo_thefork": true,
"whatsapp": true,
"sitoweb": true,
"ordered_ctas": [
{
"cta": "chiama"
},
{
"cta": "prenota_tavolo_thefork"
}
]
},
"open_now_data": {
"type": "NowOpened",
"lbl": "chiude_a",
"today_lbl": "venerdi",
"today": "29/5",
"val": "15:00",
"open_time": [
"12:30 - 15:00",
"19:00 - 23:00"
],
"today_lbl_with_lang": "Venerdì",
"lbl_with_lang": "<strong>Aperto</strong> fino alle 15:00"
},
"wips_pay_item_image_url": {
"url": "https://wips.plug.it/scips/VxP_iGAmab3NEhVV1iOX-Q/img.italiaonline.it/0WO5p000004cC53GAE_2.png?a=rc&w=217&h=270&ssl=1",
"image_type": "gall"
},
"pay_for_category_enabled": true,
"from_url": "https://www.paginegialle.it/lazio/roma/ristoranti/p-3.html?output=json"
}

Each business record contains up to 70+ fields. Below is a field-by-field reference:

Identity & Location

FieldMeaning
Business NameTrading name displayed on the listing
Company NameLegal registered company name
VAT NumberItalian Partita IVA — useful for B2B verification
AddressStreet address of the business
Postal CodeItalian CAP code
CityMunicipality
ProvinceItalian province (e.g., RM for Rome)
RegionItalian region (e.g., Lazio)
Latitude / LongitudeGeographic coordinates for mapping
Location Name / Location CodePagine Gialle's internal location identifiers
SlugURL-friendly business identifier
DistanceDistance from search center point (when applicable)

Contact Information

FieldMeaning
PhonesPrimary phone number(s)
WhatsApp PhonesWhatsApp-enabled numbers, if listed
EmailsContact email address(es)

Category & Classification

FieldMeaning
Category NamePrimary business category (e.g., Ristoranti)
Category IDPagine Gialle internal category code
Category IDsAll associated category codes
Schema TypeStructured data type (e.g., LocalBusiness, Restaurant)
OPEC CodeInternal operational classification code

Ratings & Reviews

FieldMeaning
Review ScoreComposite score used for ranking
Average RatingMean star rating across all reviews
Total ReviewsTotal review count
Average Rating MainRating from Pagine Gialle's primary review source
Total Reviews MainReview count from primary source
Average Rating OthersRating aggregated from third-party sources
Total Reviews OthersReview count from third-party sources
Vote SlugReview page identifier
Vote Commercial EnabledWhether commercial reviews are active
Vote Category EnabledWhether category-based voting is active
Vote Observer EnabledWhether observer voting is active

Business Description & Media

FieldMeaning
Short DescriptionBrief business summary
DescriptionFull business profile description
Logo URLDirect URL to the business logo image
Logo FormatImage format of the logo (e.g., JPEG, PNG)
ImagesArray of additional business photo URLs
Media TypesTypes of media attached to the listing

Operational Data

FieldMeaning
Opening Hours RawRaw opening hours string as listed
Current Day / Current TimeServer-side day and time at scrape moment
Is Open Now EnabledWhether real-time open/closed status is active
Open Now DataStructured open/closed status object
Has TimelineWhether the business has an event/post timeline
TheFork DataRestaurant booking data from TheFork integration
Has TheFork BookingWhether TheFork reservations are enabled
Price ListMenu or service price information, if available

Commercial & Visibility Metrics

FieldMeaning
ScorePagine Gialle internal relevance score
Commercial WeightWeight reflecting paid listing tier
Logo WeightRanking boost from having a logo
Geo ValueGeographic relevance score
LevelSubscription/listing tier
Contract TypeCommercial contract category
Credit CountPlatform credits associated with the listing
Visit CountProfile view count
Quotation CountNumber of quote requests received
Answer CountNumber of business responses to reviews/queries
Result TypeType of result (organic, sponsored, etc.)
Visibility FlagsObject encoding various display/promotion flags
Is Product VisibleWhether the product/service listing is visible
Pay For Category EnabledWhether paid category placement is active

IDs & Internal References

FieldMeaning
Office IDUnique identifier for this business location
Parent Office IDID of the parent entity (for chains/multi-location businesses)
Commission IDInternal commercial commission reference
PositionRank position in the search result page

Supplementary Data

FieldMeaning
Links DataExternal links attached to the listing (website, social)
CTA DataCall-to-action configuration (e.g., "Call Now", "Get Quote")
Quote InfoDetails about quote/inquiry functionality
Extra DataMiscellaneous structured data not covered by standard fields
Keyword EvidenceRaw keyword match signals used for ranking
Keyword Evidence ProcessedNormalized keyword relevance data
Preload EnabledWhether listing data is preloaded for performance
Simple Office FlagIndicates a basic (non-enriched) listing
WIPS Pay Item Image URLImage URL for paid product/service item

How to Use

  1. Find target URLs — Navigate to paginegialle.it, select a region, city, and category. Copy the result page URL (e.g., .../lazio/roma/ristoranti/p-1.html).
  2. Build your URL list — Add one URL per page or per category. Use the bulk edit option for large lists.
  3. Set item limit — Adjust max_items_per_url based on how many results you need per page (typically 10–30 per page; set higher to handle variable page sizes).
  4. Run and export — Start the scraper and download results as JSON, CSV, or Excel.

Best practices:

  • Use paginated URLs (/p-1.html, /p-2.html, ...) to cover full categories.
  • Enable ignore_url_failures: true for any bulk run to avoid interruptions from occasionally unavailable pages.
  • Cross-reference VAT Number with Italian business registries for data enrichment.

Use Cases & Business Value

  • Lead generation: Build targeted prospect lists of Italian businesses by city, category, and region
  • Restaurant tech: Aggregate dining options with ratings, TheFork booking status, and pricing
  • Real estate & local analytics: Map business density across neighborhoods using lat/long data
  • CRM enrichment: Append verified phone, email, and address data to existing Italian business records

Conclusion

The Pagine Gialle Search Scraper provides one of the most complete extractions available from Italy's primary business directory. With 70+ output fields spanning contacts, ratings, commercial metadata, and geolocation, it delivers the depth needed for serious lead generation and market analysis. Start with a single category page and scale to thousands of Italian business records in minutes.