Paginegialle Search Scraper
Pricing
from $3.00 / 1,000 results
Scrape business listings from Pagine Gialle — Italy's leading online directory. Collect names, addresses, contacts, ratings, opening hours, and 70+ fields per business. Perfect for lead generation, market research, and local SEO analysis.
Developer: Stealth mode
Maintained by Community
Pagine Gialle Search Scraper: Extract Italian Business Listings at Scale
What Is Pagine Gialle?
Pagine Gialle (paginegialle.it) is Italy's equivalent of the Yellow Pages — one of the country's most comprehensive online business directories. It indexes millions of Italian businesses across categories such as restaurants, legal services, healthcare, retail, and more, organized by region, province, and city.
For businesses and analysts needing Italian market data, manually browsing Pagine Gialle is impractical at scale. The Pagine Gialle Search Scraper automates extraction from category and search result pages, delivering structured, analysis-ready records.
Overview
This scraper targets Pagine Gialle business listing pages and extracts rich profiles for each result. It is suited for:
- Sales & lead generation teams building prospect lists by city or category
- Market researchers mapping business density across Italian regions
- Local SEO agencies auditing competitor listings
- Developers populating databases or CRMs with verified Italian business data
The scraper handles pagination automatically and supports bulk URL inputs, making it efficient for large-scale collection across multiple categories or cities.
Input Format
{"urls": ["https://www.paginegialle.it/lazio/roma/ristoranti/p-3.html"],"max_items_per_url": 200,"ignore_url_failures": true}
| Field | Type | Description |
|---|---|---|
| urls | array | One or more Pagine Gialle category/search result page URLs. Supports bulk input. Example: paginegialle.it/{region}/{city}/{category}/p-{page}.html |
| max_items_per_url | integer | Maximum number of business records to collect per URL. Default: 20. Set higher (e.g., 200) for large category pages. |
| ignore_url_failures | boolean | If true, the scraper skips failed URLs and continues the run rather than stopping. Recommended for bulk jobs. Default: true. |
Tip: To collect across multiple pages, add each paginated URL separately (e.g., `/p-1.html`, `/p-2.html`). To collect across cities, add one URL per city/category combination.
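The tip above can be scripted rather than typed by hand. The following is a minimal sketch, assuming the URL pattern shown in the table; `build_page_urls` and `build_input` are hypothetical helper names for illustration and are not part of the scraper itself.

```python
import json

BASE = "https://www.paginegialle.it"

def build_page_urls(region, city, category, pages):
    """Return one result-page URL per page number (hypothetical helper)."""
    return [f"{BASE}/{region}/{city}/{category}/p-{n}.html" for n in pages]

def build_input(urls, max_items_per_url=200, ignore_url_failures=True):
    """Assemble an input object from the three documented fields."""
    return {
        "urls": list(urls),
        "max_items_per_url": max_items_per_url,
        "ignore_url_failures": ignore_url_failures,
    }

# Pages 1 to 3 of the Rome restaurant category used in the sample input.
urls = build_page_urls("lazio", "roma", "ristoranti", range(1, 4))
scraper_input = build_input(urls)
print(json.dumps(scraper_input, indent=2))
```

To cover several cities or categories, call `build_page_urls` once per combination and concatenate the lists before passing them to `build_input`.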
Output Format
Sample output
{"parent_office_id": "c170c123-7f02-42a7-88ba-78f96f59b102","office_id": "c170c123-7f02-42a7-88ba-78f96f59b102","commission_id": "B16408713","opec_code": "H055D066","postal_code": "00136","city": "Roma","region": "Lazio","province": "RM","address": "Via Domenico Millelire, 20","latitude": 41.9053,"longitude": 12.44652,"business_name": "CASAMADRE","company_name": "Da Zacca ar 20","vat_number": "14448361007","category_id": "007585100","category_name": "Ristoranti","category_ids": ["009152600","007585100"],"emails": ["trattoriadazacca@gmail.com"],"phones": ["06 83770169","345 5017051"],"whatsapp_phones": ["345 5017051"],"logo_url": "https://img.italiaonline.it/0WO5p000004cC53GAE_1.png","logo_format": "png","images": [{"type": "gal","alt": "trippa","width": "800","height": "800","path": "https://img.italiaonline.it/0WO5p000004cC53GAE_2.png","fmt": "png","weight": ""}],"media_types": [1,2,4],"review_score": 0.9482,"average_rating": 4.6,"total_reviews": 94,"short_description": "Un tuffo nei sapori genuini di Roma, con l'energia e la passione di chi ama davvero quello che fa.","description": "Un tuffo nei sapori genuini di Roma, con l'energia e la passione di chi ama davvero quello che fa.","has_timeline": true,"is_open_now_enabled": true,"is_product_visible": true,"has_thefork_booking": true,"slug": "trattoria-da-zacca-roma","distance": 0,"schema_type": "Restaurant","opening_hours_raw": "{\"6\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"7\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"1\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"2\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"3\":[\"12:30 - 15:00\",\"19:00 - 23:00\"],\"4\":[\"19:00 - 23:00\"],\"5\":[\"12:30 - 15:00\",\"19:00 - 23:00\"]}","score": 10500.0,"contract_type": 15,"quotation_count": 0,"credit_count": 0,"visit_count": 1.0,"level": 0,"answer_count": 0,"commercial_weight": 414.0,"result_type": "a","logo_weight": "","geo_value": 0.0,"visibility_flags": ["VIS","SGBAS","SBE","GODB","REPL","CRMCO"],"location_name": 
"Roma","location_code": 70464,"current_day": "29/05/2026","current_time": "13:01","vote_commercial_enabled": true,"vote_category_enabled": true,"vote_observer_enabled": false,"average_rating_main": 0.0,"total_reviews_main": 0,"average_rating_others": 4.6,"total_reviews_others": 94,"vote_slug": "trattoria-da-zacca-roma","price_list": {"nome_listino": "Prezzi","ds_listino": "Menù del ristorante","fl_menu_thefork": true,"gruppi_listino": [{"elementi": [{"prezzo": "3.00","ds_elemento": "Cocacola zero"},{"prezzo": "3.00","ds_elemento": "Fanta"},{"prezzo": "2.00","ds_elemento": "Acqua naturale/frizzante"}],"ds_gruppo": "Specialità"}]},"quote_info": {"email": "trattoriadazacca@gmail.com"},"simple_office_flag": "1","thefork_data": {"ds_url_widget": "https://module.thefork.com/it_IT/module/377873-da01e","fl_attivazione": "1"},"links_data": {"multilinks": [{"lbl": "Instagram","url": "https://www.instagram.com/trattoriadazacca"},{"lbl": "Facebook","url": "https://www.facebook.com/trattoriadazacca"},{"lbl": "TheFork","url": "https://thefork.it/ristorante/trattoria-da-zacca-ar-20-r377873"}]},"position": 0,"keyword_evidence": ["tavoli all'aperto"],"keyword_evidence_processed": [{"lbl": "tavoli all'aperto","tot_chars": 17}],"preload_enabled": true,"extra_data": {"disp_map": true,"raggr": false,"t": "1","i": 1,"site_link": {"url": "https://www.instagram.com/trattoriadazacca","pag": "multilink/https://www.instagram.com/trattoriadazacca","origin": "multilink"},"has_comments": true,"shiny_user": "c170c123-7f02-42a7-88ba-78f96f59b102","urlms": "https://www.paginegialle.it/trattoria-da-zacca-roma","sedi": {}},"cta_data": {"chiama_ora": true,"contattaci": true,"prenota_tavolo_thefork": true,"whatsapp": true,"sitoweb": true,"ordered_ctas": [{"cta": "chiama"},{"cta": "prenota_tavolo_thefork"}]},"open_now_data": {"type": "NowOpened","lbl": "chiude_a","today_lbl": "venerdi","today": "29/5","val": "15:00","open_time": ["12:30 - 15:00","19:00 - 23:00"],"today_lbl_with_lang": 
"Venerdì","lbl_with_lang": "<strong>Aperto</strong> fino alle 15:00"},"wips_pay_item_image_url": {"url": "https://wips.plug.it/scips/VxP_iGAmab3NEhVV1iOX-Q/img.italiaonline.it/0WO5p000004cC53GAE_2.png?a=rc&w=217&h=270&ssl=1","image_type": "gall"},"pay_for_category_enabled": true,"from_url": "https://www.paginegialle.it/lazio/roma/ristoranti/p-3.html?output=json"}
Each business record can contain more than 70 fields. Below is a field-by-field reference:
Identity & Location
| Field | Meaning |
|---|---|
| Business Name | Trading name displayed on the listing |
| Company Name | Legal registered company name |
| VAT Number | Italian Partita IVA, useful for B2B verification |
| Address | Street address of the business |
| Postal Code | Italian CAP code |
| City | Municipality |
| Province | Italian province (e.g., RM for Rome) |
| Region | Italian region (e.g., Lazio) |
| Latitude / Longitude | Geographic coordinates for mapping |
| Location Name / Location Code | Pagine Gialle's internal location identifiers |
| Slug | URL-friendly business identifier |
| Distance | Distance from search center point (when applicable) |
Contact Information
| Field | Meaning |
|---|---|
| Phones | Primary phone number(s) |
| WhatsApp Phones | WhatsApp-enabled numbers, if listed |
| Emails | Contact email address(es) |
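Phone numbers arrive in national format with spaces (for example "06 83770169" in the sample output), which many CRMs and dialers reject. Below is a minimal sketch of converting them to international format; `to_international` is a hypothetical helper, and it assumes every number without an explicit prefix is Italian. Note that Italian numbers keep their leading 0 after the +39 country code.

```python
import re

def to_international(phone, country_code="39"):
    """Normalize a phone string to +<country><number> (hypothetical helper).

    Assumes numbers without a '+' or '00' prefix are Italian national numbers.
    """
    stripped = phone.strip()
    digits = re.sub(r"\D", "", stripped)
    if stripped.startswith("+"):
        return f"+{digits}"          # already international
    if digits.startswith("00"):
        return f"+{digits[2:]}"      # 00-prefixed international form
    return f"+{country_code}{digits}"

record = {"phones": ["06 83770169", "345 5017051"]}
normalized = [to_international(p) for p in record["phones"]]
print(normalized)
```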
Category & Classification
| Field | Meaning |
|---|---|
| Category Name | Primary business category (e.g., Ristoranti) |
| Category ID | Pagine Gialle internal category code |
| Category IDs | All associated category codes |
| Schema Type | Structured data type (e.g., LocalBusiness, Restaurant) |
| OPEC Code | Internal operational classification code |
Ratings & Reviews
| Field | Meaning |
|---|---|
| Review Score | Composite score used for ranking |
| Average Rating | Mean star rating across all reviews |
| Total Reviews | Total review count |
| Average Rating Main | Rating from Pagine Gialle's primary review source |
| Total Reviews Main | Review count from primary source |
| Average Rating Others | Rating aggregated from third-party sources |
| Total Reviews Others | Review count from third-party sources |
| Vote Slug | Review page identifier |
| Vote Commercial Enabled | Whether commercial reviews are active |
| Vote Category Enabled | Whether category-based voting is active |
| Vote Observer Enabled | Whether observer voting is active |
Business Description & Media
| Field | Meaning |
|---|---|
| Short Description | Brief business summary |
| Description | Full business profile description |
| Logo URL | Direct URL to the business logo image |
| Logo Format | Image format of the logo (e.g., JPEG, PNG) |
| Images | Array of image objects, each with type, alt text, width, height, format, and the image URL (path) |
| Media Types | Types of media attached to the listing |
Operational Data
| Field | Meaning |
|---|---|
| Opening Hours Raw | Opening hours as a JSON-encoded string that maps day numbers to lists of time ranges (e.g., "12:30 - 15:00") |
| Current Day / Current Time | Server-side day and time at the moment of scraping |
| Is Open Now Enabled | Whether real-time open/closed status is active |
| Open Now Data | Structured open/closed status object |
| Has Timeline | Whether the business has an event/post timeline |
| TheFork Data | Restaurant booking data from TheFork integration |
| Has TheFork Booking | Whether TheFork reservations are enabled |
| Price List | Menu or service price information, if available |
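Because `opening_hours_raw` in the sample output is a JSON string nested inside the record, it needs a second decoding step before it can be queried. The sketch below shows one way to do that; `parse_opening_hours` is a hypothetical helper, and the day numbers are left as integers because the source does not document which number corresponds to which weekday.

```python
import json
from datetime import time

def parse_opening_hours(raw):
    """Decode opening_hours_raw into {day_number: [(open, close), ...]}.

    Assumes each range looks like "HH:MM - HH:MM", as in the sample output.
    """
    parsed = {}
    for day, ranges in json.loads(raw).items():
        slots = []
        for span in ranges:
            start, end = (part.strip() for part in span.split("-"))
            slots.append((time.fromisoformat(start), time.fromisoformat(end)))
        parsed[int(day)] = slots
    return dict(sorted(parsed.items()))

# Shortened excerpt of the sample value.
raw = '{"5":["12:30 - 15:00","19:00 - 23:00"],"4":["19:00 - 23:00"]}'
hours = parse_opening_hours(raw)
for day, slots in hours.items():
    print(day, [(a.isoformat("minutes"), b.isoformat("minutes")) for a, b in slots])
```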
Commercial & Visibility Metrics
| Field | Meaning |
|---|---|
| Score | Pagine Gialle internal relevance score |
| Commercial Weight | Weight reflecting paid listing tier |
| Logo Weight | Ranking boost from having a logo |
| Geo Value | Geographic relevance score |
| Level | Subscription/listing tier |
| Contract Type | Commercial contract category |
| Credit Count | Platform credits associated with the listing |
| Visit Count | Profile view count |
| Quotation Count | Number of quote requests received |
| Answer Count | Number of business responses to reviews/queries |
| Result Type | Type of result (organic, sponsored, etc.) |
| Visibility Flags | Array of codes for display/promotion flags (e.g., VIS, REPL) |
| Is Product Visible | Whether the product/service listing is visible |
| Pay For Category Enabled | Whether paid category placement is active |
IDs & Internal References
| Field | Meaning |
|---|---|
| Office ID | Unique identifier for this business location |
| Parent Office ID | ID of the parent entity (for chains/multi-location businesses) |
| Commission ID | Internal commercial commission reference |
| Position | Rank position in the search result page |
Supplementary Data
| Field | Meaning |
|---|---|
| Links Data | External links attached to the listing (website, social) |
| CTA Data | Call-to-action configuration (e.g., "Call Now", "Get Quote") |
| Quote Info | Details about quote/inquiry functionality |
| Extra Data | Miscellaneous structured data not covered by standard fields |
| Keyword Evidence | Raw keyword match signals used for ranking |
| Keyword Evidence Processed | Normalized keyword relevance data |
| Preload Enabled | Whether listing data is preloaded for performance |
| Simple Office Flag | Indicates a basic (non-enriched) listing |
| WIPS Pay Item Image URL | Image URL for paid product/service item |
How to Use
- Find target URLs: Navigate to paginegialle.it and select a region, city, and category. Copy the result page URL (e.g., `.../lazio/roma/ristoranti/p-1.html`).
- Build your URL list: Add one URL per page or per category. Use the bulk edit option for large lists.
- Set item limit: Adjust `max_items_per_url` based on how many results you need per page (typically 10–30 per page; set higher to handle variable page sizes).
- Run and export: Start the scraper and download results as JSON, CSV, or Excel.
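When only a handful of columns are needed, a JSON export can be trimmed down locally after the run. The following is a rough sketch of that post-processing step, assuming records shaped like the sample output; `to_csv` and the column selection are illustrative choices, not something the scraper provides.

```python
import csv
import io

COLUMNS = ["business_name", "city", "province", "phones", "emails", "average_rating"]

def to_csv(records, columns=COLUMNS):
    """Write selected fields to CSV text, joining list values with '; '."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(columns)
    for rec in records:
        row = []
        for col in columns:
            value = rec.get(col, "")  # missing fields become empty cells
            if isinstance(value, list):
                value = "; ".join(str(v) for v in value)
            row.append(value)
        writer.writerow(row)
    return buf.getvalue()

# One record reduced to a few fields from the sample output.
records = [{
    "business_name": "CASAMADRE",
    "city": "Roma",
    "province": "RM",
    "phones": ["06 83770169", "345 5017051"],
    "emails": ["trattoriadazacca@gmail.com"],
    "average_rating": 4.6,
}]
csv_text = to_csv(records)
print(csv_text)
```

Nested objects such as `price_list` or `links_data` are deliberately left out here; they would need their own flattening rules.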
Best practices:
- Use paginated URLs (`/p-1.html`, `/p-2.html`, ...) to cover full categories.
- Enable `ignore_url_failures: true` for any bulk run to avoid interruptions from occasionally unavailable pages.
- Cross-reference `VAT Number` with Italian business registries for data enrichment.
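Before cross-referencing VAT numbers against a registry, it can help to discard values that are malformed. An Italian Partita IVA has 11 digits, the last of which is a Luhn-style check digit. The sketch below checks that structure only; `is_valid_partita_iva` is a hypothetical helper, and a passing checksum does not prove the number is actually registered.

```python
def is_valid_partita_iva(vat):
    """Return True if vat is 11 ASCII digits with a correct check digit."""
    if len(vat) != 11 or not (vat.isascii() and vat.isdigit()):
        return False
    total = 0
    for index, char in enumerate(vat[:10]):
        digit = int(char)
        if index % 2 == 1:      # 2nd, 4th, ... digits are doubled
            digit *= 2
            if digit > 9:
                digit -= 9      # same as summing the two digits
        total += digit
    return (10 - total % 10) % 10 == int(vat[10])

# The VAT number from the sample output passes the check.
print(is_valid_partita_iva("14448361007"))
```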
Use Cases & Business Value
- Lead generation: Build targeted prospect lists of Italian businesses by city, category, and region
- Restaurant tech: Aggregate dining options with ratings, TheFork booking status, and pricing
- Real estate & local analytics: Map business density across neighborhoods using lat/long data
- CRM enrichment: Append verified phone, email, and address data to existing Italian business records
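As an illustration of the local analytics use case, the `latitude` and `longitude` fields allow records to be filtered by distance from a point of interest. This is a minimal sketch using the haversine great-circle formula; `haversine_km` and `within_radius` are hypothetical helpers, the center point is an arbitrary location in central Rome, and the second record is made up for the example.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/long points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def within_radius(records, lat, lon, radius_km):
    """Keep records that have coordinates and lie within radius_km of (lat, lon)."""
    return [
        rec for rec in records
        if rec.get("latitude") is not None and rec.get("longitude") is not None
        and haversine_km(lat, lon, rec["latitude"], rec["longitude"]) <= radius_km
    ]

records = [
    {"business_name": "CASAMADRE", "latitude": 41.9053, "longitude": 12.44652},
    {"business_name": "Made-up distant listing", "latitude": 45.4642, "longitude": 9.19},
]
nearby = within_radius(records, 41.9028, 12.4964, radius_km=5.0)
print([rec["business_name"] for rec in nearby])
```

Counting the filtered records per category or per grid cell then gives a simple density measure.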
Conclusion
The Pagine Gialle Search Scraper extracts structured records from Italy's primary business directory. With 70+ output fields spanning contacts, ratings, commercial metadata, and geolocation, it delivers the depth needed for serious lead generation and market analysis. Start with a single category page, then scale up by adding paginated URLs and further city/category combinations.