Pricing

Pay per event

Directory & Map Data Extractor

Directory Listing Extractor collects structured organization data from public directory and map pages — even when listings are rendered via embedded JavaScript objects. It is designed for business, research, and institutional use, not personal data scraping. Exports clean CSV/XLSX ready for CRM

Pricing

Pay per event

Rating

0.0

(0)

Developer

Artashes Arakelyan

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

Categories

Lead generation

Automation

Developer tools

Directory Listing Extractor Clean, structured, Excel-ready directories from interactive maps and listing pages This Actor extracts high-quality, analysis-ready datasets from directory and map pages such as NGO directories, agricultural maps, association member listings, partner registries, and store locators. It is optimized for JavaScript-heavy websites where listing data is embedded directly in HTML or inline JavaScript objects (rather than rendered line-by-line in the DOM). The output is designed to be immediately usable in Excel, Google Sheets, BI tools, or research workflows.

What this Actor does For each organization, farm, or business listing found on a directory or map page, the Actor extracts structured information including: Core fields • Entity / organization name • Location (city, region, country) • Address (when available) • Website • Profile or detail page URL • Public email address • Public phone number Classification & enrichment • Categories (decoded into human-readable names) • Services (decoded into human-readable names) • Products (if available) • Semantic alignment of product labels to standardized categories Optional descriptive fields When available in the source data: • Size • Results / outcomes • Quotes / descriptions • Logos Each listing is saved as one dataset item, normalized and ready for export.

Typical use cases • NGO and non-profit directories • Agricultural, food system, and sustainability maps • Association or member registries • Partner, supplier, or producer listings • Store locators and business directories • Market research, benchmarking, and data enrichment projects Most users analyze results by entity name, category, or services, not by street-level geocoding.

Supported extraction modes embedded_js (recommended) Use this mode when listing data is embedded directly in the page HTML or JavaScript. You configure: • embedded.anchorKey – a field that reliably exists in every listing object (used to detect listings) • embedded.keys – raw fields to extract from embedded objects • embedded.fieldMap – mapping from raw keys to clean output field names This mode is: • Fast • Reliable • Stable on JS-heavy pages • Does not require clicking or pagination simulation auto (future) A future version may include automatic heuristics combining multiple extraction strategies.

Category & service decoding (important) Many directories store categories and services as internal codes (for example: rpc5, rpc12, rss6). These codes are not user-friendly on their own. This Actor supports: • Static canonical taxonomies (built-in) • Automatic detection from embedded HTML or JavaScript (when available) • Manual overrides via input: o taxonomy.categoryMap o taxonomy.serviceMap The output includes: • category_codes / service_codes (raw values) • category_names / service_names (human-readable, Excel-friendly) This makes the dataset immediately usable for filtering, grouping, and reporting.

Product category normalization Some directories expose product filters in the UI that are not stored as canonical categories. This Actor: • Aligns product labels (e.g. Meat & poultry, Eggs, Vegetables) to standardized category codes • Preserves transparency between filters, products, and true entity categories • Avoids guessing or over-assignment This is especially important for research-grade or policy-oriented datasets.

Quick start (default test) The Actor includes a working default configuration: • Start URL: https://regenerationcanada.org/en/map/ • Mode: embedded_js • Max pages: 1 Simply click Run in the Apify Console to see results.

Input parameters (overview) • mode – Extraction strategy (default: embedded_js) • startUrls – One or more directory or map pages • maxListings – Safety cap on extracted listings • maxPages – Safety cap on visited pages • embedded.anchorKey – Field identifying listing objects • embedded.keys – Fields to extract from embedded objects • embedded.fieldMap – Rename raw fields to clean output fields • taxonomy.categoryMap – Optional manual category decoding • taxonomy.serviceMap – Optional manual service decoding • debug – Enable verbose logging for troubleshooting

Example output Each dataset item is a normalized object, for example: { "entity_name": "Southbrook Vineyards", "category_names": "Fruit; Value-added products", "service_names": "Farm tour (for general public)", "products": "Wine; Beef; Eggs", "email": "info@southbrook.com", "phone": "905-380-9095", "website": "https://www.southbrook.com", "location": "Niagara-on-the-Lake, Ontario", "profile_url": "https://regenerationcanada.org/en/southbrook-vineyard/", "source_url": "https://regenerationcanada.org/en/map/" }

CSV / XLSX output (for clients & non-technical users) Although data is processed internally as JSON, no JSON handling is required. Delivered formats • CSV • XLSX (Excel) Files are: • UTF-8 encoded • Excel-safe • Ready for Google Sheets or BI tools Exporting from Apify Console

Open the Actor run
Go to the Dataset tab
Click Export
Choose CSV or XLSX
Download No additional configuration is required.

ISSA Directory Scraper

songd/ISSA-Directory-Scraper

Scrapes company directory data from ISSA's member directory.

Singed

Google Map Scraper

chatgpt.extkit/my-actor

Shad Maggio

Lead Extractor-Google Map

happitap/lead-extractor-google-map

Lead Extractor is an automation tool that extracts business leads and contact details from Google Map. Instantly collect company names, phone numbers, and more for easy export and outreach. Perfect for sales, marketing, and lead generation.

HappiTap

Google Map Business Scraper With Email

bhansalisoft/google-map-business-scraper

Google Map Business Extractor with Email – Powerful Google Map Scraper software to extract complete business details including name, address, phone, Email (if available on website), website, ratings, reviews, images, working hours, category, and GPS coordinates from google map.

bhansalisoft

506

2.0

Google Map Extractor Genie

salmanrajz/google-maps-fetcher

Google Map Extractor Genie quickly extracts detailed location data from Google Maps, business info, reviews, contacts, and more—perfect for lead generation and market research.

Salman Ahmed

Europages Business Directory Scraper

easyapi/europages-business-directory-scraper

🏭 Extract detailed company information from Europages business directory. Get comprehensive data including company profiles, products, certifications, and contact details. Perfect for lead generation, market research, and B2B prospecting.

EasyApi

5.0

UAE Business Directory Scraper

nickslam/yello-ae-scraper

The Actor scrapes UAE business directory data from yello.ae. Search by categories, keywords, or cities. Extract company info, reviews, contact details, product information, media, and more.

Nick

Bcorp Directory Scraper

fortuitous_pirate/bcorp-directory-scraper

Fortuitous Pirate

192.com Businesses Scraper

lexis-solutions/192-com-businesses-scraper

Effortlessly scrape business directory data from 192.com, the UK's top business directory. Extract names, contacts, locations, and sectors for market research, business intelligence, or CRM integration—fully automated and customizable for your needs.