PeachParser (Beta)

Crawls arbitrary websites, checks which are alive, and scans them for emails and social links. Filters common telemetry and template junk.

Pricing: Pay per event
Rating: 0.0 (0)
Developer: SLASH (Maintained by Community)
Actor stats: 3 bookmarks · 3 total users · 2 monthly active users · last modified 3 days ago


PeachParser

PeachParser is an Apify Actor developed by SLASH for crawling websites to extract:

  • Emails (from visible content and mailto: links)
  • Social profiles (Facebook, Instagram, LinkedIn, X, YouTube, TikTok, Pinterest)
  • Optional listing items from directory-like pages (for example, lists of choirs, restaurants, or organizations)

It is optimized for small to medium websites where you want both contact details and, optionally, all items listed on a directory page (such as https://www.sverigeskorforbund.se/korer).
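As an illustration, a run could be started from the Apify Python client. The actor ID `slash/peachparser` and the `start_urls` field are assumptions based on this README, not a confirmed input schema; `extract_listings`, `max_pages_per_site`, and `respect_robots_txt` are the inputs documented below.

```python
# Hypothetical run input for PeachParser. "start_urls" and the actor ID
# below are assumed names; extract_listings, max_pages_per_site, and
# respect_robots_txt are inputs documented in this README.
run_input = {
    "start_urls": [{"url": "https://www.sverigeskorforbund.se/korer"}],
    "max_pages_per_site": 20,
    "extract_listings": True,
    "respect_robots_txt": True,
}

# Starting the run would then look roughly like this:
# from apify_client import ApifyClient
# client = ApifyClient("<APIFY_TOKEN>")
# run = client.actor("slash/peachparser").call(run_input=run_input)
# items = client.dataset(run["defaultDatasetId"]).list_items().items
```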


Key Features

  1. Site-level contact extraction

    • Extracts emails from:
      • mailto: links
      • Visible page text (up to a configurable limit)
    • Filters:
      • Removes tracking / telemetry addresses
      • Blocks placeholder / bogus domains (e.g. mysite.com, *.wixpress.com)
      • Accepts emails that match the website’s domain or come from generic providers (Gmail, Outlook, etc.)
  2. Social profile detection

    • Detects social links from:
      • JSON-LD (sameAs arrays)
      • Regular anchor tags (<a href="...">)
    • Supports:
      • Facebook
      • Instagram
      • LinkedIn
      • X (Twitter)
      • YouTube
      • TikTok
      • Pinterest
    • Applies a brand token (derived from the hostname) to avoid irrelevant social links from third-party widgets when possible.
  3. Optional listing extraction

    • When extract_listings is enabled, PeachParser tries to identify listing items on pages:
      • Looks for same-domain <a> links with meaningful text
      • Skips obviously generic link text such as “read more”, “les mer”, “more info”, etc.
      • Avoids file downloads and non-HTML resources
    • Each listing item is stored as a separate dataset record with:
      • record_type = "listing_item"
      • item_name
      • item_url
      • item_source_page
  4. Smart crawling

    • Restricts crawling to a single domain (supports www. and bare domain equivalence)
    • Skips non-HTML responses and resources with unwanted file extensions (.pdf, images, archives, etc.)
    • Prioritizes URLs with contact-related keywords (kontakt, contact, om-oss, about, personvern, etc.)
    • Respects max_pages_per_site to control workload
  5. Robots.txt (optional)

    • When respect_robots_txt is enabled, PeachParser:
      • Fetches and parses robots.txt (with a short timeout and size limit)
      • Uses it to decide whether a URL may be crawled
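The email filtering in feature 1 can be sketched as below. The exact block lists and generic-provider set that PeachParser uses are not published, so the ones here are illustrative assumptions.

```python
import re
from urllib.parse import urlparse

# Illustrative sketch of the email filter described above; the concrete
# bogus-domain and generic-provider lists are assumptions, not PeachParser's.
GENERIC_PROVIDERS = {"gmail.com", "outlook.com", "hotmail.com", "yahoo.com"}
BOGUS_DOMAINS = {"mysite.com", "example.com"}
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

def extract_emails(page_text: str, site_url: str) -> set[str]:
    site_domain = urlparse(site_url).hostname.removeprefix("www.")
    found = set()
    for email in EMAIL_RE.findall(page_text):
        domain = email.split("@", 1)[1].lower()
        # Drop placeholder / template addresses (e.g. *.wixpress.com).
        if domain in BOGUS_DOMAINS or domain.endswith(".wixpress.com"):
            continue
        # Accept same-domain addresses and common free-mail providers.
        if domain == site_domain or domain in GENERIC_PROVIDERS:
            found.add(email.lower())
    return found
```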
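The brand-token heuristic from feature 2 might look like the following: a social link is kept only when the profile path contains a token derived from the site's hostname. PeachParser's actual matching rules may be more lenient, so treat this as a sketch.

```python
from urllib.parse import urlparse

# Hosts listed under "Social profile detection" above (plus twitter.com).
SOCIAL_HOSTS = {"facebook.com", "instagram.com", "linkedin.com", "x.com",
                "twitter.com", "youtube.com", "tiktok.com", "pinterest.com"}

def is_relevant_social(link: str, site_url: str) -> bool:
    # Normalize away the optional "www." prefix before comparing hosts.
    host = (urlparse(link).hostname or "").removeprefix("www.")
    if host not in SOCIAL_HOSTS:
        return False
    # Brand token: the first label of the site's hostname, e.g. "acmechoir".
    brand = urlparse(site_url).hostname.removeprefix("www.").split(".")[0]
    return brand.lower() in urlparse(link).path.lower()
```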
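Feature 3's listing-item filter can be approximated as follows; the generic-text and extension lists here are illustrative, but the record fields match the ones documented above.

```python
# Sketch of the listing-item heuristic. GENERIC_TEXT and SKIP_EXTENSIONS
# are illustrative; the output fields mirror the documented record shape.
GENERIC_TEXT = {"read more", "les mer", "more info", "learn more"}
SKIP_EXTENSIONS = (".pdf", ".jpg", ".png", ".zip", ".docx")

def listing_item(text: str, href: str, source_page: str, site_domain: str):
    text = " ".join(text.split())  # collapse whitespace in anchor text
    if not text or text.lower() in GENERIC_TEXT:
        return None  # obviously generic link text
    if site_domain not in href or href.lower().endswith(SKIP_EXTENSIONS):
        return None  # off-domain link or non-HTML resource
    return {
        "record_type": "listing_item",
        "item_name": text,
        "item_url": href,
        "item_source_page": source_page,
    }
```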
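For feature 5, the allow/deny decision can be reproduced with the standard library's `urllib.robotparser`; PeachParser's own fetcher (with its timeout and size limit) is not published, so this only shows the rule-checking step on an already-fetched robots.txt body.

```python
from urllib.robotparser import RobotFileParser

# Checks whether a URL may be crawled, given an already-fetched robots.txt.
# The user-agent string "PeachParser" is an assumption.
def allowed(robots_txt: str, url: str, agent: str = "PeachParser") -> bool:
    rp = RobotFileParser()
    rp.parse(robots_txt.splitlines())
    return rp.can_fetch(agent, url)
```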

Notes

  • Keep max_pages_per_site modest for reliability and to avoid hitting rate limits.
  • Results depend on site structure and the presence of contact information in public pages.
  • Respect terms of service and local laws.

Supported & planned regions

Region          Status        Details
Nordics         Optimized     Last optimized: 2025-11-11 (NO/SE/DK/FI/IS)
Western EU      Planned
Eastern EU      Planned
North America   Not started
South America   Not started
East/SE Asia    Not started
Middle East     Not started
Africa          Not started
Oceania         Not started

Create an issue if you’d like your country prioritized.


Disclaimer & License

This Apify Actor is provided “as is”, without warranty of any kind — express or implied — including but not limited to the warranties of merchantability, fitness for a particular purpose, and non-infringement. Please follow local laws and do not use for malicious purposes.

ToS & legality (Reminder): Great scraping comes with great responsibility. Follow local laws and do not use my code to spam.

I will find you

© 2025 SLASH. All rights reserved. Copying or modifying the source code is prohibited.