PeachParser avatar
PeachParser

Pricing

Pay per event

Go to Apify Store
PeachParser

PeachParser

Developed by

SLASH

SLASH

Maintained by Community

Crawl arbitrary websites, checks which are alive, and crawls them for emails and social links. Filters common telemetry and template junk.

0.0 (0)

Pricing

Pay per event

3

2

1

Last modified

9 days ago

This Actor crawls one or more websites and extracts emails plus social links. It stays on the same domain, prioritizes contact/about pages, and limits the crawl to a small number of pages for speed and safety.

Features

  • Same-domain crawl
  • Email discovery (from mailto: links and visible text)
  • Social links (Facebook, Instagram, LinkedIn, X/Twitter, YouTube, TikTok, Pinterest)
  • Lightweight and configurable page cap per site
  • Optional robots.txt compliance

Input

Defined in input_schema.json.

FieldTypeDescription
start_urlsarrayRoot websites to crawl (same-domain only)
max_pages_per_siteintegerMax pages to visit per website (default 6)
timeout_secondsintegerPer-request read timeout (default 30)
respect_robots_txtbooleanObey robots.txt if true (default false)

Example:

{
"start_urls": [
{"url": "https://www.goderom.no"},
{"url": "https://www.goderom.net"},
{"url": "https://www.wagoi.eu"},
{"url": "https://datsproject.io"},
{"url": "https://denet.pro"}
],
"max_pages_per_site": 6,
"timeout_seconds": 30,
"respect_robots_txt": false
}

Output

Each dataset item summarizes the site:

  • root_url, website_details (ok/unavailable/n/a), pages_crawled
  • email1..email5
  • social_facebook, social_instagram, social_linkedin, social_x, social_youtube, social_tiktok, social_pinterest

Example:

{
"root_url": "https://example.com",
"website_details": "ok",
"pages_crawled": 6,
"email1": "info@example.com",
"social_facebook": "https://www.facebook.com/example",
"social_instagram": "n/a",
"social_linkedin": "n/a",
"social_x": "n/a",
"social_youtube": "n/a",
"social_tiktok": "n/a",
"social_pinterest": "n/a"
}

Notes

  • Keep max_pages_per_site modest for reliability and to avoid hitting rate limits.
  • Results depend on site structure and the presence of contact information in public pages.
  • Respect terms of service and local laws.

Disclaimer & License

This Apify Actor is provided "as is", without warranty of any kind — express or implied — including but not limited to the warranties of merchantability, fitness for a particular purpose, and non-infringement. Use it, modify it, break it, or improve it — but you do so at your own risk.

© 2025 SLSH. All rights reserved. Copying or modifying the source code is prohibited.