Website Social Links Extractor

Pricing: $3.00/month + usage

An advanced actor that extracts official social media links (Facebook, X, LinkedIn, GitHub, etc.) from a list of websites. It uses Playwright to reliably scan modern JavaScript sites (SPAs). Smart logic filters out "junk" links and uses relevancy scoring to find the true company profile.
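The relevancy scoring is not documented in detail here. As a rough illustration only, the Python sketch below ranks candidate profile URLs by how closely their handle matches the site's domain name. The function names (`site_name`, `score_candidate`, `pick_best`) and the scoring rule itself are assumptions made for this example, not the actor's actual logic.

```python
from difflib import SequenceMatcher
from typing import List, Optional
from urllib.parse import urlparse


def site_name(url: str) -> str:
    """Return a rough brand name from a site URL, e.g. 'colorlib' for colorlib.com."""
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return host.split(".")[0]


def score_candidate(site_url: str, profile_url: str) -> float:
    """Score a profile URL from 0 to 1 by similarity of its handle to the site name."""
    path = urlparse(profile_url).path.strip("/")
    # Hypothetical rule: treat the last path segment as the handle, minus a leading '@'.
    handle = path.split("/")[-1].lstrip("@").lower() if path else ""
    if not handle:
        return 0.0
    return SequenceMatcher(None, site_name(site_url), handle).ratio()


def pick_best(site_url: str, candidates: List[str]) -> Optional[str]:
    """Return the highest-scoring candidate, or None when there are no candidates."""
    if not candidates:
        return None
    return max(candidates, key=lambda c: score_candidate(site_url, c))
```

With this heuristic, a share button such as `https://facebook.com/sharer` scores lower for `colorlib.com` than `https://facebook.com/colorlib`, which is the kind of "junk" filtering the description refers to.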

Developer: CodeScraper
Maintained by Community

🔗 Advanced Social Links Extractor – Bulk Website Social Scraper

This Apify actor crawls each website you supply and extracts its social media links. It accepts bulk URL lists, renders JavaScript content with Playwright, blocks unnecessary resources to speed up scraping, and detects social links on complex or dynamic websites.

The output is a clean, flat JSON object containing the detected links for each platform.


🚀 What It Does

For every website, the actor extracts:

  • 📘 Facebook URLs
  • 🐦 Twitter / X
  • 📸 Instagram
  • ▶️ YouTube channels / videos
  • 💼 LinkedIn company / profile
  • 📌 Pinterest
  • 🎵 TikTok
  • 💬 WhatsApp & WhatsApp Business
  • 📱 Messenger
  • 🎧 SoundCloud
  • 🎨 Behance / Dribbble
  • 🔗 Any other social profile URLs

It automatically normalizes, filters, and deduplicates social links.
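The exact normalization rules are not listed. The Python sketch below shows one plausible way to normalize and deduplicate links, assuming that host case, a leading `www.`, query strings, fragments, and trailing slashes are not significant. The helper names `normalize` and `dedupe` are hypothetical.

```python
from typing import Iterable, List
from urllib.parse import urlsplit, urlunsplit


def normalize(url: str) -> str:
    """Lower-case the host, then drop 'www.', the query, the fragment and a trailing slash."""
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    path = parts.path.rstrip("/")
    return urlunsplit((parts.scheme.lower(), host, path, "", ""))


def dedupe(urls: Iterable[str]) -> List[str]:
    """Normalize each URL and keep only the first occurrence, preserving order."""
    seen = set()
    result = []
    for url in urls:
        key = normalize(url)
        if key not in seen:
            seen.add(key)
            result.append(key)
    return result
```

Dropping the query string is a simplification: it suits profile URLs but would break links that carry their identity in a parameter, such as YouTube `watch?v=` URLs, so a real implementation would need per-platform exceptions.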


⚡ It Handles

  • ✅ Bulk website lists (unlimited size)
  • 🌐 Static + dynamic SPAs (React, Vue, Angular)
  • 🧠 Smart detection of hidden social icons
  • 🖼️ Blocks images, fonts, videos for faster scraping
  • 🔄 Automatic retry handling
  • 🚀 Playwright-based rendering for JavaScript sites
  • 🧹 Purges old dataset before each run

🧠 How It Works

The actor:

  1. Loads your list of URLs
  2. Uses PlaywrightCrawler to render each page
  3. Blocks heavy resources (images, fonts, media)
  4. Extracts the final rendered HTML
  5. Uses a smart parser to detect all social links
  6. Outputs clean JSON results into a dataset
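Steps 4 and 5 above can be sketched in Python using only the standard library. This is an illustration under stated assumptions, not the actor's parser: the `PLATFORM_HOSTS` table is a small hypothetical subset, and the names `HrefCollector`, `classify`, and `extract_social_links` are invented for the example.

```python
from html.parser import HTMLParser
from typing import Dict, List, Optional
from urllib.parse import urlparse

# Hypothetical subset of host -> output key; the actor's real table is not published.
PLATFORM_HOSTS = {
    "facebook.com": "facebook",
    "twitter.com": "twitter",
    "x.com": "twitter",
    "instagram.com": "instagram",
    "linkedin.com": "linkedin",
    "github.com": "github",
}


class HrefCollector(HTMLParser):
    """Collect the href of every <a> tag, whether or not the link has visible text."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def classify(href: str) -> Optional[str]:
    """Map a link to a platform key by exact host match, or None if unknown."""
    host = (urlparse(href).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return PLATFORM_HOSTS.get(host)


def extract_social_links(html: str) -> Dict[str, str]:
    """Return the first link found for each known platform in the rendered HTML."""
    collector = HrefCollector()
    collector.feed(html)
    found: Dict[str, str] = {}
    for href in collector.hrefs:
        platform = classify(href)
        if platform and platform not in found:
            found[platform] = href
    return found
```

Because the collector looks only at `href` attributes, an icon-only link such as `<a href="https://www.facebook.com/example"><i class="icon"></i></a>` is picked up just like a text link. The exact host match is deliberately strict, so subdomains such as `m.facebook.com` would need extra handling.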

⚙️ Input Configuration

| Field | Type | Description | Example Value |
| --- | --- | --- | --- |
| URLs to Scrape | Array | List of website URLs to scrape | ["https://example.com", "https://shop.com"] |

🧩 Example Input

{
  "URLs to Scrape": [
    "https://colorlib.com",
    "https://w3layouts.com",
    "https://templatemo.com"
  ]
}
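Before starting a large run, it can help to check the list locally. The Python sketch below parses an input document shaped like the example above and keeps only well-formed `http`/`https` URLs. The key name is taken from that example, and the `load_urls` helper is hypothetical and not part of the actor.

```python
import json
from typing import List
from urllib.parse import urlparse


def load_urls(raw_input: str, key: str = "URLs to Scrape") -> List[str]:
    """Parse the input JSON and return only the entries that look like http(s) URLs."""
    data = json.loads(raw_input)
    urls = data.get(key, [])
    if not isinstance(urls, list):
        raise ValueError(f"{key!r} must be an array of URLs")
    valid = []
    for url in urls:
        parsed = urlparse(str(url).strip())
        # Require both a web scheme and a host; anything else is skipped.
        if parsed.scheme in ("http", "https") and parsed.netloc:
            valid.append(parsed.geturl())
    return valid
```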

📊 Example Output

{
  "scrapedUrl": "https://example.com",
  "facebook": "https://facebook.com/example",
  "instagram": "https://instagram.com/example",
  "twitter": "https://twitter.com/example",
  "youtube": "https://youtube.com/example",
  "linkedin": "https://linkedin.com/company/example",
  "pinterest": "https://pinterest.com/example",
  "tiktok": "https://tiktok.com/@example",
  "reddit": "https://reddit.com/user/example",
  "snapchat": "https://snapchat.com/add/example",
  "whatsapp": "https://wa.me/1234567890",
  "telegram": "https://t.me/example",
  "discord": "https://discord.gg/example",
  "wechat": "https://wechat.com/example",
  "skype": "skype:example?call",
  "github": "https://github.com/example",
  "gitlab": "https://gitlab.com/example",
  "behance": "https://behance.net/example",
  "dribbble": "https://dribbble.com/example",
  "medium": "https://medium.com/@example",
  "substack": "https://substack.com/@example",
  "vimeo": "https://vimeo.com/example",
  "flickr": "https://flickr.com/photos/example",
  "soundcloud": "https://soundcloud.com/example",
  "spotify": "https://open.spotify.com/user/example",
  "mastodon": "https://mastodon.social/@example",
  "threads": "https://threads.net/@example",
  "xing": "https://xing.com/profile/example",
  "vk": "https://vk.com/example",
  "patreon": "https://patreon.com/example",
  "tumblr": "https://example.tumblr.com",
  "twitch": "https://twitch.tv/example",
  "email": "mailto:example@example.com",
  "tel": "tel:+1234567890"
}

If no links are found:

{
  "scrapedUrl": "https://example.com",
  "facebook": null,
  "instagram": null,
  "youtube": null
}

If the page fails to load:

{
  "scrapedUrl": "https://example.com",
  "error": "Failed to load page after 3 retries."
}
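Since a dataset can mix all three record shapes shown above, downstream code usually needs to tell them apart. The Python sketch below is one way to do that, assuming the records have already been loaded as dictionaries; the `partition` helper is hypothetical.

```python
from typing import Dict, List, Tuple

Record = Dict[str, object]


def partition(records: List[Record]) -> Tuple[List[Record], List[Record], List[Record]]:
    """Split records into (with_links, empty, failed) based on their fields."""
    with_links: List[Record] = []
    empty: List[Record] = []
    failed: List[Record] = []
    for record in records:
        if "error" in record:
            # The page never loaded, so there is nothing to inspect.
            failed.append(record)
        elif any(v is not None for k, v in record.items() if k != "scrapedUrl"):
            with_links.append(record)
        else:
            empty.append(record)
    return with_links, empty, failed
```

Failed records are good candidates for a second run, while empty ones loaded correctly and simply contained no recognizable links.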

🧠 Features

  • 🔍 Detects 30+ social networks
  • 🚀 Fast scraping with resource blocking
  • 🧩 Works on any website (static or JS-heavy)
  • 🛡️ Automatic retry logic
  • 🧹 Purges dataset before each run
  • 📦 Clean, flat output for CSV/Excel exports
  • 🌍 Fully optimized for Apify infrastructure
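Because every record is a flat object, it maps directly onto a spreadsheet row. Apify datasets can be exported to CSV from the platform itself; the Python sketch below only illustrates the same idea for local post-processing, using a hypothetical `to_csv` helper that writes `null` values as empty cells.

```python
import csv
import io
from typing import Dict, List, Optional


def to_csv(records: List[Dict[str, Optional[str]]]) -> str:
    """Render flat records as CSV text with one column per key seen in any record."""
    # Build the header from the union of keys, in first-seen order.
    columns: List[str] = []
    for record in records:
        for key in record:
            if key not in columns:
                columns.append(key)
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, restval="", lineterminator="\n")
    writer.writeheader()
    for record in records:
        # Missing keys fall back to restval; explicit None becomes an empty cell.
        writer.writerow({k: ("" if v is None else v) for k, v in record.items()})
    return buffer.getvalue()
```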

💡 Use Cases

  • Lead generation (collect social links from business websites)
  • Marketing enrichment (expand brand profiles)
  • Data aggregation tools
  • Competitor research
  • Automation workflows
  • SEO analysis
  • eCommerce store scraping

❓ FAQs

1. Does it work on JavaScript-heavy websites?

Yes. It renders the page using Playwright and waits until the network is idle.

2. Will it detect hidden or icon-only social buttons?

Yes. The extractor checks the href of every link, even when the link has no visible text.

3. Does it block heavy resources?

Yes. Images, videos, and fonts are blocked to speed up scraping.
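As a sketch of how this kind of blocking is commonly done with Playwright, the Python handler below aborts requests by resource type. It assumes Playwright's `image`, `media`, and `font` resource types correspond to the images, videos, and fonts mentioned above. The names `BLOCKED_TYPES` and `handle_route` are hypothetical, and the actor's own implementation may differ.

```python
# Resource types to skip; an assumption based on the description above.
BLOCKED_TYPES = {"image", "media", "font"}


def handle_route(route) -> None:
    """Abort requests for heavy resource types and let everything else through."""
    if route.request.resource_type in BLOCKED_TYPES:
        route.abort()
    else:
        route.continue_()


# With Playwright for Python, the handler would be registered on a page with:
#     page.route("**/*", handle_route)
```

Stylesheets and scripts are left alone here because a single-page app usually needs its scripts to render the links at all.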

4. Can I input 1000+ URLs?

Absolutely. Crawlee handles large lists efficiently.

5. What happens when a platform is not found?

The actor returns null values for missing platforms.


🧑‍💻 Developer Info

Author: codescraper
Email: codescraper011@gmail.com


🏷️ Tags

website-social-links-extractor · website-social-extractor · social-links-extractor · social-extractor · social-links · facebook · instagram · linkedin · youtube · scraper · website-crawler · bulk-scraper · apify · tiktok · twitter · marketing · automation · crawlee