Website Social Links Extractor avatar
Website Social Links Extractor

Pricing

$3.00/month + usage

Go to Apify Store
Website Social Links Extractor

Website Social Links Extractor

An advanced actor that extracts official social media links (Facebook, X, LinkedIn, GitHub, etc.) from a list of websites. It uses Playwright to reliably scan modern JavaScript sites (SPAs). Smart logic filters out "junk" links and uses relevancy scoring to find the true company profile.

Pricing

$3.00/month + usage

Rating

5.0

(1)

Developer

CodeScraper

CodeScraper

Maintained by Community

Actor stats

1

Bookmarked

44

Total users

10

Monthly active users

2 months ago

Last modified

Share

๐Ÿ”— Advanced Social Links Extractor โ€“ Bulk Website Social Scraper

This Apify actor automatically crawls any website and extracts all social media links with high accuracy. It supports bulk URLs, renders JavaScript content using Playwright, blocks unnecessary resources for maximum speed, and intelligently detects social links even in complex or dynamic websites.

The output is a clean, flat JSON object containing the detected links for each platform.


๐Ÿš€ What It Does

For every website, the actor extracts:

  • ๐Ÿ“˜ Facebook URLs
  • ๐Ÿฆ Twitter / X
  • ๐Ÿ“ธ Instagram
  • โ–ถ๏ธ YouTube channels / videos
  • ๐Ÿ’ผ LinkedIn company / profile
  • ๐Ÿ“Œ Pinterest
  • ๐ŸŽต TikTok
  • ๐Ÿ’ฌ WhatsApp & WhatsApp Business
  • ๐Ÿ“ฑ Messenger
  • ๐ŸŽง SoundCloud
  • ๐ŸŽจ Behance / Dribbble
  • ๐Ÿ”— Any other social profile URLs

It automatically normalizes, filters, and deduplicates social links.


โšก It Handles

  • โœ… Bulk website lists (unlimited size)
  • ๐ŸŒ Static + dynamic SPAs (React, Vue, Angular)
  • ๐Ÿง  Smart detection of hidden social icons
  • ๐Ÿ–ผ๏ธ Blocks images, fonts, videos for faster scraping
  • ๐Ÿ”„ Automatic retry handling
  • ๐Ÿš€ Playwright-based rendering for JavaScript sites
  • ๐Ÿงน Purges old dataset before each run

๐Ÿง  How It Works

The actor:

  1. Loads your list of URLs
  2. Uses PlaywrightCrawler to render each page
  3. Blocks heavy resources (images, fonts, media)
  4. Extracts the final rendered HTML
  5. Uses a smart parser to detect all social links
  6. Outputs clean JSON results into a dataset

โš™๏ธ Input Configuration

FieldTypeDescriptionExample Value
URLs to ScrapeArrayList of website URLs to scrape["https://example.com", "https://shop.com"]

๐Ÿงฉ Example Input

{
"URLs to Scrape": [
"https://colorlib.com",
"https://w3layouts.com",
"https://templatemo.com"
]
}

๐Ÿ“Š Example Output

{
"scrapedUrl": "https://example.com",
"facebook": "https://facebook.com/example",
"instagram": "https://instagram.com/example",
"twitter": "https://twitter.com/example",
"youtube": "https://youtube.com/example",
"linkedin": "https://linkedin.com/company/example",
"pinterest": "https://pinterest.com/example",
"tiktok": "https://tiktok.com/@example",
"reddit": "https://reddit.com/user/example",
"snapchat": "https://snapchat.com/add/example",
"whatsapp": "https://wa.me/1234567890",
"telegram": "https://t.me/example",
"discord": "https://discord.gg/example",
"wechat": "https://wechat.com/example",
"skype": "skype:example?call",
"github": "https://github.com/example",
"gitlab": "https://gitlab.com/example",
"behance": "https://behance.net/example",
"dribbble": "https://dribbble.com/example",
"medium": "https://medium.com/@example",
"substack": "https://substack.com/@example",
"vimeo": "https://vimeo.com/example",
"flickr": "https://flickr.com/photos/example",
"soundcloud": "https://soundcloud.com/example",
"spotify": "https://open.spotify.com/user/example",
"mastodon": "https://mastodon.social/@example",
"threads": "https://threads.net/@example",
"xing": "https://xing.com/profile/example",
"vk": "https://vk.com/example",
"patreon": "https://patreon.com/example",
"tumblr": "https://example.tumblr.com",
"twitch": "https://twitch.tv/example",
"email": "mailto:example@example.com",
"tel": "tel:+1234567890"
}

If no links are found:

{
"scrapedUrl": "https://example.com",
"facebook": null,
"instagram": null,
"youtube": null
}

If the page fails to load:

{
"scrapedUrl": "https://example.com",
"error": "Failed to load page after 3 retries."
}

๐Ÿง  Features

  • ๐Ÿ” Detects 30+ social networks
  • ๐Ÿš€ Fast scraping with resource blocking
  • ๐Ÿงฉ Works on any website (static or JS-heavy)
  • ๐Ÿ›ก๏ธ Automatic retry logic
  • ๐Ÿงน Purges dataset before each run
  • ๐Ÿ“ฆ Clean, flat output for CSV/Excel exports
  • ๐ŸŒ Fully optimized for Apify infrastructure

๐Ÿ’ก Use Cases

  • Lead generation (collect social links from business websites)
  • Marketing enrichment (expand brand profiles)
  • Data aggregation tools
  • Competitor research
  • Automation workflows
  • SEO analysis
  • eCommerce store scraping

โ“ FAQs

1. Does it work on JavaScript-heavy websites?

Yes โ€” it renders the page using Playwright and waits until network idle.

2. Will it detect hidden or icon-only social buttons?

Yes โ€” the extractor checks all href links, even if they have no text.

3. Does it block heavy resources?

Yes โ€” images, videos, fonts are blocked for maximum speed.

4. Can I input 1000+ URLs?

Absolutely. Crawlee handles large lists efficiently.

The actor returns null values for missing platforms.


๐Ÿง‘โ€๐Ÿ’ป Developer Info

Author: codescraper Email: codescraper011@gmail.com


๐Ÿท๏ธ Tags

website-social-links-extractor . website-social-extractor . social-links-extractor . social-extractor ยท social-links ยท facebook ยท instagram ยท linkedin youtube ยท scraper ยท website-crawler ยท bulk-scraper ยท apify tiktok ยท twitter ยท marketing ยท automation ยท crawlee