Social Profile Extractor – Find Facebook, LinkedIn, Twitter

Pricing

from $3.50 / 1,000 social media profile extractors

Crawl any website to find all social media profiles: Facebook, LinkedIn, Twitter, Instagram, YouTube, TikTok, GitHub, etc. Priority crawling, page-level output, and a final consolidated list per domain. Pay per page ($3.50 per 1,000 pages) plus $0.20 per run. Residential proxy ready.

Developer

Scrape Pilot

Maintained by Community

🔍 Social Profile Extractor – Find Company Social Media Links (Facebook, LinkedIn, Twitter, Instagram, YouTube, TikTok, GitHub, etc.)

Crawl any website to discover social media profiles – Facebook, Twitter/X, LinkedIn, Instagram, YouTube, TikTok, Pinterest, GitHub, Threads, Snapchat, Reddit.
The Social Profile Extractor intelligently navigates a domain (prioritising contact, about, team, and footer pages) and extracts all social media links. Outputs page‑level findings immediately, plus a final consolidated list per domain. Perfect for lead enrichment, competitor research, influencer discovery, and B2B prospecting.


💡 What is the Social Profile Extractor?

The Social Profile Extractor is a specialised Apify actor that automatically finds all social media profiles associated with a given website. It works by:

  • Crawling up to a user‑defined number of pages per domain (default 15)
  • Prioritising pages likely to contain social links (contact, about, team, footer, support, social)
  • Extracting links to 13+ social platforms, including:
    • Facebook, Twitter/X, LinkedIn, Instagram, YouTube, TikTok, Pinterest, GitHub, Threads, Snapchat, Reddit
  • Filtering out share/intent links (e.g., sharer.php, ?whatsapp)
  • Producing two types of output:
    1. Page‑level items – as soon as a page is scanned (with any discovered social links or just a scan record).
    2. A final summary item – after the crawl finishes (or hits the page limit), containing all unique social profiles found.
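The prioritisation and same-domain steps above could be sketched roughly as follows. This is an illustrative sketch, not the actor's actual code: the keyword list is taken from the description above, while the helper names (`same_domain`, `priority`, `order_links`) and the two-level scoring are assumptions.

```python
from urllib.parse import urljoin, urlparse

# Keywords come from the description above; the scoring scheme is an assumption.
PRIORITY_KEYWORDS = ("contact", "about", "team", "footer", "support", "social")


def same_domain(url: str, domain: str) -> bool:
    """True if the URL's host matches the domain, ignoring a leading 'www.'."""
    host = (urlparse(url).hostname or "").lower()
    return host.removeprefix("www.") == domain.lower().removeprefix("www.")


def priority(url: str) -> int:
    """Lower value means crawl sooner; keyword pages come first."""
    path = urlparse(url).path.lower()
    return 0 if any(keyword in path for keyword in PRIORITY_KEYWORDS) else 1


def order_links(base_url: str, hrefs: list[str], domain: str, max_pages: int = 15) -> list[str]:
    """Resolve hrefs, keep unique same-domain pages, and put priority pages first."""
    seen: set[str] = set()
    candidates: list[str] = []
    for href in hrefs:
        url = urljoin(base_url, href).split("#")[0]  # resolve and drop fragments
        if urlparse(url).scheme not in ("http", "https"):
            continue  # skips mailto:, tel:, javascript:, etc.
        if same_domain(url, domain) and url not in seen:
            seen.add(url)
            candidates.append(url)
    # sorted() is stable, so discovery order is kept within each priority level.
    return sorted(candidates, key=priority)[:max_pages]
```

Calling `order_links("https://example.com/", ["/pricing", "/about", "/contact#form"], "example.com")` would place the about and contact pages ahead of the pricing page.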

The actor uses curl_cffi for high‑performance HTTP requests with Chrome impersonation, respects same‑domain navigation, and saves checkpoints to resume interrupted crawls.

Pay‑per‑event pricing – you are charged only for successfully processed pages ($0.0035 per page, $3.50 per 1,000 pages). Failed or blocked pages cost nothing. The actor start fee is $0.20 per run.


🚀 Key Features

| Feature | Description |
| --- | --- |
| 13+ social platforms | Facebook, Twitter, LinkedIn, Instagram, YouTube, TikTok, Pinterest, GitHub, Threads, Snapchat, Reddit. |
| Smart crawling | Prioritises contact, about, team, footer, and social‑related pages first. |
| Same‑domain only | Stays within the target website (no external links). |
| Page‑level immediate output | You see results as soon as each page is scanned – no need to wait for full crawl. |
| Final consolidated output | One summary item per domain with all unique social profile URLs. |
| Checkpoint & resume | Saves progress after every 5 pages; if interrupted, restarts where it left off. |
| Residential proxy ready | Bypasses anti‑bot measures (recommended for large crawls). |
| Pay‑per‑result pricing | $0.20 per run + $3.50 per 1,000 successfully crawled pages. Failed pages are free. |
| Clean JSON output | Easy to export to CSV, Excel, or integrate into CRMs. |

📥 Input Parameters

The actor accepts a JSON object with the following fields:

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| startUrls | array of objects | Yes | – | List of starting URLs (e.g., [{"url": "https://example.com"}]). |
| maxPagesPerDomain | integer | No | 15 | Maximum number of pages to crawl per domain. |
| concurrency | integer | No | 20 | Number of concurrent requests. |
| proxyConfiguration | object | No | – | Apify proxy configuration. Residential proxies strongly recommended. |

Example Input

{
  "startUrls": [
    {"url": "https://stripe.com"},
    {"url": "https://shopify.com"},
    {"url": "https://airbnb.com"}
  ],
  "maxPagesPerDomain": 20,
  "concurrency": 15,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}

📤 Output Fields

The actor pushes two types of items:

1. Page‑level item (status: "scanned" or "new_found")

Sent after each page is parsed. If new social profiles were discovered on that page, you get a "new_found" item; otherwise a "scanned" item with the current socials found (may be empty).

| Field | Type | Description |
| --- | --- | --- |
| domain | string | The website domain. |
| start_url | string | The original starting URL. |
| page_url | string | The crawled page URL. |
| page_title | string | Page title (from <title>). |
| description | string | Meta description or OG description. |
| social_profiles | object | Dictionary of platform → array of URLs found on this page. |
| platform | string | (only in new_found items) Platform name. |
| profiles | array | (only in new_found items) List of URLs for that platform. |
| pages_scanned | integer | Running total of scanned pages for this domain. |
| status | string | "scanned" or "new_found". |

2. Final consolidated item (status: "completed" or "partial")

Sent after the crawl finishes (reached maxPagesPerDomain or no more internal links).

| Field | Type | Description |
| --- | --- | --- |
| domain | string | The website domain. |
| start_url | string | Original starting URL. |
| social_profiles | object | Dictionary of platform → array of all unique URLs found across the whole crawl. |
| pages_scanned | integer | Total pages scanned. |
| status | string | "completed" (reached limit or finished) or "partial" (still links left but stopped). |

Example Output (Final Item)

{
  "domain": "stripe.com",
  "start_url": "https://stripe.com",
  "social_profiles": {
    "twitter": ["https://twitter.com/stripe"],
    "linkedin": ["https://linkedin.com/company/stripe"],
    "github": ["https://github.com/stripe"],
    "youtube": ["https://youtube.com/user/stripe"]
  },
  "pages_scanned": 12,
  "status": "completed"
}
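To illustrate how page‑level items relate to the final item, here is a minimal sketch that merges the `social_profiles` of several page‑level items into one summary with unique URLs per platform. It only relies on the field names documented above; the `consolidate` function and its `status` argument are hypothetical and not part of the actor.

```python
def consolidate(page_items: list[dict], status: str = "completed") -> dict:
    """Merge page-level items for one domain into a single summary item."""
    merged: dict[str, list[str]] = {}
    for item in page_items:
        for platform, urls in item.get("social_profiles", {}).items():
            bucket = merged.setdefault(platform, [])
            for url in urls:
                if url not in bucket:  # keep first-seen order, drop duplicates
                    bucket.append(url)
    first = page_items[0] if page_items else {}
    return {
        "domain": first.get("domain"),
        "start_url": first.get("start_url"),
        "social_profiles": merged,
        "pages_scanned": len(page_items),
        "status": status,  # assumed to be supplied by the caller
    }
```

This can be handy if you export the dataset mid‑run and want a consolidated view before the actor pushes its own final item.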

💰 Pricing

| Component | Price |
| --- | --- |
| Actor start (per run) | $0.20 |
| Per successful page | $0.0035 |
| Per 1,000 successful pages | $3.50 |
  • You are charged only for pages that are successfully fetched and parsed (HTTP 200 with HTML content).
  • Failed pages (404, 403, timeout, blocking) cost nothing.
  • Actor start fee covers the infrastructure for the run (even if zero pages are scraped).
  • Example: 500 successful pages = $0.20 + (500 × $0.0035) = $1.95.
  • Example: 2,500 successful pages = $0.20 + (2,500 × $0.0035) = $8.95.
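The arithmetic in the examples above can be reproduced with a small estimator. This is only a sketch based on the prices listed in this section; the `estimate_cost` function is hypothetical, and it uses `Decimal` to avoid floating‑point rounding on fractional cents.

```python
from decimal import Decimal

START_FEE = Decimal("0.20")         # per run, from the pricing table
PRICE_PER_PAGE = Decimal("0.0035")  # per successfully processed page


def estimate_cost(successful_pages: int, runs: int = 1) -> Decimal:
    """Estimated charge in USD for a number of successful pages across runs."""
    if successful_pages < 0 or runs < 1:
        raise ValueError("successful_pages must be >= 0 and runs must be >= 1")
    return START_FEE * runs + PRICE_PER_PAGE * successful_pages
```

Failed or blocked pages are not charged, so only the count of successful pages goes into the estimate.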

Checkpoint & resume: If the actor stops because of a spending limit or an interruption, restarting it continues from the saved state without re‑charging already processed pages.
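The general checkpoint‑and‑resume idea can be sketched as below. This is an assumption‑laden illustration, not the actor's implementation: it stores state in a local JSON file (the actor's real storage is not described here), and `load_checkpoint`, `save_checkpoint`, `crawl`, and the `fetch_links` callback are hypothetical names. Only the 5‑page interval comes from the feature list.

```python
import json
from pathlib import Path
from typing import Callable

CHECKPOINT_EVERY = 5  # pages, as stated in the feature list


def load_checkpoint(path: Path) -> dict:
    """Return the saved state, or an empty state if no checkpoint exists."""
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8"))
    return {"visited": [], "queue": []}


def save_checkpoint(path: Path, visited: list[str], queue: list[str]) -> None:
    """Write the state to a temp file first, then swap it into place."""
    tmp = path.with_suffix(".tmp")
    tmp.write_text(json.dumps({"visited": visited, "queue": queue}), encoding="utf-8")
    tmp.replace(path)


def crawl(start_url: str, path: Path,
          fetch_links: Callable[[str], list[str]], max_pages: int = 15) -> list[str]:
    """Breadth-first crawl that resumes from the checkpoint if one exists."""
    state = load_checkpoint(path)
    visited: list[str] = state["visited"]
    queue: list[str] = state["queue"] if visited else [start_url]
    while queue and len(visited) < max_pages:
        url = queue.pop(0)
        if url in visited:
            continue  # already processed (and charged) in an earlier run
        links = fetch_links(url)
        visited.append(url)
        queue.extend(u for u in links if u not in visited and u not in queue)
        if len(visited) % CHECKPOINT_EVERY == 0:
            save_checkpoint(path, visited, queue)
    save_checkpoint(path, visited, queue)
    return visited
```

Because visited pages are persisted, a second call with the same checkpoint file skips them and only fetches pages still in the queue.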


🛠 How to Use on Apify

  1. Create a task with this actor.
  2. Provide one or more start URLs – the actor will crawl each domain independently.
  3. Set maxPagesPerDomain – how many pages to scan per site (default 15, which usually finds all social links).
  4. Enable residential proxies – strongly recommended to avoid being blocked (especially for large sites).
  5. Run – the actor will crawl, output page‑level results immediately, and finally push one summary per domain.
  6. Export – download the dataset as JSON, CSV, or Excel.

Running via API

curl -X POST "https://api.apify.com/v2/acts/your-username~social-profile-extractor/runs" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -d '{
    "startUrls": [{"url": "https://stripe.com"}],
    "maxPagesPerDomain": 20,
    "proxyConfiguration": {"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]}
  }'
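The same request can be made from Python using only the standard library. The sketch below mirrors the curl call above; the actor ID `your-username~social-profile-extractor` and the token are placeholders, and `build_run_request` and `start_run` are hypothetical helper names. The response handling assumes the run object is returned under a top‑level `data` key.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request that starts an actor run."""
    return urllib.request.Request(
        url=f"{API_BASE}/acts/{actor_id}/runs",
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def start_run(actor_id: str, token: str, run_input: dict) -> dict:
    """Send the request and return the run object from the response."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)["data"]


# Example usage (placeholders, not real credentials):
# run = start_run(
#     "your-username~social-profile-extractor",
#     "YOUR_API_TOKEN",
#     {"startUrls": [{"url": "https://stripe.com"}], "maxPagesPerDomain": 20},
# )
# print(run["id"], run["status"])
```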

🎯 Use Cases

| Use Case | How It Helps |
| --- | --- |
| Lead enrichment | Add social media handles to your CRM records. |
| Competitor analysis | Quickly see which platforms competitors are active on. |
| Influencer discovery | Find social profiles of companies you want to partner with. |
| Sales prospecting | Use LinkedIn profiles to identify decision‑makers. |
| Brand monitoring | Check if a company’s social links are up‑to‑date. |
| Market research | Aggregate social presence across an industry. |

❓ Frequently Asked Questions

1. Which social platforms are supported?
Facebook, Twitter/X, LinkedIn, Instagram, YouTube, TikTok, Pinterest, GitHub, Threads, Snapchat, Reddit.

2. How many pages should I crawl?
Most social links are found within the first 10–15 pages (homepage, contact, about, footer). maxPagesPerDomain = 15 is sufficient for most sites. Increase to 30 for very large sites.

3. Do I need a proxy?
For a small number of domains (1–5) with maxPagesPerDomain ≤ 15, you may not need a proxy. For larger runs or if you see 429 errors, enable residential proxies.

4. What is the difference between the page‑level and final item?
Page‑level items (status: "scanned" or "new_found") are pushed after each page so you can see progress immediately. The final item (status: "completed" or "partial") contains all unique social profiles aggregated across all scanned pages.

5. Does it follow redirects?
Yes – the actor follows HTTP redirects (allow_redirects=True).

6. What happens if a page is blocked (Cloudflare, CAPTCHA)?
The actor logs a failure and moves to the next URL. No charge is applied for that page.

7. Can I crawl external websites (not same domain)?
No – the actor is designed to stay within the same domain. It will not follow links to other domains.

8. How are share links (e.g., twitter.com/intent/tweet) filtered out?
The actor ignores URLs containing sharer, intent, share, whatsapp, telegram, mailto, tel, etc. This prevents false positives.
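A filter along these lines might look like the sketch below. It is an approximation under stated assumptions, not the actor's real logic: the host‑to‑platform mapping is assumed, `classify` is a hypothetical name, `mailto:` and `tel:` are handled by rejecting non‑HTTP schemes, and the remaining tokens are matched as plain substrings of the path and query (which can over‑filter profile names that happen to contain "share" or "intent").

```python
from typing import Optional
from urllib.parse import urlparse

# Assumed host names for the platforms named in this README.
PLATFORM_HOSTS = {
    "facebook.com": "facebook",
    "twitter.com": "twitter",
    "x.com": "twitter",
    "linkedin.com": "linkedin",
    "instagram.com": "instagram",
    "youtube.com": "youtube",
    "tiktok.com": "tiktok",
    "pinterest.com": "pinterest",
    "github.com": "github",
    "threads.net": "threads",
    "snapchat.com": "snapchat",
    "reddit.com": "reddit",
}

# Tokens from the answer above; mailto and tel are covered by the scheme check.
SHARE_TOKENS = ("sharer", "intent", "share", "whatsapp", "telegram")


def classify(url: str) -> Optional[str]:
    """Return the platform name for a profile URL, or None if it is ignored."""
    parts = urlparse(url)
    if parts.scheme not in ("http", "https"):
        return None  # drops mailto:, tel:, javascript:, etc.
    host = (parts.hostname or "").lower()
    platform = next(
        (name for domain, name in PLATFORM_HOSTS.items()
         if host == domain or host.endswith("." + domain)),
        None,
    )
    if platform is None:
        return None  # not a known social platform
    target = (parts.path + "?" + parts.query).lower()
    if any(token in target for token in SHARE_TOKENS):
        return None  # share/intent widget rather than a profile
    return platform
```

With this sketch, `https://twitter.com/stripe` is classified as `twitter`, while `https://twitter.com/intent/tweet?text=hi` and `mailto:` links are ignored.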

9. What is the actor start fee for?
It covers the cost of launching the container, setting up proxies, and managing checkpoints – even if the crawl returns no pages.

10. Is there a free trial?
Apify gives new users $5 monthly credit, which is enough to test this actor on several domains. After that, pay‑as‑you‑go or subscription options are available.




Start extracting social profiles today – $0.20 per run + $3.50 per 1,000 pages. Find every social link for any website.