Facebook Page Scraper

Pricing

$19.99/month + usage

Developed by

Scraper Engine

Maintained by Community

Facebook Page Scraper collects data from public Facebook pages, including page info, like and follower counts, and contact details. It automates public page extraction for research, analytics, or marketing insights, and exports clean JSON or CSV data for easy integration into workflows.

5.0 (4)

Last modified

a day ago

Facebook Page Scraper 📘

Extract page-level information from public Facebook pages at scale, with automatic proxy fallback. This Apify Actor scrapes page details such as likes, followers, and contact information, which makes it a good fit for lead generation, market research, and competitive analysis.

🌟 Why Choose This Actor?

  • 🚀 Bulk Processing: Scrape multiple Facebook pages simultaneously
  • 🔄 Smart Proxy Fallback: Automatic fallback from direct connection → datacenter proxy → residential proxy with 3 retries
  • 📊 Comprehensive Data: Extract likes, followers, email, website, categories, and more
  • ✅ High Success Rate: Robust error handling and retry logic ensure maximum data extraction
  • 💾 Structured Output: Clean, organized JSON output ready for analysis
  • ⚡ Fast & Efficient: Optimized for performance with async processing

🎯 Key Features

Advanced Proxy Management

  • No Proxy by Default: Starts with direct connection for maximum speed
  • Automatic Fallback: If Facebook blocks the request, automatically switches to datacenter proxy
  • Residential Proxy: Falls back to residential proxy if datacenter is blocked
  • Sticky Proxy: Once fallback occurs, sticks with residential proxy for all remaining requests
  • 3 Retry Attempts: Retries up to 3 times with residential proxy before giving up
  • Clear Logging: All proxy events are logged for transparency
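
The fallback sequence above can be sketched roughly as follows. This is only an illustration of the described behavior, not the actor's actual source: the `Blocked` exception, the `FallbackFetcher` class, and the `fetch(url, proxy_group)` callable are hypothetical names, and "3 retries" is interpreted here as three attempts on the residential tier.

```python
from typing import Callable, Optional

# Proxy tiers in the order the README describes; None means a direct connection.
TIERS = [None, "DATACENTER", "RESIDENTIAL"]
RESIDENTIAL_ATTEMPTS = 3  # assumption: "3 retries" read as three attempts in total


class Blocked(Exception):
    """Hypothetical signal raised by the fetcher when Facebook blocks a request."""


class FallbackFetcher:
    """Escalates direct -> datacenter -> residential and stays on the tier reached."""

    def __init__(self, fetch: Callable[[str, Optional[str]], str]):
        self._fetch = fetch  # hypothetical callable: fetch(url, proxy_group) -> HTML
        self._tier = 0       # index into TIERS; it only ever increases (sticky)

    def get(self, url: str) -> Optional[str]:
        while True:
            group = TIERS[self._tier]
            attempts = RESIDENTIAL_ATTEMPTS if group == "RESIDENTIAL" else 1
            for attempt in range(1, attempts + 1):
                try:
                    return self._fetch(url, group)
                except Blocked:
                    # Log every proxy event, as the feature list promises.
                    print(f"blocked: tier={group or 'direct'} attempt={attempt}")
            if self._tier == len(TIERS) - 1:
                return None   # residential attempts exhausted; give up on this URL
            self._tier += 1   # escalate, and keep the new tier for later URLs
```

Because the tier index is stored on the object and never reset, every request after an escalation starts from the tier that was last reached, which is one way to get the "sticky" behavior.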

Data Extraction

  • Page Information: Title, category, introduction/bio
  • Engagement Metrics: Likes count, followers count
  • Contact Details: Email address, website URL
  • Visual Assets: Profile picture URL, cover photo URL
  • Page Identifiers: Page ID, page name/vanity URL
  • Business Info: Business page status, ad library ID

📥 Input

The actor accepts the following input configuration:

{
  "startUrls": [
    "https://www.facebook.com/copperkettleyqr",
    "https://www.facebook.com/ChrisBrecheensWritingAboutWriting",
    "pagename"
  ],
  "maxPages": 10,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}

Input Parameters

Parameter          | Type    | Required | Description
------------------ | ------- | -------- | -----------
startUrls          | Array   | ✅ Yes   | List of Facebook page URLs or usernames. Supports both full URLs (https://www.facebook.com/pagename) and usernames (pagename)
maxPages           | Integer | ❌ No    | Maximum number of pages to scrape (1-1000). Default: 10. Use this to limit scraping scope
proxyConfiguration | Object  | ❌ No    | Proxy settings. Default: no proxy. Enable useApifyProxy: true to use Apify proxies
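
Since `startUrls` accepts both full URLs and bare usernames, it can help to normalize a list before submitting it. The sketch below is a hypothetical pre-processing helper, not part of the actor: `normalize_page` and `prepare_targets` are made-up names, and the de-duplication step is an assumption rather than documented behavior.

```python
from urllib.parse import urlparse

BASE = "https://www.facebook.com/"


def normalize_page(entry: str) -> str:
    """Turn a username or a full URL into a canonical page URL."""
    entry = entry.strip()
    if entry.startswith(("http://", "https://")):
        path = urlparse(entry).path.strip("/")
    else:
        path = entry.strip("/")
    if not path:
        raise ValueError(f"no page name in {entry!r}")
    return BASE + path


def prepare_targets(start_urls: list[str], max_pages: int = 10) -> list[str]:
    """Validate maxPages (1-1000), drop duplicates, and cap the list."""
    if not 1 <= max_pages <= 1000:
        raise ValueError("maxPages must be between 1 and 1000")
    seen, targets = set(), []
    for entry in start_urls:
        url = normalize_page(entry)
        if url not in seen:  # keep the first occurrence, preserve input order
            seen.add(url)
            targets.append(url)
    return targets[:max_pages]
```

For example, `prepare_targets(["pagename", "https://www.facebook.com/pagename/"])` collapses both spellings into a single target.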

Proxy Configuration Options

{
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}

  • useApifyProxy: Set to true to enable Apify proxy
  • apifyProxyGroups: Optional. Specify proxy groups like ["RESIDENTIAL"], ["DATACENTER"], or leave empty for auto
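
If you start runs from code rather than the Console, the input can be assembled programmatically. The following is a minimal sketch that assumes the `apify-client` Python package is installed; `build_run_input` and `run_actor` are hypothetical helper names, and the actor ID and API token are placeholders you would need to supply yourself.

```python
def build_run_input(pages, max_pages=10, use_proxy=False, proxy_groups=None):
    """Assemble the input object described above (helper name is hypothetical)."""
    proxy = {"useApifyProxy": use_proxy}
    if use_proxy and proxy_groups:
        # Omit apifyProxyGroups entirely to let Apify choose automatically.
        proxy["apifyProxyGroups"] = list(proxy_groups)
    return {
        "startUrls": list(pages),
        "maxPages": max_pages,
        "proxyConfiguration": proxy,
    }


def run_actor(token: str, actor_id: str, run_input: dict) -> list:
    """Start a run, wait for it to finish, and return the dataset items."""
    from apify_client import ApifyClient  # requires: pip install apify-client

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call might look like `run_actor(token, actor_id, build_run_input(["pagename"], use_proxy=True, proxy_groups=["RESIDENTIAL"]))`, where `token` and `actor_id` come from your own Apify account and this actor's Store page.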

📤 Output

The actor outputs structured data to the Apify dataset. Each page produces one record:

{
  "facebookUrl": "https://www.facebook.com/copperkettleyqr",
  "categories": ["Pizza place"],
  "info": [
    "The Copper Kettle Restaurant, Regina. 2,660 likes",
    "14 talking about this · 1,119 were here. Longstanding local restaurant."
  ],
  "likes": 2660,
  "messenger": null,
  "title": "The Copper Kettle Restaurant",
  "pageId": "100064027242849",
  "pageName": "copperkettleyqr",
  "pageUrl": "https://www.facebook.com/copperkettleyqr",
  "intro": "Longstanding local restaurant. Mediterranean specialties, signature gourmet pizzas",
  "websites": ["https://thecopperkettle.online"],
  "email": "copperkettle.events@gmail.com",
  "website": "thecopperkettle.online",
  "followers": 2900,
  "profilePictureUrl": "https://scontent.fdac142-1.fna.fbcdn.net/...",
  "coverPhotoUrl": "https://scontent.fdac142-1.fna.fbcdn.net/...",
  "about_me": {
    "text": "Longstanding local restaurant. Mediterranean specialties..."
  },
  "facebookId": "100064027242849",
  "pageAdLibrary": {
    "is_business_page_active": false,
    "id": "851606664870954"
  },
  "success": true
}

Output Fields

Field             | Type    | Description
----------------- | ------- | -----------
facebookUrl       | String  | Original Facebook page URL
title             | String  | Page title/name
categories        | Array   | Page categories (e.g., "Restaurant", "Local Business")
likes             | Integer | Total number of page likes
followers         | Integer | Total number of followers
email             | String  | Contact email address (if available)
website           | String  | Website domain
websites          | Array   | Full website URLs
intro             | String  | Page introduction/bio text
pageId            | String  | Facebook page ID
pageName          | String  | Page vanity name/username
profilePictureUrl | String  | Profile picture image URL
coverPhotoUrl     | String  | Cover photo image URL
info              | Array   | Additional page information
facebookId        | String  | Facebook ID
pageAdLibrary     | Object  | Business page and ad library information
success           | Boolean | Whether scraping was successful
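
As an illustration of working with these records downstream, the sketch below keeps only successful results and flattens a few fields into CSV. The field names come from the table above; the helper names `to_lead_rows` and `to_csv` and the choice of columns are assumptions made for this example.

```python
import csv
import io

# Columns chosen for this example; any field from the table above would work.
LEAD_FIELDS = ["title", "pageUrl", "email", "website", "likes", "followers"]


def to_lead_rows(records):
    """Keep successful records and pick the lead fields; missing values stay None."""
    rows = []
    for rec in records:
        if not rec.get("success"):
            continue  # skip pages that could not be scraped
        rows.append({field: rec.get(field) for field in LEAD_FIELDS})
    return rows


def to_csv(rows) -> str:
    """Serialize rows to CSV text; None values are written as empty cells."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=LEAD_FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

Using `rec.get(field)` rather than `rec[field]` keeps the helper tolerant of pages where a field such as `email` is absent or null.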

🚀 How to Use the Actor (via Apify Console)

Step 1: Access the Actor

  1. Log in to Apify Console
  2. Navigate to Actors in the left sidebar
  3. Find Facebook Page Scraper and click it

Step 2: Configure Input

  1. Click the Input tab
  2. Add your Facebook page URLs in the startUrls field:
    • Enter full URLs: https://www.facebook.com/pagename
    • Or just usernames: pagename
    • One URL per line for bulk input
  3. Set maxPages to limit the number of pages to scrape
  4. Configure proxy settings:
    • Leave disabled for direct connection (recommended for testing)
    • Enable for production use with automatic fallback

Step 3: Run the Actor

  1. Click the green Start button
  2. The actor will begin processing your pages

Step 4: Monitor Progress

  1. Watch the Log tab for real-time updates:
    • 📄 Page processing status
    • 🔗 Proxy usage and fallback events
    • ✅ Success/failure messages
    • 📊 Progress statistics

Step 5: Access Results

  1. Once complete, click the Dataset tab
  2. View results in table format
  3. Click Export to download as:
    • JSON
    • CSV
    • Excel
    • RSS

💡 Best Use Cases

📈 Lead Generation

  • Extract contact information from business pages
  • Build targeted email lists
  • Identify potential customers or partners
  • Gather business intelligence

🔍 Market Research

  • Analyze competitor pages
  • Track engagement metrics
  • Monitor industry trends
  • Identify market opportunities

📊 Data Analysis

  • Collect page statistics at scale
  • Compare multiple pages
  • Track page growth over time
  • Build comprehensive databases

🎯 Marketing & Outreach

  • Find contact details for outreach campaigns
  • Identify influencers and brand ambassadors
  • Research target audience demographics
  • Build marketing lists

โ“ Frequently Asked Questions

Q: What types of Facebook pages can I scrape?

A: The actor works with public Facebook pages including business pages, community pages, public figures, and organizations. Private or restricted pages cannot be scraped.

Q: How does the proxy fallback work?

A: The actor follows this sequence:

  1. Direct connection (no proxy) - fastest option
  2. If blocked → Datacenter proxy - good balance of speed and anonymity
  3. If blocked → Residential proxy - highest success rate
  4. Retries up to 3 times with residential proxy
  5. All subsequent requests use the successful proxy type

Q: Can I scrape pages without logging in?

A: Yes, this actor scrapes publicly available information without requiring Facebook login credentials.

Q: How many pages can I scrape at once?

A: You can scrape up to 1000 pages in a single run (configurable via maxPages parameter). For larger batches, run the actor multiple times.

Q: What if a page doesn't have contact information?

A: The actor will still extract all available information. Missing fields will be null in the output.

Q: How accurate is the follower/likes count?

A: The actor extracts the exact numbers displayed on the page. Counts are accurate as of the scraping time.

Q: Can I schedule regular scraping?

A: Yes! Use Apify's Scheduler feature to run the actor automatically at specified intervals (hourly, daily, weekly, etc.).

Q: How much does it cost to run?

A: On top of the $19.99/month subscription, usage costs depend on:

  • Compute time: Based on page count and processing time
  • Proxy usage: Residential proxies cost more than datacenter
  • Data transfer: Minimal cost for page data

Typical costs:

  • 10 pages: ~$0.01-0.05
  • 100 pages: ~$0.10-0.50
  • 1000 pages: ~$1-5
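
The three figures above all work out to roughly $0.001-$0.005 per page, so a rough estimate for other batch sizes can be sketched as below. The linear per-page rate is an inference from the listed ranges, not an official price, and `estimate_usage_cost` is a hypothetical helper name.

```python
PER_PAGE_LOW = 0.001   # USD per page, inferred from "10 pages: ~$0.01"
PER_PAGE_HIGH = 0.005  # USD per page, inferred from "10 pages: ~$0.05"


def estimate_usage_cost(pages: int) -> tuple[float, float]:
    """Return a (low, high) usage-cost estimate in USD, excluding the monthly fee."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    # Round to four decimals to hide floating-point noise in the product.
    return round(pages * PER_PAGE_LOW, 4), round(pages * PER_PAGE_HIGH, 4)
```

Actual costs depend on which proxy tier a run ends up using, so treat the result as an order-of-magnitude guide.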

🛡️ Support and Feedback

Need Help?

Feature Requests

We're constantly improving! If you have suggestions:

  1. Contact us via email or Discord
  2. Describe your use case
  3. We'll consider it for future updates

Rate This Actor

If you find this actor helpful, please leave a review on the Apify Store! Your feedback helps us improve and helps others discover this tool.

Terms of Service

  • This actor only accesses publicly available data
  • Respects Facebook's robots.txt and rate limits
  • Does not access private pages or restricted content
  • Does not bypass any authentication or paywalls

Responsible Use

  • Respect Privacy: Only scrape public information
  • Follow Laws: Ensure compliance with:
    • Facebook Terms of Service
    • GDPR (if applicable)
    • Local data protection laws
    • Anti-spam regulations
  • Rate Limiting: Use reasonable request rates
  • Attribution: Credit data sources when appropriate

Disclaimer

The end user is solely responsible for:

  • Ensuring legal compliance in their jurisdiction
  • Respecting intellectual property rights
  • Following Facebook's Terms of Service
  • Ethical use of extracted data

This actor is provided as a tool for legitimate business intelligence, research, and lead generation purposes. Misuse of this tool is not endorsed or supported.

🔧 Technical Details

Architecture

  • Language: Python 3.11
  • Framework: Apify SDK
  • HTTP Requests: requests library
  • HTML Parsing: BeautifulSoup4
  • Async Support: asyncio

Performance

  • Speed: ~5-10 seconds per page (depending on data complexity)
  • Memory: ~256MB per worker
  • Scalability: Up to 1,000 pages per run (the maxPages limit); split larger batches across multiple runs

Data Storage

  • Dataset: Structured JSON output
  • Key-Value Store: Configuration and state (if needed)

📊 Version History

Version 1.0 (Current)

  • ✅ Initial release
  • ✅ Advanced proxy fallback logic
  • ✅ Comprehensive data extraction
  • ✅ Bulk processing support
  • ✅ Detailed logging
  • ✅ Structured output

Planned Features

  • 🔜 Post scraping from pages
  • 🔜 Comment extraction
  • 🔜 Historical data tracking
  • 🔜 Advanced filtering options
  • 🔜 Export to multiple formats

🎉 Get Started Now!

Ready to extract Facebook page data at scale?

👉 Run the Actor 👈

Made with ❤️ for the Apify Community

Happy Scraping! 🚀