Facebook Groups Scraper
Pricing
$19.99/month + usage
🔍 Facebook Groups Scraper (facebook-groups-scraper) extracts public group data at scale — posts, comments, reactions, member lists, join links & metadata. 📊 Export to CSV/JSON for lead gen, research & community insights. ⚙️ Fast, reliable, compliant. 🚀 Ideal for marketers & analysts.
Rating
0.0
(0)
Developer
ScrapeBase
Actor stats
Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: a day ago
Facebook Groups Scraper
Facebook Groups Scraper extracts structured post data (text, reactions, comments, media, and group metadata) from public Facebook groups. It replaces manual copy-paste with automated extraction and offers sorting and filtering controls for analysts, marketers, and developers. Typical uses include social listening, research, and pipelines that export Facebook group posts to CSV or JSON for analytics. 🚀
What data / output can you get?
Below are the exact fields this Facebook group posts scraper saves to the Apify dataset for each post. You can download Facebook group data as CSV, JSON, or Excel from the Apify platform.
| Data field | Description | Example value |
|---|---|---|
| facebookUrl | URL of the Facebook group the post belongs to | https://www.facebook.com/groups/example-group |
| url | Direct permalink to the post (when available) | https://www.facebook.com/groups/.../posts/... |
| time | ISO8601 timestamp in UTC for post creation | 2024-03-05T14:22:17.000Z |
| id | Unique post story identifier | 1234567890123456 |
| legacyId | Legacy post ID (post_id) | 987654321098765 |
| feedbackId | Post feedback identifier | 1122334455667788 |
| user.id | Author’s user ID | 100012345678901 |
| user.name | Author’s display name | Jane Doe |
| text | Plain-text content of the post | Looking for recommendations… |
| attachments | Array of media or album objects (photos with image/thumbnail or albums with mediaset_token) | [{ "thumbnail": "...", "image": {...} }, { "mediaset_token": "..." }] |
| likesCount | Total number of likes (aggregate) | 124 |
| sharesCount | Total number of shares | 7 |
| commentsCount | Total number of comments | 32 |
| topReactionsCount | Sum of top reactions | 140 |
| reactionLikeCount | Number of Like reactions | 120 |
| reactionLoveCount | Number of Love reactions | 20 |
| topComments | Array of up to 2 highlighted comments with commenter metadata | [{ "commentUrl": "...", "profileName": "..." }] |
| facebookId | Group ID where the post was published | 123456789012345 |
| groupTitle | Group title, when available | Example Group |
| pageAdLibrary.is_business_page_active | Ad library flag (boolean) | false |
| pageAdLibrary.id | Group/page identifier used in ad library context | 123456789012345 |
| inputUrl | The original input group URL used for scraping | https://www.facebook.com/groups/example-group |
Note: attachments can be either photos (with image, thumbnail, dimensions, and owner info) or media sets/albums (with mediaset_token), and topComments include a subset of top-level comments with commenter details. Data is export-ready for CSV, JSON, or Excel.
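Nested fields such as user.id and the attachments array do not map directly onto CSV columns. The following is a minimal sketch of one way to flatten a downloaded item into a row, assuming items have the shape listed in the table above. The column selection, the attachmentCount column, and the function names are illustrative choices, not part of the actor.

```python
import csv
import io

# Column order for the flattened export; nested fields use dotted names.
# "attachmentCount" is a derived column introduced for this sketch.
COLUMNS = ["id", "time", "user.id", "user.name", "text",
           "likesCount", "sharesCount", "commentsCount", "attachmentCount"]


def flatten_post(post: dict) -> dict:
    """Turn one dataset item into a flat row; missing fields become empty strings."""
    user = post.get("user") or {}
    return {
        "id": post.get("id", ""),
        "time": post.get("time", ""),
        "user.id": user.get("id", ""),
        "user.name": user.get("name", ""),
        "text": post.get("text", ""),
        "likesCount": post.get("likesCount", ""),
        "sharesCount": post.get("sharesCount", ""),
        "commentsCount": post.get("commentsCount", ""),
        "attachmentCount": len(post.get("attachments") or []),
    }


def posts_to_csv(posts: list) -> str:
    """Serialize a list of dataset items to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    for post in posts:
        writer.writerow(flatten_post(post))
    return buffer.getvalue()


# Sample item built from the example values in the field table.
sample = {
    "id": "1234567890123456",
    "time": "2024-03-05T14:22:17.000Z",
    "user": {"id": "100012345678901", "name": "Jane Doe"},
    "text": "Looking for recommendations",
    "likesCount": 124,
    "sharesCount": 7,
    "commentsCount": 32,
}
print(posts_to_csv([sample]))
```

The platform's built-in CSV export may already cover most needs; a custom flattener like this is mainly useful when you want control over which nested fields become columns.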
Key features
- 🔍 Automatic doc_id, node_id, end_cursor discovery
  Automatically discovers GraphQL parameters (doc_id, node_id, end_cursor) from group HTML/JS, so the scraper can adapt to Facebook frontend changes.
- 🧭 Smart pagination through group feeds
  Uses cursor-based pagination to traverse group discussions and streams results live to the Apify dataset.
- 🏠 Residential proxy enforced + auto-retry
  Always uses Apify Residential proxy and retries automatically until the requested number of results has been collected.
- 🔄 Flexible post sorting modes
  Choose viewOption from CHRONOLOGICAL, RECENT_ACTIVITY, TOP_POSTS, or CHRONOLOGICAL_LISTINGS to match the task.
- 🎯 Advanced filtering controls
  Filter by searchGroupKeyword, searchGroupYear, and onlyPostsNewerThan (e.g., posts from 2024 that match a one- or two-letter query).
- 🛡️ Anti-blocking behavior
  Randomized delays and retry logic help minimize detection and keep your runs stable over time.
- 📦 Export-ready structured data
  Results stream to the dataset in real time, so you can download Facebook group data as CSV or JSON and plug it into BI tools and workflows.
- 🧰 Developer-friendly via Apify platform
  Access your datasets programmatically using the Apify API for ETL, analytics, and enrichment workflows.
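The actor's internal code is not shown on this page. To illustrate what cursor-based pagination with randomized delays and retries looks like in general, here is a minimal, generic sketch. The fetch_page callback, the retry count, and the delay range are hypothetical and are not taken from the actor.

```python
import random
import time
from typing import Callable, Iterator, Optional

# PAGES and fake_fetch stand in for a real paginated feed (hypothetical data).
# Each entry maps a cursor to (items, next_cursor); None marks the last page.
PAGES = {
    None: ([{"id": "1"}, {"id": "2"}], "c1"),
    "c1": ([{"id": "3"}], None),
}


def fake_fetch(cursor: Optional[str]):
    return PAGES[cursor]


def paginate(fetch_page: Callable, limit: int, max_retries: int = 3,
             delay_range=(1.0, 3.0),
             sleep: Callable[[float], None] = time.sleep) -> Iterator[dict]:
    """Yield up to `limit` items, following cursors and retrying failed fetches."""
    cursor = None
    yielded = 0
    while yielded < limit:
        for attempt in range(max_retries + 1):
            try:
                items, next_cursor = fetch_page(cursor)
                break
            except ConnectionError:
                if attempt == max_retries:
                    raise  # give up after the last allowed retry
                sleep(random.uniform(*delay_range))  # randomized back-off
        for item in items:
            if yielded >= limit:
                return
            yield item
            yielded += 1
        if next_cursor is None or yielded >= limit:
            return
        cursor = next_cursor
        sleep(random.uniform(*delay_range))  # randomized pause between pages


# sleep is replaced with a no-op so the demo runs instantly.
posts = list(paginate(fake_fetch, limit=10, sleep=lambda s: None))
print([p["id"] for p in posts])
```

Passing the sleep function as a parameter keeps the loop testable without real waiting.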
How to use Facebook Groups Scraper - step by step
- Create or log in to your Apify account.
- Open the Facebook Groups Scraper actor in the Apify Console.
- Add target groups in startUrls (string list). Only public Facebook groups are supported.
- (Optional) Tune resultsLimit to cap how many posts you collect per run (default 20, min 1).
- (Optional) Choose a viewOption: CHRONOLOGICAL, RECENT_ACTIVITY, TOP_POSTS, or CHRONOLOGICAL_LISTINGS.
- (Optional) Apply filters: use searchGroupKeyword (single letters recommended) with searchGroupYear, or set onlyPostsNewerThan (absolute date or relative like “30 days”).
- (Optional) If automatic discovery fails and you see “Missing doc_id” in logs, provide fallbackDocId.
- Click Start. Monitor real-time logs and progress.
- When complete, go to the Dataset tab to preview results and export them to CSV, JSON, or Excel.
Pro Tip: Combine a short searchGroupKeyword (e.g., “a”) with searchGroupYear and onlyPostsNewerThan for targeted Facebook group leads extractor workflows and enriched analytics.
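The steps above use the Apify Console, but a run can also be started from a script. Below is a minimal sketch that builds the HTTP request for Apify's "run actor" REST endpoint using only the standard library. The actor ID "scrapebase~facebook-groups-scraper" is an assumption inferred from the developer and actor names on this page, and the token is a placeholder; check both in your Apify account before use.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"

# Assumed actor ID (username~actor-name); verify it in the Apify Console.
ACTOR_ID = "scrapebase~facebook-groups-scraper"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build a POST request that starts an actor run with the given input."""
    return urllib.request.Request(
        f"{API_BASE}/acts/{actor_id}/runs",
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def start_run(actor_id: str, token: str, run_input: dict) -> dict:
    """Send the request and return the run object from the response's "data" key."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))["data"]


# Building the request does not contact the network; start_run() does.
run_input = {
    "startUrls": ["https://www.facebook.com/groups/example-group"],
    "resultsLimit": 50,
    "viewOption": "CHRONOLOGICAL",
    "onlyPostsNewerThan": "30 days",
}
request = build_run_request(ACTOR_ID, "YOUR_APIFY_TOKEN", run_input)
print(request.get_method(), request.full_url)
```

The returned run object should include a defaultDatasetId that identifies where the results are stored.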
Use cases
| Use case name | Description |
|---|---|
| Social listening & trend tracking | Monitor discussion themes and engagement by scraping Facebook group posts to CSV for dashboards and alerts. |
| Lead generation in niche communities | Extract recent posts with keywords and contact-relevant content for outreach pipelines using a Facebook group leads extractor approach. |
| Academic and market research | Collect time-bounded corpora (e.g., “last 30 days”) across multiple groups for qualitative and quantitative analysis. |
| Marketplace / Buy & Sell monitoring | Use CHRONOLOGICAL_LISTINGS to capture inventory-like posts consistently and analyze activity over time. |
| Competitive and product research | Track top posts, reactions, and comments to understand sentiment and community dynamics. |
| Data engineering pipelines | Pull datasets programmatically via Apify to feed warehouses for modeling, reporting, and enrichment workflows. |
Why choose Facebook Groups Scraper?
This Facebook group scraper tool prioritizes precision, scale, and reliability for production workflows.
- ✅ Accurate, normalized output with reactions, comments, and media for analytics-ready datasets
- 🌍 Works on public groups globally; date filters support absolute or relative time
- 📈 Scales to multiple groups per run with auto-pagination and resultsLimit control
- 🧑‍💻 Developer access through Apify datasets and API for downstream automation
- 🔒 Safe approach with enforced residential proxy and anti-blocking delays
- 💰 Cost-effective alternative to brittle browser extensions or manual exports
- 🔌 Integrations-ready: export to CSV/JSON and plug into n8n, Make, or internal ETL
In short: a production-grade Facebook group data extractor vs. unstable alternatives — built for analysts, marketers, and developers who need dependable output.
Is it legal / ethical to use Facebook Groups Scraper?
Yes — when used responsibly. This actor extracts data from publicly available Facebook group pages and does not access private or authenticated content. You should:
- Scrape only public groups and publicly visible posts.
- Respect Facebook’s terms and applicable laws (e.g., GDPR, CCPA).
- Avoid personal data misuse; use results for analysis and research, not spam.
- Consult your legal team for your specific jurisdiction and use case.
Input parameters & output format
Example JSON input
{
  "startUrls": ["https://www.facebook.com/groups/germtheory.vs.terraintheory"],
  "resultsLimit": 100,
  "viewOption": "RECENT_ACTIVITY",
  "searchGroupKeyword": "a",
  "searchGroupYear": "2024",
  "onlyPostsNewerThan": "2 months",
  "fallbackDocId": "",
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
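An input like the example above can be checked locally before a run is started. The sketch below applies the constraints stated in the parameter descriptions in this section (required startUrls, resultsLimit of at least 1, the four viewOption values). The URL substring check and the four-digit year check are assumptions added for illustration; the actor's own validation may differ.

```python
VIEW_OPTIONS = {"CHRONOLOGICAL", "RECENT_ACTIVITY", "TOP_POSTS", "CHRONOLOGICAL_LISTINGS"}


def validate_input(run_input: dict) -> list:
    """Return a list of human-readable problems; an empty list means the input looks fine."""
    errors = []

    urls = run_input.get("startUrls")
    if not isinstance(urls, list) or not urls:
        errors.append("startUrls must be a non-empty list")
    else:
        for url in urls:
            # Assumption: group URLs contain "facebook.com/groups/".
            if not (isinstance(url, str) and "facebook.com/groups/" in url):
                errors.append(f"not a Facebook group URL: {url!r}")

    limit = run_input.get("resultsLimit", 20)  # documented default is 20
    if isinstance(limit, bool) or not isinstance(limit, int) or limit < 1:
        errors.append("resultsLimit must be an integer >= 1")

    view = run_input.get("viewOption", "CHRONOLOGICAL")
    if view not in VIEW_OPTIONS:
        errors.append(f"viewOption must be one of {sorted(VIEW_OPTIONS)}")

    # Assumption: a non-empty year is a four-digit string such as "2024".
    year = str(run_input.get("searchGroupYear", "") or "")
    if year and not (year.isdigit() and len(year) == 4):
        errors.append("searchGroupYear should be a four-digit year")

    return errors


example_input = {
    "startUrls": ["https://www.facebook.com/groups/germtheory.vs.terraintheory"],
    "resultsLimit": 100,
    "viewOption": "RECENT_ACTIVITY",
    "searchGroupKeyword": "a",
    "searchGroupYear": "2024",
    "onlyPostsNewerThan": "2 months",
    "fallbackDocId": "",
    "proxyConfiguration": {"useApifyProxy": False},
}
print(validate_input(example_input))
```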
Input parameters
- startUrls (array, required)
  Description: Add one or more public Facebook group URLs. Only public groups are supported.
  Default: none
- resultsLimit (integer)
  Description: Maximum number of posts to scrape from each group. Minimum: 1.
  Default: 20
- viewOption (string)
  Description: Post sorting strategy. One of CHRONOLOGICAL, RECENT_ACTIVITY, TOP_POSTS, CHRONOLOGICAL_LISTINGS. Note: Post limit applies to “New posts” sorting only.
  Default: CHRONOLOGICAL
- searchGroupKeyword (string)
  Description: Search posts by keyword or letter. Without login, search is limited; 1–2 letter searches work best.
  Default: ""
- searchGroupYear (string)
  Description: Filter posts from a specific year (e.g., 2024). Best used together with searchGroupKeyword.
  Default: ""
- onlyPostsNewerThan (string)
  Description: Stop scraping when posts are older than this date. Formats: absolute (YYYY-MM-DD), or relative (e.g., "2 weeks", "7 days", "1 month").
  Default: ""
- fallbackDocId (string)
  Description: Use a known working GraphQL doc_id if automatic discovery fails (e.g., after a Facebook frontend update).
  Default: ""
- proxyConfiguration (object)
  Description: Configure proxy settings. The actor enforces the use of residential proxy with rotation and auto-retry.
  Default: { "useApifyProxy": false }
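The onlyPostsNewerThan parameter accepts both absolute and relative values. As an illustration of how such a value can be turned into a cutoff timestamp and compared against a post's time field, here is a minimal sketch. Treating a month as 30 days and a year as 365 days is an assumption made for this sketch; how the actor itself interprets relative values is not documented here.

```python
import re
from datetime import datetime, timedelta, timezone
from typing import Optional

# Approximate unit lengths in days (assumption for this sketch).
UNIT_DAYS = {"day": 1, "week": 7, "month": 30, "year": 365}


def parse_cutoff(value: str, now: datetime) -> Optional[datetime]:
    """Convert "YYYY-MM-DD" or a relative value like "2 weeks" into a UTC cutoff."""
    value = value.strip().lower()
    if not value:
        return None  # empty string means no date filter
    match = re.fullmatch(r"(\d+)\s+(day|week|month|year)s?", value)
    if match:
        amount, unit = int(match.group(1)), match.group(2)
        return now - timedelta(days=amount * UNIT_DAYS[unit])
    return datetime.strptime(value, "%Y-%m-%d").replace(tzinfo=timezone.utc)


def is_newer(post_time: str, cutoff: datetime) -> bool:
    """Check a post's ISO 8601 UTC timestamp (e.g. 2024-03-05T14:22:17.000Z) against the cutoff."""
    posted = datetime.strptime(post_time, "%Y-%m-%dT%H:%M:%S.%fZ").replace(tzinfo=timezone.utc)
    return posted >= cutoff


now = datetime(2024, 3, 10, tzinfo=timezone.utc)
cutoff = parse_cutoff("7 days", now)
print(cutoff.isoformat(), is_newer("2024-03-05T14:22:17.000Z", cutoff))
```

The same helpers can be applied after download to re-filter a dataset with a stricter cutoff than the one used for the run.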
Example JSON output
{
  "facebookUrl": "https://www.facebook.com/groups/example-group",
  "url": "https://www.facebook.com/groups/example-group/posts/1234567890123456/",
  "time": "2024-03-05T14:22:17.000Z",
  "user": {
    "id": "100012345678901",
    "name": "Jane Doe"
  },
  "text": "Looking for recommendations on the best resources...",
  "topReactionsCount": 140,
  "feedbackId": "1122334455667788",
  "reactionLikeCount": 120,
  "reactionLoveCount": 20,
  "id": "1234567890123456",
  "legacyId": "987654321098765",
  "attachments": [
    {
      "thumbnail": "https://scontent.xx.fbcdn.net/v/t1.0-9/thumbnail.jpg",
      "__typename": "Photo",
      "is_playable": false,
      "image": {
        "uri": "https://scontent.xx.fbcdn.net/v/t1.0-9/photo.jpg",
        "height": 1080,
        "width": 1080
      },
      "id": "555555555555555",
      "__isMedia": "Photo",
      "photo_cix_screen": null,
      "copyright_banner_info": null,
      "owner": {
        "__typename": "User",
        "id": "100012345678901"
      },
      "ocrText": "example detected text"
    },
    {
      "mediaset_token": "ms.987654321.123456789",
      "url": "https://www.facebook.com/media/set/?set=ms.987654321.123456789&type=1",
      "comet_product_tag_feed_overlay_renderer": null
    }
  ],
  "likesCount": 124,
  "sharesCount": 7,
  "commentsCount": 32,
  "topComments": [
    {
      "commentUrl": "https://www.facebook.com/groups/example-group/permalink/987654321098765/?comment_id=777777777777777",
      "id": "666666666666666",
      "feedbackId": "444444444444444",
      "date": "2024-03-05T15:10:00.000Z",
      "text": "Totally agree!",
      "profileUrl": "https://www.facebook.com/john.smith",
      "profilePicture": "https://scontent.xx.fbcdn.net/v/t1.0-1/picture.jpg",
      "profileId": "100011112222333",
      "profileName": "John Smith",
      "likesCount": "12",
      "threadingDepth": 0
    },
    {
      "commentUrl": "https://www.facebook.com/groups/example-group/permalink/987654321098765/?comment_id=888888888888888",
      "id": "222222222222222",
      "feedbackId": "333333333333333",
      "date": "2024-03-05T15:45:10.000Z",
      "text": "Here’s a resource that helped me.",
      "profileUrl": "https://www.facebook.com/maria.lee",
      "profilePicture": "https://scontent.xx.fbcdn.net/v/t1.0-1/picture2.jpg",
      "profileId": "100099998877665",
      "profileName": "Maria Lee",
      "likesCount": "5",
      "threadingDepth": 0
    }
  ],
  "facebookId": "123456789012345",
  "groupTitle": "Example Group",
  "pageAdLibrary": {
    "is_business_page_active": false,
    "id": "123456789012345"
  },
  "inputUrl": "https://www.facebook.com/groups/example-group"
}
Some fields (e.g., url or groupTitle) may be empty, depending on what is available on the page. topComments returns up to two comments when present.
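Because some fields can be missing and the example output shows comment-level likesCount as a string ("12") while post-level counts are numbers, downstream code benefits from a small normalization step. The sketch below is one hedged way to do that. The "engagement" total is a derived metric invented for this example, not a field the actor produces.

```python
def to_int(value, default: int = 0) -> int:
    """Coerce numbers or numeric strings to int; fall back to a default otherwise."""
    try:
        return int(value)
    except (TypeError, ValueError):
        return default


def summarize_post(post: dict) -> dict:
    """Build a compact summary that tolerates missing or empty fields."""
    comments = post.get("topComments") or []
    return {
        "id": post.get("id"),
        "url": post.get("url") or None,                # may be empty on some posts
        "groupTitle": post.get("groupTitle") or None,  # may be empty on some posts
        # Derived metric for this sketch: likes + shares + comments.
        "engagement": (to_int(post.get("likesCount"))
                       + to_int(post.get("sharesCount"))
                       + to_int(post.get("commentsCount"))),
        "topComments": [
            {
                "author": comment.get("profileName"),
                "text": comment.get("text"),
                "likes": to_int(comment.get("likesCount")),  # string in the example output
            }
            for comment in comments
        ],
    }


# Trimmed-down version of the example output above.
sample = {
    "id": "1234567890123456",
    "url": "",
    "likesCount": 124,
    "sharesCount": 7,
    "commentsCount": 32,
    "topComments": [
        {"profileName": "John Smith", "text": "Totally agree!", "likesCount": "12"},
    ],
}
print(summarize_post(sample))
```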
FAQ
Do I need to log in to scrape Facebook groups?
No. This actor works with public Facebook groups without login. However, search within groups is limited without authentication, so single-letter or two-letter searches are recommended for searchGroupKeyword.
Can this extract comments and reactions from posts?
Yes. Each post includes likesCount, commentsCount, sharesCount, topReactionsCount, reactionLikeCount, reactionLoveCount, and a topComments array with up to two highlighted comments and commenter details.
Can I export results to CSV?
Yes. After the run completes, open the Dataset and export to CSV, JSON, or Excel. This enables straightforward Facebook group data export for BI and analysis.
How do I limit or sort the posts I collect?
Use resultsLimit to control volume and viewOption to set CHRONOLOGICAL, RECENT_ACTIVITY, TOP_POSTS, or CHRONOLOGICAL_LISTINGS. For finer targeting, combine searchGroupKeyword, searchGroupYear, and onlyPostsNewerThan.
What does “Missing doc_id” mean and how do I fix it?
If Facebook changes its frontend and automatic discovery fails, you’ll see “Missing doc_id” in logs. Set a known working value in fallbackDocId to continue scraping.
Does it support proxies?
Yes. The actor always uses Apify Residential proxy with rotation and automatic retries to reliably collect data at scale.
What types of attachments are included?
attachments contain photo objects (thumbnail, image, dimensions, owner, optional OCR text) and media sets/albums (mediaset_token with a direct media set URL), depending on what the post contains.
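Since the attachments array mixes two shapes, consumers usually need to tell them apart. Here is a minimal sketch that classifies entries by the keys shown in the example output (mediaset_token for albums, __typename or image for photos). The "album", "photo", and "unknown" labels and the function names are illustrative, and other attachment shapes may exist that this page does not describe.

```python
def classify_attachment(attachment: dict) -> str:
    """Label an attachment based on the keys seen in the example output."""
    if "mediaset_token" in attachment:
        return "album"
    if attachment.get("__typename") == "Photo" or "image" in attachment:
        return "photo"
    return "unknown"


def photo_urls(post: dict) -> list:
    """Collect full-size image URIs from photo attachments, skipping incomplete ones."""
    urls = []
    for attachment in post.get("attachments") or []:
        image = attachment.get("image")
        if classify_attachment(attachment) == "photo" and isinstance(image, dict) and image.get("uri"):
            urls.append(image["uri"])
    return urls


# Trimmed-down attachments taken from the example output.
post = {
    "attachments": [
        {
            "__typename": "Photo",
            "thumbnail": "https://scontent.xx.fbcdn.net/v/t1.0-9/thumbnail.jpg",
            "image": {"uri": "https://scontent.xx.fbcdn.net/v/t1.0-9/photo.jpg",
                      "height": 1080, "width": 1080},
        },
        {"mediaset_token": "ms.987654321.123456789"},
    ]
}
print([classify_attachment(a) for a in post["attachments"]])
print(photo_urls(post))
```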
Can I access the data programmatically?
Yes. Use the Apify API to access your dataset for downstream pipelines, making this a practical Facebook group data scraper API workflow.
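As a rough illustration, the sketch below reads dataset items through Apify's dataset items REST endpoint using only the standard library. The dataset ID and token are placeholders, and the helper names are hypothetical; the official Apify client libraries offer a higher-level alternative.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional


def dataset_items_url(dataset_id: str, fmt: str = "json",
                      limit: Optional[int] = None, offset: int = 0) -> str:
    """Build the URL for downloading items from an Apify dataset."""
    params = {"format": fmt, "offset": offset}
    if limit is not None:
        params["limit"] = limit
    query = urllib.parse.urlencode(params)
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"


def fetch_items(dataset_id: str, token: str, **kwargs) -> list:
    """Download items as parsed JSON; this performs a real network request."""
    request = urllib.request.Request(
        dataset_items_url(dataset_id, **kwargs),
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the URL is offline; call fetch_items() with a real dataset ID and token.
print(dataset_items_url("YOUR_DATASET_ID", limit=50))
```

Using limit and offset together allows large datasets to be read in batches instead of in one response.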
Closing thoughts
Facebook Groups Scraper is built to turn public Facebook group discussions into structured, analytics-ready datasets. With automatic discovery, enforced residential proxies, flexible sorting/filters, and live dataset streaming, it’s ideal for marketers, developers, data analysts, and researchers. Use the Apify API to automate exports or integrate into your pipeline, and start extracting smarter community insights today.