Facebook Group Posts Scraper (Details)
Pricing
$19.99/month + usage
Facebook Group Posts Scraper (Details)
The Facebook Group Posts And Details Scraper extracts posts, comments, reactions, and group metadata from public Facebook groups. Export structured JSON for research, analytics, automation, and monitoring. Fast, reliable, and built for scalable Facebook data scraping workflows. 🚀
Pricing
$19.99/month + usage
Rating
0.0
(0)
Developer
ScrapAPI
Actor stats
1
Bookmarked
5
Total users
2
Monthly active users
13 days
Issues response
6 days ago
Last modified
Categories
Share
Facebook Group Posts Scraper (Details)
The Facebook Group Posts Scraper (Details) is a fast, scalable Facebook group posts extractor that collects public group feed posts via GraphQL — including text, author fields, avatar image URL, timestamp, and group URL — and saves them as structured JSON for research, analytics, automation, and monitoring. It solves the pain of manually tracking group content by turning group feeds into clean, machine‑readable records you can monitor over time using Monitoring Mode. Built for marketers, developers, data analysts, and researchers, this Facebook group content crawler enables repeatable “download Facebook group posts data” workflows at scale. Note: it does not scrape comment threads or reactions; it collects exactly what the group feed query returns.
What data / output can you get?
The scraper writes one JSON record per post to the Apify dataset as soon as it’s parsed. Fields are derived directly from the feed response and normalized consistently.
| Data field | Description | Example value |
|---|---|---|
| id | Unique post identifier parsed from the feed | "26084479927869002" |
| image | First detected media/profile image URL in the post | "https://scontent.xx.fbcdn.net/…jpg" |
| postText | Best-effort post text extracted from content sections | "Looking for mattress recommendations in Australia…" |
| postAuthor | Display name of the post’s owning profile | "Jeffrey Sambell" |
| postAuthorId | Owning profile ID (stringified) | "pfbid02cw1sr66o7LhcYcByToHY…" |
| postAuthorUrl | Constructed author profile URL when resolvable | "https://www.facebook.com/pfbid02cw1s…" |
| Email field (always empty in this feed) | "" | |
| firstName | Parsed first name from author name/short_name | "Jeffrey" |
| lastName | Parsed last name from author name/short_name | "Sambell" |
| timestamp | Unix epoch seconds of the post (as string) | "1770944771" |
| groupUrl | Canonical URL of the source Facebook group | "https://www.facebook.com/groups/germtheory.vs.terraintheory" |
Notes:
- The actor outputs structured JSON. Apify datasets make it easy to integrate results into your pipelines or dashboards.
- Comments, replies, reactions, and engagement metrics are not included — this is a Facebook group posts scraper, not a Facebook group comments scraper or reactions scraper.
Key features
-
⚡ GraphQL-powered feed extraction
- Targets Facebook group feed posts and normalizes text, author, image URL, timestamp, and group URL into clean JSON.
-
🧭 Flexible sorting
- Choose how the feed is traversed with sortOrder: RECENT_POSTS, TOP_POSTS, RECENT_ACTIVITY, or CHRONOLOGICAL.
-
🕒 Date filtering
- Use startDate (YYYY-MM-DD) to scrape only recent posts and keep your Facebook group posts monitoring tool focused on fresh content.
-
🔁 Monitoring Mode for ongoing tracking
- When monitoringMode is enabled, the actor remembers seen post IDs in Apify Key‑Value Store and only outputs new posts on subsequent runs.
-
🧮 Multi-group quota splitting
- Provide multiple startUrls and the actor automatically splits your maxPosts quota across them, merges results, and enforces the global cap.
-
🚀 Streamed dataset writes
- Each post is pushed to the dataset as soon as it’s ready, ideal for near‑real‑time pipelines and Facebook group posts analytics tool scenarios.
-
🛡️ Residential proxy sessions
- Uses Apify residential proxy under the hood and rotates a managed session for more reliable public group access.
-
🧰 Developer-friendly
- Structured fields with stable naming make it easy to use as a Facebook group posts API extractor in your automation stack.
How to use Facebook Group Posts Scraper (Details) - step by step
- Sign in to Apify and open the “Facebook Group Posts Scraper (Details)” actor.
- Add one or more startUrls. You can enter:
- Full group URLs (e.g., https://www.facebook.com/groups/germtheory.vs.terraintheory)
- Group identifiers (e.g., germtheory.vs.terraintheory)
- Set maxPosts. This is a global cap (minimum 10). With multiple groups, the cap is split across them automatically.
- Choose sortOrder if needed: RECENT_POSTS, TOP_POSTS, RECENT_ACTIVITY, or CHRONOLOGICAL.
- Optionally set startDate (YYYY-MM-DD) to only scrape posts newer than that date.
- Enable monitoringMode to output only new (unseen) posts on repeat runs. The actor persists seen post IDs in the Apify Key‑Value Store.
- Proxy: You don’t need to configure anything — the actor uses residential proxy automatically. The proxyConfiguration input is available but the actor still uses residential proxy regardless of your choice.
- Click Run. Posts will be written to the dataset continuously as they’re parsed. When the run finishes, download the dataset in JSON.
Pro tip: Start with maxPosts between 20 and 50 to validate your setup. For full history again, set monitoringMode to false.
Use cases
| Use case name | Description |
|---|---|
| Market research on community topics | Track discussions in public groups to identify pain points, trends, and emerging themes using structured post text and timestamps. |
| Competitive intelligence for social content | Monitor competitor‑adjacent groups to see what content resonates based on recent activity ordering and recency filters. |
| Content ideation for social teams | Harvest post ideas and language used by your audience to guide content calendars and messaging. |
| Academic or policy research | Collect public discourse data from multiple groups with a fixed quota split across startUrls for reproducible sampling. |
| Automation pipelines (API extractor) | Use the dataset as a Facebook group posts API extractor in backend workflows that enrich, rank, or transform public posts. |
| Monitoring and alerts | Run with monitoringMode to only capture new posts and feed alerting/reporting dashboards without duplicates. |
| Archiving public group content | Build a lightweight Facebook group posts archiver by scheduling recurring runs and storing JSON snapshots over time. |
Why choose Facebook Group Posts Scraper (Details)?
- 🎯 Purpose-built for group feeds: Focuses precisely on public Facebook group feed posts with clean, stable field names.
- 🧹 Duplicate‑aware monitoring: Persisted seen IDs ensure monitoring runs only surface new content.
- 🧭 Sort and filter control: sortOrder and startDate give you control over feed traversal and time windows.
- 🧮 Scales across groups: Split quotas across multiple groups and enforce a single global maxPosts cap.
- ⚙️ Stream‑to‑dataset: Posts are pushed as they’re parsed for faster downstream processing.
- 🛡️ Managed residential proxy: Uses a residential session under the hood for better reliability on public content.
- 🧑💻 Developer‑ready JSON: Simple field names that are easy to consume in analytics and automation pipelines.
In short: a reliable Facebook group posts scraper that’s optimized for repeatable, production‑grade workflows — not a brittle browser extension.
Is it legal / ethical to use Facebook Group Posts Scraper (Details)?
Yes — when used responsibly. This actor targets publicly available Facebook group feed content and does not access private data, authenticated areas, comment threads, or reactions. Always:
- Scrape only public data you’re permitted to process.
- Respect applicable regulations (e.g., GDPR, CCPA) and your organization’s data policies.
- Review and comply with Facebook’s terms and any restrictions that apply to your use case. For edge cases, consult your legal team.
Input parameters & output format
Example JSON input
{"startUrls": ["germtheory.vs.terraintheory","https://www.facebook.com/groups/germtheory.vs.terraintheory"],"monitoringMode": false,"maxPosts": 25,"sortOrder": "RECENT_ACTIVITY","startDate": "2025-02-20","proxyConfiguration": {"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]}}
Input parameters
- startUrls (array, required)
- Description: Enter one or more Facebook group URLs or group identifiers. Examples: “https://www.facebook.com/groups/germtheory.vs.terraintheory” or “germtheory.vs.terraintheory”. You can add multiple groups to scrape them all in one run.
- Default: none
- monitoringMode (boolean)
- Description: Track only fresh content. When enabled, the actor remembers which post IDs it has already seen and skips duplicates in future runs. Uses Apify Key‑Value Store under the hood to persist seen post IDs.
- Default: false
- maxPosts (integer, minimum 10)
- Description: Global cap across the whole run. With multiple startUrls, the cap is split across groups (split_quota) and results are merged, then capped at this value. With one group, that group gets the full quota.
- Default: 10
- sortOrder (string; one of RECENT_POSTS, TOP_POSTS, RECENT_ACTIVITY, CHRONOLOGICAL)
- Description: Sorting strategy for the group feed (sortingSetting in GraphQL variables).
- Default: RECENT_ACTIVITY
- startDate (string, nullable, pattern YYYY-MM-DD)
- Description: Only scrape posts newer than the selected date. Leave empty to scrape all available posts regardless of date.
- Default: null
- proxyConfiguration (object)
- Description: Proxy settings (optional). This actor always uses residential proxy regardless of your choice. Configure Apify Proxy credentials if needed.
- Default: prefill uses Apify Proxy with RESIDENTIAL group
Example JSON output
{"id": "26084479927869002","image": "https://scontent.xx.fbcdn.net/v/t39.30808-1/…jpg","postText": "I thought this might be a good page to ask on...","postAuthor": "Jeffrey Sambell","postAuthorId": "pfbid02cw1sr66o7LhcYcByToHY3gvKrnPrLCTckGUu8ULGof7xDR9THbzrEFZfZeVU7q6El","postAuthorUrl": "https://www.facebook.com/pfbid02cw1sr66o7LhcYcByToHY3gvKrnPrLCTckGUu8ULGof7xDR9THbzrEFZfZeVU7q6El","email": "","firstName": "Jeffrey","lastName": "Sambell","timestamp": "1770944771","groupUrl": "https://www.facebook.com/groups/germtheory.vs.terraintheory"}
Field notes:
- email is intentionally an empty string in this feed output.
- timestamp is a Unix epoch value represented as a string.
FAQ
Does it scrape comments or reactions?
No. The actor targets the group feed query only and does not extract comment threads or reactions. It outputs post-level fields such as text, author info, image URL, timestamp, and group URL.
Can I monitor only new posts over time?
Yes. Enable monitoringMode to track only fresh content. The actor remembers previously seen post IDs using the Apify Key‑Value Store and skips duplicates on subsequent runs.
What sort orders are supported?
You can choose from RECENT_POSTS, TOP_POSTS, RECENT_ACTIVITY, and CHRONOLOGICAL. The actor validates your selection and applies it to the group feed traversal.
How many posts can I collect per run?
Set maxPosts (minimum 10) as a global cap. If you supply multiple startUrls, the actor splits the quota across groups, merges results, and enforces the overall cap.
Can I filter by date?
Yes. Use startDate in the format YYYY-MM-DD to only include posts newer than that date. Leave it empty to scrape all available posts regardless of date.
What proxy does it use?
The actor uses a residential proxy session under the hood and manages a network session automatically. The proxyConfiguration input is available, but residential proxy is still used regardless of your choice.
What groups are supported?
Public Facebook groups. Provide full group URLs or group identifiers in startUrls. The actor normalizes and deduplicates group URLs internally.
What does each output record include?
Each dataset item includes: id, image, postText, postAuthor, postAuthorId, postAuthorUrl, email, firstName, lastName, timestamp, and groupUrl — one JSON object per post.
Closing CTA / Final thoughts
Facebook Group Posts Scraper (Details) is built to turn public Facebook group feeds into clean, structured JSON for analytics, automation, and continuous monitoring. With flexible sorting, date filtering, monitoringMode for deduped re-runs, and residential proxy sessions, it’s a dependable tool for marketers, developers, analysts, and researchers. Plug the dataset into your pipelines, schedule recurring runs, or use it as a Facebook group posts API extractor in your stack — and start extracting smarter from public group feeds today.
