Facebook Group Scraper (Posts and Details )

Pricing

from $3.99 / 1,000 results

Rating

0.0

(0)

Developer

ScrapeEngine

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 3 days ago

Facebook Group Posts & Details Scraper

An Apify Actor that scrapes public (and optionally logged-in) Facebook group feeds via Facebook’s internal GraphQL endpoints. It outputs post text, post ID, author fields, avatar URL, timestamp, and group URL, in line with ./.actor/actor.json.

What it actually extracts

Field | Notes
Post text, post id | From the group feed query
Author name, id, profile URL, avatar | When present in the feed payload
Timestamp | Regex-extracted from feed JSON nodes (extract_epoch_from_node)
Group URL | Added per row
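
To illustrate the kind of regex extraction the Timestamp row describes, here is a minimal sketch. It is not the actor's extract_epoch_from_node: the function name find_epoch_in_node, the pattern, and the sample key creation_time are all assumptions made for illustration, since this README does not state which keys the feed payload uses.

```python
import json
import re
from typing import Any, Optional

# Assumption: the epoch is a 10-digit integer stored under a key ending in "time".
# The real key names in Facebook's payload are not documented in this README.
EPOCH_RE = re.compile(r'"[A-Za-z_]*time"\s*:\s*(\d{10})\b')


def find_epoch_in_node(node: Any) -> Optional[str]:
    """Serialize a feed node and return the first 10-digit epoch found, else None."""
    serialized = json.dumps(node)
    match = EPOCH_RE.search(serialized)
    return match.group(1) if match else None


# Hypothetical node shape, used only to exercise the function.
sample_node = {"story": {"id": "1", "creation_time": 1711800000}}
print(find_epoch_in_node(sample_node))
```

Returning the value as a string mirrors the string-typed timestamp field in the example output row.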

Not included in this Actor: comment threads, reaction counts, media URL lists, or member lists. Those would require separate GraphQL operations and are not implemented in the current code. Marketing copy that claims otherwise should be ignored in favor of this README and the actor schema.

Reliability

Facebook may return empty feed pages, omit doc_id from its JS bundles, or respond with GraphQL errors when requests look automated. The actor uses src/helper.py + src/group.py (the same pipeline as the standalone helper.py + group.py) and src/main.py (Apify input, residential proxy, KV monitoring, push_data). A residential Apify proxy is recommended. The actor cannot guarantee the full maxPosts on every run.

  • maxPosts: Global maximum across all groups (minimum 10). With multiple groups, quotas are split as in the standalone group.py (split_quota); merged output is capped at maxPosts.
  • monitoringMode: When enabled, only post IDs not already stored per group in the key-value store are pushed. For a full re-scrape, turn monitoring off.
  • Progress logs refer to collected posts vs maxPosts, not a statistical “success rate.”
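
The quota behaviour described above can be sketched as follows. This is a hedged illustration only: the README names split_quota but does not show how it divides the budget, so the even split with remainder used here, and the helper names split_quota_evenly and cap_merged, are assumptions.

```python
from typing import Dict, List


def split_quota_evenly(max_posts: int, group_count: int) -> List[int]:
    """Divide a global post budget across groups (assumed: even split plus remainder)."""
    if group_count < 1:
        raise ValueError("at least one group is required")
    base, extra = divmod(max_posts, group_count)
    # The first `extra` groups receive one additional post each.
    return [base + (1 if i < extra else 0) for i in range(group_count)]


def cap_merged(rows: List[Dict], max_posts: int) -> List[Dict]:
    """Cap the merged output at the global maximum, as the README describes."""
    return rows[:max_posts]


quotas = split_quota_evenly(50, 3)
print(quotas, sum(quotas))
```

Under this assumed scheme the per-group quotas always sum to maxPosts, so the final cap only matters if a group returns more rows than its share.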

Input (summary)

  • startUrls — Group URLs or group identifiers (required)
  • maxPosts — Minimum 10; total cap across all groups when multiple URLs are listed
  • sortOrder, startDate, proxyConfiguration
  • monitoringMode — Incremental runs (KV-backed seen IDs)
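
As a rough picture of what monitoringMode does, the sketch below filters rows against a per-group set of seen post IDs. The in-memory dict stands in for the Apify key-value store, and filter_new_posts is a hypothetical helper; how the actor actually names and stores its keys is not stated in this README.

```python
from typing import Dict, List, Set


def filter_new_posts(rows: List[Dict], seen_ids: Set[str]) -> List[Dict]:
    """Return only rows whose id has not been seen, and record the new ids."""
    fresh = []
    for row in rows:
        post_id = row["id"]
        if post_id not in seen_ids:
            seen_ids.add(post_id)
            fresh.append(row)
    return fresh


# Assumption: one set of seen ids per group URL; a plain dict replaces the KV store.
kv_store: Dict[str, Set[str]] = {}
group_url = "https://www.facebook.com/groups/example"
seen = kv_store.setdefault(group_url, set())

first_run = filter_new_posts([{"id": "1"}, {"id": "2"}], seen)
second_run = filter_new_posts([{"id": "2"}, {"id": "3"}], seen)
print([r["id"] for r in first_run], [r["id"] for r in second_run])
```

With monitoring off, the equivalent behaviour would be to skip the filter and push every collected row.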

Example input

{
  "startUrls": [{ "url": "https://www.facebook.com/groups/your-group" }],
  "sortOrder": "RECENT_ACTIVITY",
  "maxPosts": 50,
  "monitoringMode": false
}
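
If the input is assembled in code, a small builder can enforce the constraints listed in the input summary before a run starts. The sketch below assumes only what this README states (startUrls is required, maxPosts has a minimum of 10); build_run_input is a hypothetical helper, and the client-side validation is an addition, not something the actor is documented to do.

```python
import json
from typing import Dict, List


def build_run_input(
    group_urls: List[str],
    max_posts: int = 50,
    sort_order: str = "RECENT_ACTIVITY",
    monitoring_mode: bool = False,
) -> Dict:
    """Assemble an input object shaped like the example input above."""
    if not group_urls:
        raise ValueError("startUrls is required")
    if max_posts < 10:
        raise ValueError("maxPosts must be at least 10")
    return {
        "startUrls": [{"url": url} for url in group_urls],
        "sortOrder": sort_order,
        "maxPosts": max_posts,
        "monitoringMode": monitoring_mode,
    }


run_input = build_run_input(["https://www.facebook.com/groups/your-group"])
print(json.dumps(run_input, indent=2))
```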

Example output row

{
  "id": "25346255468358122",
  "image": "https://scontent...",
  "postText": "Post content...",
  "postAuthor": "Jane Doe",
  "postAuthorId": "100048765312428",
  "postAuthorUrl": "https://www.facebook.com/...",
  "email": "",
  "firstName": "Jane",
  "lastName": "Doe",
  "timestamp": "1711800000",
  "groupUrl": "https://www.facebook.com/groups/example"
}
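
When consuming rows like the one above, the timestamp arrives as a string. The sketch below converts it to a timezone-aware datetime, assuming the value is a Unix epoch in seconds (suggested by the 10-digit example, but not stated explicitly in this README). timestamp_to_datetime is a hypothetical helper.

```python
from datetime import datetime, timezone
from typing import Dict, Optional


def timestamp_to_datetime(row: Dict) -> Optional[datetime]:
    """Convert a row's string timestamp (assumed Unix seconds) to a UTC datetime."""
    raw = row.get("timestamp", "")
    if not raw:
        return None  # the timestamp may be missing from some rows
    return datetime.fromtimestamp(int(raw), tz=timezone.utc)


row = {
    "id": "25346255468358122",
    "postAuthor": "Jane Doe",
    "timestamp": "1711800000",
    "groupUrl": "https://www.facebook.com/groups/example",
}
print(timestamp_to_datetime(row).isoformat())  # 2024-03-30T12:00:00+00:00
```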

Only scrape content you have the right to access. Respect Facebook’s Terms of Service and applicable laws. Use responsible rate limits and data handling.