Facebook Pages Posts Extractor (Rich Metadata)cookieless avatar
Facebook Pages Posts Extractor (Rich Metadata)cookieless
Under maintenance

Pricing

from $1.50 / 1,000 results

Go to Apify Store
Facebook Pages Posts Extractor (Rich Metadata)cookieless

Facebook Pages Posts Extractor (Rich Metadata)cookieless

Under maintenance

Extract high-fidelity Facebook page post metadata without cookies, capturing granular engagement metrics, hidden timestamps, and comprehensive interaction data. Structured, analysis-ready extraction for precise social media content trend analysis.

Pricing

from $1.50 / 1,000 results

Rating

0.0

(0)

Developer

Surge Street

Surge Street

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

3 days ago

Last modified

Share

"# Facebook Pages Posts Extractor (Rich Metadata)cookieless

Overview

This actor performs a deep extraction of Facebook Pages post data, capturing comprehensive metadata including engagement metrics, media attributes, targeting parameters, and call-to-action elements. The extractor operates in a cookieless environment, ensuring reliable data collection without authentication dependencies. Output data maintains high fidelity with structured schema validation, providing data scientists with consistent field types and complete nested object hierarchies for downstream analytical workflows.

Data Dictionary

Field NameData TypeDefinition
post_idStringUnique identifier for the Facebook post in platform-native format
external_idStringSecondary identifier used for cross-platform tracking and deduplication
scraped_atString (ISO 8601)UTC timestamp indicating when the data extraction occurred
post_typeStringClassification of post format (e.g., status_update, photo, video, link, event)
contentStringRaw text content of the post, including body copy and captions
urlStringCanonical URL pointing to the post on Facebook's platform
language_codeStringISO 639-1 language code with regional variant (e.g., en_US, es_MX)
is_verifiedBooleanIndicates whether the posting page has official verification status
sentiment_scoreFloatComputed sentiment polarity ranging from -1.0 (negative) to 1.0 (positive)
privacy_settingStringVisibility level of the post (public, friends, custom, private)
page_info.page_idStringUnique identifier for the Facebook Page that published the post
page_info.nameStringDisplay name of the Facebook Page
page_info.categoryStringBusiness or content category assigned to the Page
page_info.follower_countIntegerTotal number of followers/likes for the Page at time of scraping
engagement_metrics.reactionsIntegerAggregate count of all reaction types (Like, Love, Haha, Wow, Sad, Angry)
engagement_metrics.commentsIntegerTotal number of top-level comments on the post
engagement_metrics.sharesIntegerCount of times the post was shared to timelines or other destinations
engagement_metrics.reachIntegerEstimated number of unique users who viewed the post
media.has_mediaBooleanIndicates presence of attached media assets (images, videos, documents)
media.typeStringPrimary media type classification (image, video, album, document, none)
media.countIntegerNumber of media items attached to the post
media.dimensions.widthIntegerWidth in pixels of the primary media asset
media.dimensions.heightIntegerHeight in pixels of the primary media asset
targeting.locationArray[String]ISO country codes representing geographic targeting parameters
targeting.age_rangeStringAge demographic range targeted by the post (e.g., 18-24, 25-34, 18-65+)
targeting.demographicsStringAdditional demographic targeting criteria (all, custom, specific segments)
cta_data.has_ctaBooleanIndicates presence of a call-to-action button on the post
cta_data.button_typeStringType of CTA button (learn_more, shop_now, sign_up, contact_us, etc.)
cta_data.destination_urlStringTarget URL where the CTA button redirects users

Sample Dataset

Below is a sample of the high-fidelity JSON output:

{
""post_id"": ""page_post_789012345678901"",
""external_id"": ""fb_p_2025122001_xyz789"",
""scraped_at"": ""2025-12-20T08:15:32Z"",
""post_type"": ""status_update"",
""content"": ""Exciting developments in our Q4 roadmap!"",
""url"": ""https://facebook.com/pages/posts/789012345678901"",
""language_code"": ""en_US"",
""is_verified"": true,
""sentiment_score"": 0.78,
""privacy_setting"": ""public"",
""page_info"": {
""page_id"": ""156789012345"",
""name"": ""Tech Solutions Inc"",
""category"": ""Technology Company"",
""follower_count"": 45678
},
""engagement_metrics"": {
""reactions"": 1234,
""comments"": 89,
""shares"": 156,
""reach"": 12567
},
""media"": {
""has_media"": true,
""type"": ""image"",
""count"": 2,
""dimensions"": {
""width"": 1200,
""height"": 630
}
},
""targeting"": {
""location"": [""US"", ""UK"", ""CA""],
""age_range"": ""18-65+"",
""demographics"": ""all""
},
""cta_data"": {
""has_cta"": true,
""button_type"": ""learn_more"",
""destination_url"": ""https://techsolutions.com/roadmap""
}
}

Configuration Parameters

To ensure optimal data depth, configure the following:

ParameterJSON Field NameData TypeDescriptionExample Value
Page IDpageIdStringNumeric identifier of the Facebook Page from which to extract posts. Located in the Page's URL or About section.61553294532484

Analytical Use Cases

Engagement Pattern Analysis: Data scientists can construct time-series models using scraped_at, engagement_metrics, and post_type to identify optimal posting schedules and content formats that maximize reach and interaction rates.

Sentiment Correlation Studies: The sentiment_score field enables correlation analysis between emotional tone and engagement performance, supporting A/B testing hypotheses for content optimization strategies.

Competitive Intelligence: By extracting posts from multiple Pages within the same page_info.category, analysts can benchmark engagement metrics, content strategies, and CTA effectiveness across industry competitors.

Viral Content Identification: Threshold-based filtering on engagement_metrics.shares and engagement_metrics.reach ratios enables identification of viral content patterns for replication in future campaigns.

Media Performance Attribution: The media object structure supports multivariate analysis of how media type, count, and dimensions impact engagement outcomes across different audience segments.

Geographic Targeting Optimization: The targeting.location array facilitates cohort analysis to determine which geographic markets demonstrate highest engagement rates for specific content types.

Longitudinal Content Strategy Studies: Consistent schema structure across extraction runs enables panel data analysis tracking how Page strategy evolution (measured through post_type, cta_data, and content themes) correlates with follower growth and engagement trends over extended time periods.

Technical Limitations

Important Considerations:

  • Rate Limiting: Extraction throughput is subject to platform-imposed rate limits. Recommended batch size is 100 posts per execution to maintain stability.
  • Data Freshness: Engagement metrics (reactions, comments, shares, reach) represent point-in-time snapshots and do not update retroactively. Historical engagement growth requires multiple extraction runs.
  • Cookieless Constraints: Operating without authentication limits access to public posts only. Private or restricted content will not appear in extraction results.
  • Sentiment Score Methodology: The sentiment_score is computed using lexicon-based analysis and may not capture context-dependent sentiment nuances or sarcasm.
  • Media Dimension Availability: The media.dimensions object may return null values for certain media types (videos, documents) where dimensional metadata is not exposed.
  • Reach Metric Accuracy: The reach value is an estimated metric and may not reflect exact unique user counts due to platform calculation methodologies.
  • Historical Data Retention: Facebook Pages typically expose the most recent 100-500 posts via public interfaces. Comprehensive historical archives may require incremental extraction strategies.
  • Targeting Data Completeness: The targeting object reflects advertised targeting parameters when available; organic posts may return default values (""all"", null arrays).

Keywords & Tags: This web scraper functions as a specialized web scraping tool designed to extract data from web pages, specifically engineered to scrape website data from Facebook Pages. As a social media post scraper and blog post scraper, it serves content analysts and functions as a lead generation scraper for competitive intelligence workflows."