Facebook Pages Posts Extractor (Rich Metadata)cookieless
Pricing
from $1.50 / 1,000 results
Extract high-fidelity Facebook page post metadata without cookies, capturing granular engagement metrics, hidden timestamps, and comprehensive interaction data. Structured, analysis-ready extraction for precise social media content trend analysis.
Rating: 0.0 (0)
Developer: Surge Street
Actor stats
Bookmarked: 0
Total users: 1
Monthly active users: 0
Last modified: 3 days ago
# Facebook Pages Posts Extractor (Rich Metadata)cookieless
## Overview
This actor performs a deep extraction of Facebook Pages post data, capturing comprehensive metadata including engagement metrics, media attributes, targeting parameters, and call-to-action elements. The extractor operates in a cookieless environment, ensuring reliable data collection without authentication dependencies. Output data maintains high fidelity with structured schema validation, providing data scientists with consistent field types and complete nested object hierarchies for downstream analytical workflows.
## Data Dictionary
| Field Name | Data Type | Definition |
|---|---|---|
| post_id | String | Unique identifier for the Facebook post in platform-native format |
| external_id | String | Secondary identifier used for cross-platform tracking and deduplication |
| scraped_at | String (ISO 8601) | UTC timestamp indicating when the data extraction occurred |
| post_type | String | Classification of post format (e.g., status_update, photo, video, link, event) |
| content | String | Raw text content of the post, including body copy and captions |
| url | String | Canonical URL pointing to the post on Facebook's platform |
| language_code | String | ISO 639-1 language code with regional variant (e.g., en_US, es_MX) |
| is_verified | Boolean | Indicates whether the posting page has official verification status |
| sentiment_score | Float | Computed sentiment polarity ranging from -1.0 (negative) to 1.0 (positive) |
| privacy_setting | String | Visibility level of the post (public, friends, custom, private) |
| page_info.page_id | String | Unique identifier for the Facebook Page that published the post |
| page_info.name | String | Display name of the Facebook Page |
| page_info.category | String | Business or content category assigned to the Page |
| page_info.follower_count | Integer | Total number of followers/likes for the Page at time of scraping |
| engagement_metrics.reactions | Integer | Aggregate count of all reaction types (Like, Love, Haha, Wow, Sad, Angry) |
| engagement_metrics.comments | Integer | Total number of top-level comments on the post |
| engagement_metrics.shares | Integer | Count of times the post was shared to timelines or other destinations |
| engagement_metrics.reach | Integer | Estimated number of unique users who viewed the post |
| media.has_media | Boolean | Indicates presence of attached media assets (images, videos, documents) |
| media.type | String | Primary media type classification (image, video, album, document, none) |
| media.count | Integer | Number of media items attached to the post |
| media.dimensions.width | Integer | Width in pixels of the primary media asset |
| media.dimensions.height | Integer | Height in pixels of the primary media asset |
| targeting.location | Array[String] | ISO country codes representing geographic targeting parameters |
| targeting.age_range | String | Age demographic range targeted by the post (e.g., 18-24, 25-34, 18-65+) |
| targeting.demographics | String | Additional demographic targeting criteria (all, custom, specific segments) |
| cta_data.has_cta | Boolean | Indicates presence of a call-to-action button on the post |
| cta_data.button_type | String | Type of CTA button (learn_more, shop_now, sign_up, contact_us, etc.) |
| cta_data.destination_url | String | Target URL where the CTA button redirects users |
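
The data types listed above can be checked on the consumer side before records enter an analysis pipeline. The following is a minimal sketch, not part of the actor itself: the helper names `get_path` and `type_errors` are hypothetical, only a subset of fields is covered, and missing or null fields are deliberately skipped rather than flagged.

```python
# Expected Python types for a subset of fields, copied from the data dictionary.
# Dotted paths address fields inside nested objects.
EXPECTED_TYPES = {
    "post_id": str,
    "is_verified": bool,
    "sentiment_score": float,
    "page_info.follower_count": int,
    "engagement_metrics.reactions": int,
    "engagement_metrics.reach": int,
    "media.has_media": bool,
    "media.dimensions.width": int,
    "targeting.location": list,
}


def get_path(record, dotted_path):
    """Follow a dotted path such as 'media.dimensions.width'; None if absent."""
    value = record
    for key in dotted_path.split("."):
        if not isinstance(value, dict):
            return None
        value = value.get(key)
    return value


def type_errors(record):
    """Return (path, found_type_name) pairs for fields with an unexpected type."""
    errors = []
    for path, expected in EXPECTED_TYPES.items():
        value = get_path(record, path)
        if value is None:
            continue  # missing or null fields are skipped, not flagged
        if expected is float:
            # JSON may encode 1.0 as 1, so accept ints too, but never booleans.
            ok = isinstance(value, (int, float)) and not isinstance(value, bool)
        elif expected is int:
            # bool is a subclass of int in Python, so exclude it explicitly.
            ok = isinstance(value, int) and not isinstance(value, bool)
        else:
            ok = isinstance(value, expected)
        if not ok:
            errors.append((path, type(value).__name__))
    return errors


good = {
    "post_id": "page_post_789012345678901",
    "is_verified": True,
    "sentiment_score": 0.78,
    "engagement_metrics": {"reactions": 1234, "reach": 12567},
}
bad = {"post_id": 123, "engagement_metrics": {"reactions": "1234"}}

print(type_errors(good))
print(type_errors(bad))
```

Skipping null values keeps the check compatible with fields that the Technical Limitations section says may legitimately be null, such as `media.dimensions`.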
## Sample Dataset
Below is a sample of the high-fidelity JSON output:
{"post_id": "page_post_789012345678901", "external_id": "fb_p_2025122001_xyz789", "scraped_at": "2025-12-20T08:15:32Z", "post_type": "status_update", "content": "Exciting developments in our Q4 roadmap!", "url": "https://facebook.com/pages/posts/789012345678901", "language_code": "en_US", "is_verified": true, "sentiment_score": 0.78, "privacy_setting": "public", "page_info": {"page_id": "156789012345", "name": "Tech Solutions Inc", "category": "Technology Company", "follower_count": 45678}, "engagement_metrics": {"reactions": 1234, "comments": 89, "shares": 156, "reach": 12567}, "media": {"has_media": true, "type": "image", "count": 2, "dimensions": {"width": 1200, "height": 630}}, "targeting": {"location": ["US", "UK", "CA"], "age_range": "18-65+", "demographics": "all"}, "cta_data": {"has_cta": true, "button_type": "learn_more", "destination_url": "https://techsolutions.com/roadmap"}}
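
For tabular analysis, the nested objects in a record can be flattened into the dotted column names used in the data dictionary. This is a sketch under stated assumptions: `flatten` is a hypothetical helper, and `RAW` is a shortened copy of the sample record rather than real actor output.

```python
import json

# Shortened copy of the sample record (assumption: real records share this shape).
RAW = (
    '{"post_id": "page_post_789012345678901", '
    '"engagement_metrics": {"reactions": 1234, "comments": 89, '
    '"shares": 156, "reach": 12567}, '
    '"media": {"has_media": true, "type": "image", "count": 2, '
    '"dimensions": {"width": 1200, "height": 630}}, '
    '"targeting": {"location": ["US", "UK", "CA"]}}'
)


def flatten(obj, prefix=""):
    """Turn nested dicts into a single dict keyed by dotted paths."""
    flat = {}
    for key, value in obj.items():
        path = f"{prefix}{key}"
        if isinstance(value, dict):
            flat.update(flatten(value, prefix=f"{path}."))
        else:
            # Scalars, lists, and nulls are kept as-is under their dotted path.
            flat[path] = value
    return flat


row = flatten(json.loads(RAW))
print(row["engagement_metrics.reach"])
print(row.get("media.dimensions.width"))
```

Reading columns with `row.get(...)` rather than `row[...]` is a deliberate choice here: when a nested object such as `media.dimensions` is null, only the parent key appears in the flattened row and the child columns are absent.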
## Configuration Parameters
To ensure optimal data depth, configure the following:
| Parameter | JSON Field Name | Data Type | Description | Example Value |
|---|---|---|---|---|
| Page ID | pageId | String | Numeric identifier of the Facebook Page from which to extract posts. Located in the Page's URL or About section. | 61553294532484 |
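
As an illustration, the input object for the single documented parameter can be assembled as below. This is a sketch only: `build_input` is a hypothetical helper, the digits-only check is an assumption drawn from the "Numeric identifier" wording in the table, and how the resulting JSON is submitted to the actor is not covered here.

```python
import json


def build_input(page_id):
    """Build the actor input object with the documented pageId field."""
    page_id = str(page_id).strip()
    # Assumption: a valid Page ID consists of ASCII digits only.
    if not (page_id.isascii() and page_id.isdigit()):
        raise ValueError(f"pageId must be numeric, got {page_id!r}")
    # The table declares pageId as a String, so the ID is not converted to int.
    return {"pageId": page_id}


actor_input = build_input(61553294532484)
print(json.dumps(actor_input))
```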
## Analytical Use Cases
- Engagement Pattern Analysis: Data scientists can construct time-series models using `scraped_at`, `engagement_metrics`, and `post_type` to identify posting schedules and content formats that maximize reach and interaction rates. Note that `scraped_at` records the extraction time, not the publication time of the post.
- Sentiment Correlation Studies: The `sentiment_score` field enables correlation analysis between emotional tone and engagement performance, supporting A/B testing hypotheses for content optimization.
- Competitive Intelligence: By extracting posts from multiple Pages within the same `page_info.category`, analysts can benchmark engagement metrics, content strategies, and CTA effectiveness across industry competitors.
- Viral Content Identification: Threshold-based filtering on the ratio of `engagement_metrics.shares` to `engagement_metrics.reach` enables identification of viral content patterns for replication in future campaigns.
- Media Performance Attribution: The `media` object supports multivariate analysis of how media type, count, and dimensions affect engagement outcomes across different audience segments.
- Geographic Targeting Optimization: The `targeting.location` array facilitates cohort analysis to determine which geographic markets show the highest engagement rates for specific content types.
- Longitudinal Content Strategy Studies: A consistent schema across extraction runs enables panel data analysis tracking how changes in Page strategy (measured through `post_type`, `cta_data`, and content themes) correlate with follower growth and engagement trends over extended periods.
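
The threshold-based filtering described under Viral Content Identification could look roughly like this. It is a sketch, not a recommended methodology: `share_rate` and `viral_candidates` are hypothetical helpers, the 0.01 threshold is an arbitrary illustrative value, and every post other than the sample record is made up.

```python
def share_rate(post):
    """Shares divided by estimated reach, or None when it cannot be computed."""
    metrics = post.get("engagement_metrics") or {}
    shares = metrics.get("shares")
    reach = metrics.get("reach")
    if shares is None or not reach:
        return None  # missing values or zero reach: ratio is undefined
    return shares / reach


def viral_candidates(posts, min_share_rate=0.01):
    """Return (post_id, share_rate) pairs at or above the threshold, highest first."""
    flagged = []
    for post in posts:
        rate = share_rate(post)
        if rate is not None and rate >= min_share_rate:
            flagged.append((post["post_id"], rate))
    return sorted(flagged, key=lambda item: item[1], reverse=True)


posts = [
    # Values taken from the sample record.
    {"post_id": "page_post_789012345678901",
     "engagement_metrics": {"shares": 156, "reach": 12567}},
    # Hypothetical records for illustration.
    {"post_id": "hypothetical_low",
     "engagement_metrics": {"shares": 3, "reach": 9000}},
    {"post_id": "hypothetical_no_reach",
     "engagement_metrics": {"shares": 10, "reach": None}},
]

for post_id, rate in viral_candidates(posts):
    print(post_id, round(rate, 4))
```

Because `reach` is an estimate, a ratio built on it inherits that uncertainty, so the threshold is better treated as a screening heuristic than as a precise cutoff.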
## Technical Limitations
Important Considerations:
- Rate Limiting: Extraction throughput is subject to platform-imposed rate limits. Recommended batch size is 100 posts per execution to maintain stability.
- Data Freshness: Engagement metrics (`reactions`, `comments`, `shares`, `reach`) represent point-in-time snapshots and do not update retroactively. Historical engagement growth requires multiple extraction runs.
- Cookieless Constraints: Operating without authentication limits access to public posts only. Private or restricted content will not appear in extraction results.
- Sentiment Score Methodology: The `sentiment_score` is computed using lexicon-based analysis and may not capture context-dependent sentiment nuances or sarcasm.
- Media Dimension Availability: The `media.dimensions` object may return null values for certain media types (videos, documents) where dimensional metadata is not exposed.
- Reach Metric Accuracy: The `reach` value is an estimated metric and may not reflect exact unique user counts due to platform calculation methodologies.
- Historical Data Retention: Facebook Pages typically expose the most recent 100-500 posts via public interfaces. Comprehensive historical archives may require incremental extraction strategies.
- Targeting Data Completeness: The `targeting` object reflects advertised targeting parameters when available; organic posts may return default values ("all", null arrays).
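
Since engagement metrics are point-in-time snapshots, growth has to be derived by comparing two extraction runs. The sketch below assumes both runs are lists of records keyed by `post_id`; `engagement_deltas` is a hypothetical helper, and the second run's numbers are invented for illustration.

```python
METRICS = ("reactions", "comments", "shares", "reach")


def engagement_deltas(earlier_run, later_run):
    """Per-post metric change between two runs, for posts present in both."""
    earlier_by_id = {post["post_id"]: post for post in earlier_run}
    deltas = {}
    for post in later_run:
        before = earlier_by_id.get(post["post_id"])
        if before is None:
            continue  # post was not captured in the earlier run
        old = before.get("engagement_metrics") or {}
        new = post.get("engagement_metrics") or {}
        # Only compare metrics that are integers in both snapshots.
        deltas[post["post_id"]] = {
            name: new[name] - old[name]
            for name in METRICS
            if isinstance(old.get(name), int) and isinstance(new.get(name), int)
        }
    return deltas


run_1 = [  # values from the sample record
    {"post_id": "page_post_789012345678901",
     "engagement_metrics": {"reactions": 1234, "comments": 89,
                            "shares": 156, "reach": 12567}},
]
run_2 = [  # hypothetical later snapshot of the same post
    {"post_id": "page_post_789012345678901",
     "engagement_metrics": {"reactions": 1300, "comments": 95,
                            "shares": 160, "reach": 13000}},
    {"post_id": "hypothetical_new_post",
     "engagement_metrics": {"reactions": 10, "comments": 0,
                            "shares": 1, "reach": 200}},
]

print(engagement_deltas(run_1, run_2))
```

Posts that appear only in the later run are skipped because no baseline exists for them; storing each run with its `scraped_at` value makes it possible to convert these deltas into rates.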