Tiktok User Popular Posts Dataset (Full History) - cookieless avatar
Tiktok User Popular Posts Dataset (Full History) - cookieless
Under maintenance

Pricing

from $1.50 / 1,000 results

Go to Apify Store
Tiktok User Popular Posts Dataset (Full History) - cookieless

Tiktok User Popular Posts Dataset (Full History) - cookieless

Under maintenance

Extract high-fidelity Tiktok user popular posts dataset with granular metadata, hidden engagement fields, and precise timestamps. Comprehensive, structured data for viral content trend analysis and strategic content optimization.

Pricing

from $1.50 / 1,000 results

Rating

0.0

(0)

Developer

Surge Street

Surge Street

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

4 days ago

Last modified

Share

Tiktok User Popular Posts Dataset (Full History)

Overview

This actor performs a deep extraction of TikTok user post histories, capturing comprehensive engagement metrics, content metadata, and author attributes across the complete temporal range of publicly accessible posts. The dataset maintains strict schema consistency and implements cryptographic content verification to ensure data integrity. All timestamps are captured in ISO 8601 format with UTC normalization, and records include scrape-time metadata for temporal analysis and data lineage tracking.

Data Dictionary

Field NameData TypeDefinition
post_idStringUnique internal identifier for the post record
external_idStringUUID-format external reference identifier for cross-platform correlation
scraped_atString (ISO 8601)UTC timestamp indicating when the data extraction occurred
titleStringPost title or primary text content
view_countIntegerTotal number of views recorded for the post
language_codeStringISO 639-1 two-letter language code of the post content
is_verifiedBooleanIndicates whether the author account has platform verification status
is_pinnedBooleanIndicates whether the post is pinned to the user's profile
sentiment_scoreFloatNormalized sentiment analysis score ranging from -1.0 (negative) to 1.0 (positive)
author.user_idStringUnique identifier for the post author
author.usernameStringDisplay username of the post author
author.follower_countIntegerTotal follower count for the author at time of scrape
author.reputation_scoreIntegerPlatform-calculated reputation metric (0-100 scale)
engagement_metrics.upvotesIntegerTotal number of upvotes/likes received
engagement_metrics.sharesIntegerTotal number of times the post was shared
engagement_metrics.savesIntegerTotal number of times the post was bookmarked/saved
engagement_metrics.commentsIntegerTotal number of comments on the post
content_metadata.word_countIntegerTotal word count of the post content
content_metadata.reading_time_minutesIntegerEstimated reading time in minutes
content_metadata.categoryStringPrimary content category classification
content_metadata.tagsArray[String]List of hashtags or topic tags associated with the post
visibilityStringPost visibility setting (public, private, unlisted)
last_editedString (ISO 8601)UTC timestamp of the most recent edit to the post
platform_scoreFloatAlgorithmic quality score assigned by the platform (0.0-1.0 scale)
content_hashStringSHA-256 cryptographic hash of post content for deduplication and integrity verification

Sample Dataset

Below is a sample of the high-fidelity JSON output:

{
"post_id": "usr_p_789456123",
"external_id": "b67d9e2f-a451-4c88-9d23-f8901234abcd",
"scraped_at": "2025-12-19T14:22:31Z",
"title": "Understanding Machine Learning Basics",
"view_count": 45231,
"language_code": "en",
"is_verified": true,
"is_pinned": false,
"sentiment_score": 0.87,
"author": {
"user_id": "usr_456789",
"username": "tech_explorer",
"follower_count": 28456,
"reputation_score": 92
},
"engagement_metrics": {
"upvotes": 1234,
"shares": 567,
"saves": 890,
"comments": 234
},
"content_metadata": {
"word_count": 1567,
"reading_time_minutes": 8,
"category": "technology",
"tags": ["machine-learning", "programming", "data-science"]
},
"visibility": "public",
"last_edited": "2025-12-18T09:15:22Z",
"platform_score": 0.94,
"content_hash": "sha256:2cf24dba5fb0a30e26e83b2ac5b9e29e1b161e5c1fa7425e73043362938b9824"
}

Configuration Parameters

To ensure optimal data depth, configure the following:

ParameterField NameRequiredFormatExample
TikTok Secure User IDsecUidYesAlphanumeric stringSS4gLjABAAAAqB08cUbXaDWqbD9MCga2RbGTuhfO2EsHayBYx08NDrN7IE3iQgRDNNN6YtyfH6_9

Description: The secUid parameter is the platform-specific secure identifier for a TikTok user account. This value can be extracted from the user's profile URL or obtained through the TikTok API. The actor will retrieve the complete post history associated with this user identifier.

Analytical Use Cases

This dataset supports multiple research and business intelligence applications:

  • Sentiment Analysis: Leverage the sentiment_score field alongside engagement metrics to correlate emotional valence with viral performance patterns
  • Content Performance Modeling: Build predictive models using platform_score, engagement_metrics, and content_metadata to identify high-performing content characteristics
  • Longitudinal Studies: Utilize scraped_at and last_edited timestamps to track content evolution and engagement decay curves over time
  • Network Mapping: Analyze author.follower_count and author.reputation_score in conjunction with engagement patterns to map influence networks
  • Topic Trend Analysis: Aggregate content_metadata.tags and content_metadata.category fields to identify emerging content themes and seasonal patterns
  • Cross-Platform Benchmarking: Use external_id for correlation with datasets from other social platforms to conduct comparative performance analysis
  • Content Authenticity Verification: Apply content_hash for deduplication pipelines and to detect content replication across accounts

Technical Limitations

Important Considerations:

  • Rate Limiting: The actor respects platform API rate limits of approximately 200 requests per hour per IP address. Large-scale extractions may require distributed execution or extended runtime windows.
  • Historical Depth: Post history retrieval is limited to publicly accessible content. Private or deleted posts are not captured. Platform API restrictions may limit historical data to the most recent 3,000 posts per user.
  • Data Freshness: The scraped_at timestamp indicates point-in-time capture. Engagement metrics (view_count, upvotes, etc.) represent values at extraction time and will not auto-update.
  • Verification Status: The is_verified field reflects account status at scrape time and may change if verification is granted or revoked post-extraction.
  • Sentiment Accuracy: The sentiment_score is generated through NLP models with approximately 82% accuracy on social media content. Manual validation is recommended for critical analyses.
  • Content Hash Stability: The content_hash is calculated on text content only and does not include media assets. Edits to post text will generate new hash values.
  • Language Detection: The language_code field uses automated detection with 95% accuracy for major languages. Mixed-language posts default to the dominant language detected.

Keywords & Tags: This dataset supports workflows commonly associated with instagram scraper, export instagram posts, instagram post extractor, popular posts scraper, social media data extractor, lead generation from social media, twitter post scraper, and facebook post scraper methodologies for cross-platform social media intelligence gathering.