Twitter (X) User Tweets Extractor (Rich Metadata) cookieless avatar
Twitter (X) User Tweets Extractor (Rich Metadata) cookieless

Pricing

from $1.50 / 1,000 results

Go to Apify Store
Twitter (X) User Tweets Extractor (Rich Metadata) cookieless

Twitter (X) User Tweets Extractor (Rich Metadata) cookieless

Extract high-fidelity Twitter user tweet metadata without cookies, capturing granular engagement metrics, hidden fields, and comprehensive timestamps. Structured, analysis-ready data for precise social media sentiment and strategic marketing intelligence.

Pricing

from $1.50 / 1,000 results

Rating

0.0

(0)

Developer

Surge Street

Surge Street

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

16 hours ago

Last modified

Share

"# Twitter (X.com) User Tweets Extractor (Rich Metadata) cookieless

Overview

This actor performs a deep extraction of tweet-level data from specified Twitter user timelines, capturing comprehensive metadata including engagement metrics, author profiles, entity annotations, and geolocation information. The extraction process operates without authentication cookies, ensuring reliable data collection with high fidelity schema adherence. All timestamps are normalized to ISO 8601 format, and nested objects maintain referential integrity for downstream analytical workflows.

Data Dictionary

Field NameData TypeDefinition
tweet_idStringUnique identifier assigned by Twitter to the tweet object
external_idStringInternal tracking identifier prefixed with 'tw_' for cross-system reconciliation
created_atString (ISO 8601)UTC timestamp indicating when the tweet was originally published
scraped_atString (ISO 8601)UTC timestamp recording when the extraction process captured this record
textStringFull UTF-8 encoded tweet content including hashtags and mentions
language_codeStringISO 639-1 two-letter language code detected by Twitter's classification system
is_retweetBooleanFlag indicating whether this tweet is a retweet of another user's content
is_quoteBooleanFlag indicating whether this tweet quotes another tweet with added commentary
author.user_idStringUnique numeric identifier for the tweet author's account
author.usernameStringTwitter handle (without @ symbol) of the account that published the tweet
author.display_nameStringHuman-readable name displayed on the author's profile
author.is_verifiedBooleanIndicates whether the account has Twitter verification status (blue checkmark)
author.follower_countIntegerNumber of accounts following this user at time of extraction
author.following_countIntegerNumber of accounts this user follows at time of extraction
metrics.retweet_countIntegerCumulative number of times this tweet has been retweeted
metrics.like_countIntegerCumulative number of likes (favorites) received by this tweet
metrics.reply_countIntegerNumber of direct replies to this tweet
metrics.quote_countIntegerNumber of quote tweets referencing this original tweet
entities.hashtagsArray[String]List of hashtag strings extracted from tweet text (without # symbol)
entities.mentionsArray[String]List of @username mentions contained within the tweet
entities.urlsArray[String]List of expanded URLs shared in the tweet content
sentiment_scoreFloatNormalized sentiment polarity score ranging from -1.0 (negative) to 1.0 (positive)
device_sourceStringClient application or platform used to publish the tweet
geo.coordinates.latFloatLatitude coordinate if geolocation data is attached to the tweet
geo.coordinates.lngFloatLongitude coordinate if geolocation data is attached to the tweet
geo.place_idStringTwitter Place ID referencing a named location entity
conversation_idStringIdentifier linking this tweet to its parent conversation thread

Sample Dataset

Below is a sample of the high-fidelity JSON output:

{
""tweet_id"": ""1472891234567890123"",
""external_id"": ""tw_15f2e8b9c7d6a4"",
""created_at"": ""2025-12-21T15:30:22Z"",
""scraped_at"": ""2025-12-21T16:00:00Z"",
""text"": ""Just shared our latest findings on renewable energy adoption rates #sustainability"",
""language_code"": ""en"",
""is_retweet"": false,
""is_quote"": false,
""author"": {
""user_id"": ""283749162"",
""username"": ""green_tech_news"",
""display_name"": ""Green Tech Daily"",
""is_verified"": true,
""follower_count"": 52891,
""following_count"": 1234
},
""metrics"": {
""retweet_count"": 342,
""like_count"": 1205,
""reply_count"": 89,
""quote_count"": 27
},
""entities"": {
""hashtags"": [""sustainability""],
""mentions"": [],
""urls"": []
},
""sentiment_score"": 0.78,
""device_source"": ""Twitter Web App"",
""geo"": {
""coordinates"": {
""lat"": 40.7128,
""lng"": -74.0060
},
""place_id"": ""01a9a39c27f5cb71""
},
""conversation_id"": ""1472891234567890123""
}

Configuration Parameters

To ensure optimal data depth, configure the following:

ParameterJSON Field NameData TypeRequiredDescriptionExample Value
UsernameuserIdStringYesTwitter handle of the target account (without @ symbol)elonmusk

Analytical Use Cases

Sentiment Analysis: Leverage the sentiment_score field alongside text content to perform time-series sentiment tracking across brand mentions or topic clusters. Aggregate sentiment by date ranges to identify reputation trends.

Engagement Pattern Analysis: Utilize metrics object fields (retweet_count, like_count, reply_count, quote_count) to calculate engagement rates, identify viral content thresholds, and benchmark performance against follower base size.

Network Mapping: Extract entities.mentions arrays to construct directed graphs of user interactions, identifying key influencers and community clusters within specific discourse networks.

Temporal Content Strategy: Analyze created_at timestamps in conjunction with engagement metrics to determine optimal posting schedules and content lifecycle patterns for audience segments.

Geospatial Audience Profiling: When geo data is present, map tweet origins to understand geographic distribution of engaged audiences and regional sentiment variations.

Longitudinal Studies: Track changes in author.follower_count and author.is_verified status over multiple extraction runs to monitor account growth trajectories and verification events.

Technical Limitations

Important Considerations:

  • Extraction operates within Twitter's public data access boundaries; protected accounts and deleted tweets are not retrievable
  • The geo object will contain null values for tweets without location data enabled by the author
  • Sentiment scores are algorithmically generated and should be validated against domain-specific lexicons for specialized industries
  • Historical tweet availability may be limited to the most recent 3,200 tweets per user timeline due to platform API constraints
  • Rate limiting may affect extraction velocity; recommended batch size is 200 tweets per request cycle
  • The scraped_at timestamp reflects extraction time, not data freshness; compare with created_at to assess temporal lag
  • Engagement metrics (metrics object) represent point-in-time snapshots and will not auto-update as tweets continue to accumulate interactions
  • Cookieless operation ensures compliance but may result in reduced access to certain premium metadata fields available through authenticated endpoints

Keywords & Tags: This specification supports workflows involving twitter scraper, twitter user tweets, extract tweets from users, export tweets, tweet scraper API, twitter data extraction, and lead generation from tweets for social media intelligence applications."