Threads Post Scraper
Pricing
from $0.00 / result
Pricing
from $0.00 / result
Rating
0.0
(0)
Developer

sinam7
Actor stats
1
Bookmarked
10
Total users
6
Monthly active users
16 days ago
Last modified
Categories
Share
Scrape Threads posts and comments from a single post URL. Filters out profile images and recommended posts, returning only the original post and its related replies.
What this actor does
- Accepts a single Threads post URL
- Extracts post content, author info, timestamps, images, and comments
- Automatically filters out:
- Profile thumbnails (s150x150, s42x42)
- Unrelated recommended posts from Threads feed
- Returns only posts from the original author or replies to the original post
Input
Example input
{"url": "https://www.threads.net/@username/post/ABC123"}
Input fields
| Field | Type | Required | Notes |
|---|---|---|---|
| url | string | Yes | Full Threads post URL (e.g., https://www.threads.net/@username/post/ABC123) |
Output
The actor outputs an array of post objects to the dataset.
Example output item
[{"authorId": "/@hyle_edu.kr","authorName": null,"content": "hyle_edu.kr\nDesign Threads\nCursor 디자인 총괄이 말아주는(낋여주는) 스타트업 랜딩 페이지 리뷰 영상\nUI/UX 디자이너분들께 실전 레이아웃·카피 피드백이 꽤 도움 될 것 같습니다.\n5가지 핵심 교훈:\n1) 첫 화면에서 한 줄로 정체를 밝혀라.\n2) 누구를 위한 제품인지 콕 집어 말해라.\n3) 한 화면에 CTA는 딱 하나만 둬라.\n4) 보라 그라디언트·AI 템플릿 느낌을 버려라.\n5) 말로 돌리지 말고, 바로 데모를 보여줘라. \nTranslate","timestamp": "2025-11-27T08:04:22.000Z","images": ["https://scontent-iad3-2.cdninstagram.com/v/t51.2885-15/588000954_17852930637589530_3115163017750846664_n.jpg?stp=dst-jpg_e35_tt6&efg=eyJ2ZW5jb2RlX3RhZyI6InRocmVhZHMuRkVFRC5pbWFnZV91cmxnZW4uMjE2MHgxMjA2LnNkci5mODI3ODcuZGVmYXVsdF9pbWFnZS5jMiJ9&_nc_ht=scontent-iad3-2.cdninstagram.com&_nc_cat=106&_nc_oc=Q6cZ2QHgrxrOBEi_eqXMrmfiDdX3j_o7E4JnDU2yJd9VGNapB_a-senBjgZIZW_Il9lVgJI&_nc_ohc=WeCMxiN7Zc4Q7kNvwGggxko&_nc_gid=CZmmu7zoN4syiEHB2bNy1A&edm=APs17CUBAAAA&ccb=7-5&ig_cache_key=Mzc3NDk2Mjg2NDU1NDcxNzQwMg%3D%3D.3-ccb7-5&oh=00_Afm3SeH-ljp6D7EhRYx0phszFpE84lTMXT6l5lDUhMFC8Q&oe=69360721&_nc_sid=10d13b"],"postUrl": "https://www.threads.net/@hyle_edu.kr/post/DRjXLmGEzDa","isOriginalAuthor": true},{"authorId": "/@hyle_edu.kr","authorName": null,"content": "hyle_edu.kr\nDesign Threads\nAuthor\nyoutube.com/watch…\nyoutube.com\nCursor Head of Design Reviews Startup Websites","timestamp": "2025-11-27T08:04:27.000Z","images": ["https://external-iad3-1.xx.fbcdn.net/emg1/v/t13/4510717797100347293?stp=dst-src&url=https%3A%2F%2Fi.ytimg.com%2Fvi%2FRynySryqM_0%2Fmaxresdefault.jpg&utld=ytimg.com&_nc_gid=CZmmu7zoN4syiEHB2bNy1A&_nc_oc=AdnnBnpQiJ-ImFHLB1zQ6KdkwE3eYgejFX751eNz-RdBMIC6qxpnkF8Fk2HROXnpQZk&ccb=13-1&oh=06_Q3-5AXm8t92Bc1cF_D9RD01869Pwh9uLvqptfXeRGYMGpgoL&oe=6931F0AD&_nc_sid=1d65fc"],"postUrl": "https://www.threads.net/@hyle_edu.kr/post/DRjXMNbE5Ds","isOriginalAuthor": true}]
Output fields
| Field | Type | Description |
|---|---|---|
| authorId | string | Author's Threads ID (e.g., "/@username") |
| authorName | string|null | Author's display name (often null in current implementation) |
| content | string | Post text content, newline-separated |
| timestamp | string | ISO 8601 timestamp |
| images | string[] | Array of image URLs (excludes profile pictures) |
| postUrl | string | Full URL to the post |
| isOriginalAuthor | boolean | Whether this post is from the original thread author |
Features
Smart filtering
Profile image exclusion: Automatically removes profile thumbnails by detecting thumbnail size patterns (s150x150, s42x42) in URLs.
Feed recommendation filtering: Threads shows recommended posts mixed with replies. This actor filters them out by:
- Identifying the original post author from the URL
- Keeping only posts from that author or replies to their posts
- Excluding unrelated recommended content
Stable selectors
Uses DOM attributes that are less likely to change:
div[data-pressable-container="true"]for post containersa[href^="/@"]for author linkstimeelements for timestampsspan[dir="auto"]for text content
Avoids obfuscated React Native class names like x1lliihq that change frequently.
Usage
Via Apify Console
- Open the actor in Apify Console
- Click Run
- Enter a Threads post URL in the input
- Start the run
- View results in the Dataset tab
Via API
Start a run
curl -X POST "https://api.apify.com/v2/acts/YOUR_ACTOR_ID/runs?token=YOUR_TOKEN&waitForFinish=120" \-H "Content-Type: application/json" \-d '{"url": "https://www.threads.net/@username/post/ABC123"}'
The waitForFinish=120 parameter makes the request wait up to 120 seconds for results (synchronous mode).
Fetch results from dataset
$curl "https://api.apify.com/v2/datasets/DATASET_ID/items?token=YOUR_TOKEN"
Integration with n8n
This actor is designed to work with n8n Cloud via a sub-workflow:
- HTTP Request node calls the actor
- HTTP Request node fetches results from the dataset
- Results are passed to AI for summarization
See the blog post for detailed n8n integration guide.
Limitations
- Public content only: No login/cookies support
- DOM structure dependency: Threads may change their HTML structure at any time
- Rate limiting: Respect Apify's rate limits and Threads' terms of service
- No pagination: Fetches only initially loaded comments (typically first 10-20)
Notes
-
The actor waits 3 seconds after page load to ensure dynamic content is rendered
-
authorNameis often null because Threads stores author names in complex nested structures -
Image URLs are CDN links that may expire after some time
-
This actor is optimized for single post scraping, not bulk operations
-
Update the size filters (s150x150, s42x42) in the code