Threads Post Scraper avatar
Threads Post Scraper

Pricing

from $0.00 / result

Go to Apify Store
Threads Post Scraper

Threads Post Scraper

Scrape Threads posts and comments, post author only.

Pricing

from $0.00 / result

Rating

0.0

(0)

Developer

sinam7

sinam7

Maintained by Community

Actor stats

1

Bookmarked

10

Total users

6

Monthly active users

16 days ago

Last modified

Share

Scrape Threads posts and comments from a single post URL. Filters out profile images and recommended posts, returning only the original post and its related replies.

What this actor does

  • Accepts a single Threads post URL
  • Extracts post content, author info, timestamps, images, and comments
  • Automatically filters out:
    • Profile thumbnails (s150x150, s42x42)
    • Unrelated recommended posts from Threads feed
  • Returns only posts from the original author or replies to the original post

Input

Example input

{
"url": "https://www.threads.net/@username/post/ABC123"
}

Input fields

FieldTypeRequiredNotes
urlstringYesFull Threads post URL (e.g., https://www.threads.net/@username/post/ABC123)

Output

The actor outputs an array of post objects to the dataset.

Example output item

[
{
"authorId": "/@hyle_edu.kr",
"authorName": null,
"content": "hyle_edu.kr\nDesign Threads\nCursor 디자인 총괄이 말아주는(낋여주는) 스타트업 랜딩 페이지 리뷰 영상\nUI/UX 디자이너분들께 실전 레이아웃·카피 피드백이 꽤 도움 될 것 같습니다.\n5가지 핵심 교훈:\n1) 첫 화면에서 한 줄로 정체를 밝혀라.\n2) 누구를 위한 제품인지 콕 집어 말해라.\n3) 한 화면에 CTA는 딱 하나만 둬라.\n4) 보라 그라디언트·AI 템플릿 느낌을 버려라.\n5) 말로 돌리지 말고, 바로 데모를 보여줘라.  \nTranslate",
"timestamp": "2025-11-27T08:04:22.000Z",
"images": [
"https://scontent-iad3-2.cdninstagram.com/v/t51.2885-15/588000954_17852930637589530_3115163017750846664_n.jpg?stp=dst-jpg_e35_tt6&efg=eyJ2ZW5jb2RlX3RhZyI6InRocmVhZHMuRkVFRC5pbWFnZV91cmxnZW4uMjE2MHgxMjA2LnNkci5mODI3ODcuZGVmYXVsdF9pbWFnZS5jMiJ9&_nc_ht=scontent-iad3-2.cdninstagram.com&_nc_cat=106&_nc_oc=Q6cZ2QHgrxrOBEi_eqXMrmfiDdX3j_o7E4JnDU2yJd9VGNapB_a-senBjgZIZW_Il9lVgJI&_nc_ohc=WeCMxiN7Zc4Q7kNvwGggxko&_nc_gid=CZmmu7zoN4syiEHB2bNy1A&edm=APs17CUBAAAA&ccb=7-5&ig_cache_key=Mzc3NDk2Mjg2NDU1NDcxNzQwMg%3D%3D.3-ccb7-5&oh=00_Afm3SeH-ljp6D7EhRYx0phszFpE84lTMXT6l5lDUhMFC8Q&oe=69360721&_nc_sid=10d13b"
],
"postUrl": "https://www.threads.net/@hyle_edu.kr/post/DRjXLmGEzDa",
"isOriginalAuthor": true
},
{
"authorId": "/@hyle_edu.kr",
"authorName": null,
"content": "hyle_edu.kr\nDesign Threads\nAuthor\nyoutube.com/watch…\nyoutube.com\nCursor Head of Design Reviews Startup Websites",
"timestamp": "2025-11-27T08:04:27.000Z",
"images": [
"https://external-iad3-1.xx.fbcdn.net/emg1/v/t13/4510717797100347293?stp=dst-src&url=https%3A%2F%2Fi.ytimg.com%2Fvi%2FRynySryqM_0%2Fmaxresdefault.jpg&utld=ytimg.com&_nc_gid=CZmmu7zoN4syiEHB2bNy1A&_nc_oc=AdnnBnpQiJ-ImFHLB1zQ6KdkwE3eYgejFX751eNz-RdBMIC6qxpnkF8Fk2HROXnpQZk&ccb=13-1&oh=06_Q3-5AXm8t92Bc1cF_D9RD01869Pwh9uLvqptfXeRGYMGpgoL&oe=6931F0AD&_nc_sid=1d65fc"
],
"postUrl": "https://www.threads.net/@hyle_edu.kr/post/DRjXMNbE5Ds",
"isOriginalAuthor": true
}
]

Output fields

FieldTypeDescription
authorIdstringAuthor's Threads ID (e.g., "/@username")
authorNamestring|nullAuthor's display name (often null in current implementation)
contentstringPost text content, newline-separated
timestampstringISO 8601 timestamp
imagesstring[]Array of image URLs (excludes profile pictures)
postUrlstringFull URL to the post
isOriginalAuthorbooleanWhether this post is from the original thread author

Features

Smart filtering

Profile image exclusion: Automatically removes profile thumbnails by detecting thumbnail size patterns (s150x150, s42x42) in URLs.

Feed recommendation filtering: Threads shows recommended posts mixed with replies. This actor filters them out by:

  • Identifying the original post author from the URL
  • Keeping only posts from that author or replies to their posts
  • Excluding unrelated recommended content

Stable selectors

Uses DOM attributes that are less likely to change:

  • div[data-pressable-container="true"] for post containers
  • a[href^="/@"] for author links
  • time elements for timestamps
  • span[dir="auto"] for text content

Avoids obfuscated React Native class names like x1lliihq that change frequently.

Usage

Via Apify Console

  1. Open the actor in Apify Console
  2. Click Run
  3. Enter a Threads post URL in the input
  4. Start the run
  5. View results in the Dataset tab

Via API

Start a run

curl -X POST "https://api.apify.com/v2/acts/YOUR_ACTOR_ID/runs?token=YOUR_TOKEN&waitForFinish=120" \
-H "Content-Type: application/json" \
-d '{
"url": "https://www.threads.net/@username/post/ABC123"
}'

The waitForFinish=120 parameter makes the request wait up to 120 seconds for results (synchronous mode).

Fetch results from dataset

$curl "https://api.apify.com/v2/datasets/DATASET_ID/items?token=YOUR_TOKEN"

Integration with n8n

This actor is designed to work with n8n Cloud via a sub-workflow:

  1. HTTP Request node calls the actor
  2. HTTP Request node fetches results from the dataset
  3. Results are passed to AI for summarization

See the blog post for detailed n8n integration guide.

Limitations

  • Public content only: No login/cookies support
  • DOM structure dependency: Threads may change their HTML structure at any time
  • Rate limiting: Respect Apify's rate limits and Threads' terms of service
  • No pagination: Fetches only initially loaded comments (typically first 10-20)

Notes

  • The actor waits 3 seconds after page load to ensure dynamic content is rendered

  • authorName is often null because Threads stores author names in complex nested structures

  • Image URLs are CDN links that may expire after some time

  • This actor is optimized for single post scraping, not bulk operations

  • Update the size filters (s150x150, s42x42) in the code