
Sora Scraper
Pricing
Pay per event

Sora Scraper
Discover AI-generated video insights from OpenAI’s Sora 2 community—extract posts, media, user profiles, comments, and engagement metrics. Perfect for trend analysis, content curation, influencer tracking, and research. Fast, reliable, and fully customizable.
5.0 (4)
Pricing
Pay per event
7
16
16
Last modified
2 days ago
Sora Scraper
The Sora Scraper is an Apify actor for OpenAI's Sora 2 — the next generation of AI video creation platform. Extract posts, videos, engagement metrics, user profiles, and comments from Sora's community with unprecedented ease.
✨ Key Features
- 🚀 First-to-Market: The only scraper built specifically for Sora 2.
- 🎬 Media Downloads: Download videos, thumbnails, and GIFs directly from posts.
- 💬 Comment Extraction: Capture detailed comment threads with engagement data.
- 👤 Rich Profile Data: Extract complete user profiles including followers, verified status, and more.
- 📊 Engagement Metrics: Views, unique views, likes, remixes, replies, and recursive replies.
- 🔍 Flexible Search: Query any topic and discover Sora's creative community.
- ⚙️ Granular Control: Configure comment limits, media downloads, and result counts.
- 📦 Structured Output: Normalized JSON data ready for analysis and integration.
💡 Why It's Important
Sora represents OpenAI's breakthrough in AI-generated video content, and Sora 2 is the latest evolution. With this scraper, you can:
- Monitor trending content in the AI video generation space.
- Analyze user engagement patterns and viral content characteristics.
- Archive creative works for research, inspiration, or competitive analysis.
- Track community growth and user interactions in real-time.
- Build datasets for AI content analysis, sentiment studies, and trend forecasting.
👤 Who Is It For?
- AI Researchers studying generative video trends and user behavior.
- Content Creators seeking inspiration and understanding what resonates.
- Marketing Agencies monitoring brand mentions and creative trends.
- Data Scientists building datasets for machine learning and analytics.
- Media Companies tracking viral content and emerging creators.
- Developers building applications around AI-generated content.
🚀 Business Use Cases
- Trend Analysis: Identify viral prompts, themes, and creative patterns.
- Content Curation: Aggregate and showcase top Sora creations.
- Competitive Intelligence: Track how competitors use AI video generation.
- Influencer Discovery: Find trending creators and their engagement rates.
- Brand Monitoring: Track mentions and sentiment around your brand.
- Research & Development: Build datasets for AI content analysis.
- Market Research: Understand user preferences in AI-generated content.
🛠 Input Schema
The actor accepts the following input:
{"query": "SpongeBob","numOfComments": 10,"downloadVideo": false,"downloadThumbnail": false,"downloadGIF": false,"maxItems": 10,"proxyConfiguration": {"useApifyProxy": false}}
Input Parameters
Parameter | Type | Required | Description |
---|---|---|---|
query | string | Yes | Search query to find Sora posts (e.g., "SpongeBob", "sunset timelapse") |
numOfComments | integer | No | Maximum number of comments to extract per post (default: 10) |
downloadVideo | boolean | No | Download video files to key-value store. Key stored in videoStoreKey (default: false) |
downloadThumbnail | boolean | No | Download thumbnail images to key-value store. Key stored in thumbnailStoreKey (default: false) |
downloadGIF | boolean | No | Download GIF previews to key-value store. Key stored in gifStoreKey (default: false) |
maxItems | integer | No | Maximum number of posts to scrape (default: 10) |
proxyConfiguration | object | No | Apify proxy configuration for requests |
Notes:
- Required field:
query
is mandatory. - Media Downloads: Enabling video, thumbnail, or GIF downloads will increase run time but provide direct access to media files in the key-value store.
- Comments: Set
numOfComments
to control how many comments are extracted per post. Comments include full profile data and engagement metrics. - Performance: Higher
maxItems
and media downloads will consume more resources and time.
📦 Output Schema
Each dataset item contains comprehensive post data:
{"id": "s_68dca5d7d4ac8191987e9c6393d498d4","text": "spongebob as a ww2 leader speaking about the scourge of fish ruining bikini bottom wearing axis power uniform","caption": null,"link": "https://sora.chatgpt.com/p/s_68dca5d7d4ac8191987e9c6393d498d4","coverUrl": "https://videos.openai.com/vg-assets/...","gifUrl": "https://videos.openai.com/vg-assets/...","postedAt": 1759290839.830908,"updatedAt": 1759936985.530838,"likes": 1289,"replies": 43,"views": 39477,"uniqueViews": 24713,"remixes": 76,"recursiveReplies": 70,"dislikeCount": 0,"workspaceId": null,"postedToPublic": true,"emoji": "🧽","attachments": [{"id": "s_68dca5d7d4ac8191987e9c6393d498d4-attachment-0","title": "New Video","url": "https://sdmntprsouthcentralus.oaiusercontent.com/files/...","downloadableUrl": "https://sdmntprsouthcentralus.oaiusercontent.com/files/...","thumbnail": "https://videos.openai.com/vg-assets/...","gif": "https://videos.openai.com/vg-assets/...","width": 352,"height": 640,"generationId": "gen_01k6eyadhqezmskzd31pp2n2xm","generationType": "video_gen"}],"profile": {"id": "user-vmw00GfT7mSYdcIST7bLbwCF","username": "jakeleventhal","displayName": "Jake Leventhal","profilePictureUrl": "https://sdmntprnorthcentralus.oaiusercontent.com/files/...","coverPhotoUrl": null,"link": "https://sora.chatgpt.com/profile/jakeleventhal","verified": false,"followerCount": 2664,"followingCount": 7,"postCount": 61,"replyCount": 0,"likesReceivedCount": 22854,"remixCount": 1606,"cameoCount": 33,"isBlocked": false,"followedBy": [],"planType": null,"createdAt": 1753852741.285583,"updatedAt": 1759951105.520806,"bannedAt": null,"calpicoIsEnabled": true,"soraWhoCanMessageMe": "followees_only","isPublicFigure": false,"location": null,"description": null,"birthday": null,"website": null},"videoStoreKey": "s_68dca5d7d4ac8191987e9c6393d498d4_video_0.mp4","thumbnailStoreKey": "s_68dca5d7d4ac8191987e9c6393d498d4_thumbnail_0.webp","gifStoreKey": "s_68dca5d7d4ac8191987e9c6393d498d4_gif_0.gif","comments": [{"id": "68dcb87374948191bc6c9f88b5ea723e","text": "Ts gonna be the reason Viacom gonna shut this down😭😭","caption": null,"postedAt": 1759295603.455459,"updatedAt": 1759530609.903473,"likes": 16,"parentPostId": "s_68dca5d7d4ac8191987e9c6393d498d4","rootPostId": "s_68dca5d7d4ac8191987e9c6393d498d4","postUrl": "https://sora.chatgpt.com/p/s_68dca5d7d4ac8191987e9c6393d498d4","profile": {"id": "user-PDq6JrFlZ0qjFVKrdeAmiTnh","username": "skipppz","displayName": "C","profilePictureUrl": "https://cdn.openai.com/sora/images/profile_placeholder_v4.png","verified": false,"followerCount": 1,"followingCount": 2,"postCount": 9,"replyCount": 9,"likesReceivedCount": 98,"remixCount": 2,"cameoCount": 0}}]}
Output Fields Explained
Post Data
id
: Unique post identifiertext
: The prompt/description used to generate the videocaption
: Optional caption textlink
: Direct link to the post on SoracoverUrl
: URL to the cover imagegifUrl
: URL to the animated GIF previewemoji
: Associated emoji for the post
Engagement Metrics
likes
: Number of likesreplies
: Direct reply countviews
: Total view countuniqueViews
: Unique viewer countremixes
: Number of times the video was remixedrecursiveReplies
: Total replies including nested threadsdislikeCount
: Number of dislikes
Timestamps
postedAt
: Unix timestamp when post was createdupdatedAt
: Unix timestamp of last update
Attachments
id
: Attachment identifiertitle
: Attachment titleurl
: Direct video URLdownloadableUrl
: URL for downloadingthumbnail
: Thumbnail image URLgif
: GIF preview URLwidth
/height
: Video dimensionsgenerationId
: Sora generation IDgenerationType
: Type of generation (e.g., "video_gen")
Profile Data
Complete user profile including:
- Username, display name, profile picture
- Verification status
- Follower/following counts
- Post and reply counts
- Likes received, remix count, cameo count
- Account creation and update timestamps
- Privacy settings and location
Downloaded Media Keys
videoStoreKey
: Key-value store key for downloaded video (provided only whendownloadVideo
is enabled)thumbnailStoreKey
: Key-value store key for downloaded thumbnail (provided only whendownloadThumbnail
is enabled)gifStoreKey
: Key-value store key for downloaded GIF (provided only whendownloadGIF
is enabled)
Comments
Array of comment objects with:
- Comment text and timestamps
- Like counts
- Parent and root post IDs
- Full profile data for commenter
- Post URL for context
🎯 Advanced Features
Media Download System
When you enable media downloads (downloadVideo
, downloadThumbnail
, or downloadGIF
), files are automatically saved to Apify's key-value store with predictable keys:
- Videos:
{postId}_video_{index}.mp4
- Thumbnails:
{postId}_thumbnail_{index}.webp
- GIFs:
{postId}_gif_{index}.gif
Access downloaded files programmatically or through the Apify console's key-value store tab.
Comment Threading
Comments maintain parent-child relationships through parentPostId
and rootPostId
fields, allowing you to reconstruct conversation threads. Each comment includes:
- Full commenter profile
- Engagement metrics (likes)
- Timestamps for tracking conversation flow
Engagement Analytics
Track multiple engagement dimensions:
- Virality:
views
anduniqueViews
show reach - Interaction:
likes
,replies
, andrecursiveReplies
measure engagement depth - Creativity:
remixes
show how content inspires others - Trend tracking: Compare metrics across posts to identify patterns
🔧 Best Practices
- Start Small: Test with
maxItems: 10
to understand output structure before scaling. - Media Downloads: Only enable media downloads when necessary — they significantly increase run time.
- Comment Limits: Adjust
numOfComments
based on your needs. High-engagement posts can have hundreds of comments. - Proxy Configuration: Use Apify proxies for reliable access and to respect rate limits.
🌟 Why Choose Our Sora Scraper?
✅ First to Market — The only Sora 2 scraper available
✅ Comprehensive Data — Posts, profiles, comments, engagement metrics
✅ Media Support — Download videos, thumbnails, and GIFs
✅ Production Ready — Structured output, error handling, proxy support
✅ Well Maintained — Regular updates as Sora evolves
✅ Expert Support — Backed by certified Apify Partners
👀 p.s.
Got feedback or need an extension?
Lexis Solutions is a certified Apify Partner. We can help you with custom solutions or data extraction projects.
Contact us over Email or LinkedIn
Support Our Work 💝
If you're happy with our work and scrapers, you're welcome to leave us a company review here and leave a review for the scrapers you're subscribed to. It will take you less than a minute but it will mean a lot to us!
Image Credit: https://sora.chatgpt.com/
On this page
Share Actor: