Bluesky Scraper
Pricing: Pay per usage
Scrape posts from Bluesky. Extract text, author, likes, reposts, replies, images, quoted posts. Search posts or scrape profiles. No login needed.
Developer: Glass Ventures
Scrape posts and profiles from Bluesky (bsky.app) using the public AT Protocol API. Extract text, author info, likes, reposts, replies, images, and quoted posts.
What does Bluesky Scraper do?
Bluesky Scraper extracts public post data from the Bluesky social network. It uses the public AT Protocol API (no authentication required) to scrape user profiles and search for posts by keyword.
Whether you need to monitor brand mentions, analyze social trends, or collect public discourse data, this actor provides structured output with engagement metrics, media attachments, and quoted post content. It handles pagination automatically to collect large datasets efficiently.
Bluesky is a decentralized social network built on the AT Protocol with a rapidly growing user base. This scraper provides an easy way to access public data without needing to set up API credentials.
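To illustrate what automatic pagination usually means for a cursor-based API, here is a minimal sketch. It is not the actor's actual implementation: `collect_posts`, `fetch_page`, and the in-memory pages are hypothetical, and the `feed` / `cursor` response keys are an assumption modeled on how AT Protocol feed endpoints return one page of items plus a cursor for the next page.

```python
from typing import Callable, Optional


def collect_posts(
    fetch_page: Callable[[Optional[str]], dict],
    max_items: int = 100,
) -> list:
    """Follow cursors until max_items posts are collected or pages run out."""
    posts: list = []
    cursor: Optional[str] = None
    while len(posts) < max_items:
        page = fetch_page(cursor)       # one request per page
        items = page.get("feed", [])    # assumed response key
        posts.extend(items)
        cursor = page.get("cursor")     # assumed response key
        if not cursor or not items:     # no further pages available
            break
    return posts[:max_items]


# Hypothetical in-memory pages standing in for real API responses.
_FAKE_PAGES = {
    None: {"feed": ["post-1", "post-2"], "cursor": "page-2"},
    "page-2": {"feed": ["post-3", "post-4"], "cursor": "page-3"},
    "page-3": {"feed": ["post-5"]},
}


def fake_fetch(cursor: Optional[str]) -> dict:
    """Return the fake page stored under the given cursor."""
    return _FAKE_PAGES[cursor]
```

Passing the fetch function in as an argument keeps the loop independent of any HTTP library, so the same logic can be exercised against fake pages before it is pointed at a real endpoint.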
Use Cases
- Market researchers -- Monitor brand mentions and sentiment on Bluesky
- Data analysts -- Analyze engagement patterns, posting frequency, and content trends
- Journalists -- Track public discourse and find sources on trending topics
- Developers -- Collect training data or build dashboards from Bluesky content
Features
- Scrape posts from any public Bluesky profile
- Search posts by keywords or phrases
- Extract full engagement metrics (likes, reposts, replies)
- Capture images and quoted posts
- Automatic pagination for large datasets
- No authentication or API keys required
- Proxy support with automatic rotation
- Exports to JSON, CSV, Excel, or connect via API
How much will it cost?
| Results | Estimated Cost |
|---|---|
| 100 | ~$0.01 |
| 1,000 | ~$0.05 |
| 10,000 | ~$0.50 |

Cost breakdown per 1,000 results:

| Cost Component | Per 1,000 Results |
|---|---|
| Platform compute | ~$0.05 |
| Proxy (datacenter) | ~$0.00 |
| Total | ~$0.05 |
Bluesky's public API is very fast and lightweight, making this one of the most cost-effective social media scrapers available.
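The estimates above scale roughly linearly, so a rough budget can be worked out from the per-1,000 total. The sketch below assumes a flat rate of about $0.05 per 1,000 results taken from the table; `estimate_cost_usd` is a hypothetical helper, and actual platform charges may differ. Note that a strictly linear estimate gives about $0.005 for 100 results, which the table appears to round up to ~$0.01.

```python
COST_PER_1000_RESULTS_USD = 0.05  # "Total" row of the cost table (approximate)


def estimate_cost_usd(results: int) -> float:
    """Linear estimate: results / 1000 * cost per 1,000 results."""
    if results < 0:
        raise ValueError("results must be non-negative")
    return results / 1000 * COST_PER_1000_RESULTS_USD


for n in (100, 1_000, 10_000):
    print(f"{n:>6} results: ~${estimate_cost_usd(n):.3f}")
```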
How to use
- Go to the Bluesky Scraper page on Apify Store
- Click "Start" or "Try for free"
- Enter Bluesky profile URLs, handles, or search terms
- Set the maximum number of posts to scrape
- Click "Start" and wait for the results
Input parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
| startUrls | array | Bluesky profile URLs to scrape | - |
| handles | array | Bluesky handles (e.g. bsky.app) | - |
| searchTerms | array | Search queries to find posts | - |
| maxItems | number | Max posts to return | 100 |
| proxyConfig | object | Proxy settings | Apify Proxy |
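As a sketch of how these parameters fit together, the hypothetical helper below assembles an input object using the field names from the table. The rule that at least one of `startUrls`, `handles`, or `searchTerms` must be supplied is an assumption, not something the actor documents, and the element format of `startUrls` is passed through unchanged because the table does not specify it.

```python
from typing import Optional


def build_run_input(
    handles: Optional[list] = None,
    search_terms: Optional[list] = None,
    start_urls: Optional[list] = None,
    max_items: int = 100,
) -> dict:
    """Assemble an actor input dict, omitting sources that were not provided."""
    if not (handles or search_terms or start_urls):
        # Assumption: a run needs at least one source of posts.
        raise ValueError("provide handles, search_terms, or start_urls")
    if max_items < 1:
        raise ValueError("max_items must be at least 1")

    run_input: dict = {"maxItems": max_items}
    if handles:
        run_input["handles"] = list(handles)
    if search_terms:
        run_input["searchTerms"] = list(search_terms)
    if start_urls:
        run_input["startUrls"] = list(start_urls)  # element format not specified
    return run_input
```

The resulting dictionary could then be passed as `run_input` in the Python API example, for instance `build_run_input(handles=["bsky.app"], max_items=10)` for a small test run.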
Output
The actor produces a dataset with the following fields:
```json
{
  "url": "https://bsky.app/profile/bsky.app/post/3abc123",
  "text": "Welcome to Bluesky!",
  "author": "Bluesky",
  "handle": "bsky.app",
  "likesCount": 1500,
  "repostsCount": 300,
  "repliesCount": 85,
  "createdAt": "2024-06-15T10:30:00.000Z",
  "images": ["https://cdn.bsky.app/img/feed_fullsize/..."],
  "quotedPost": {
    "text": "Original post text",
    "author": "Original Author",
    "handle": "author.bsky.social",
    "url": "https://bsky.app/profile/author.bsky.social/post/xyz789"
  },
  "authorAvatar": "https://cdn.bsky.app/img/avatar/...",
  "postId": "3abc123",
  "scrapedAt": "2026-04-23T12:00:00.000Z"
}
```
| Field | Type | Description |
|---|---|---|
| url | string | Post URL on Bluesky |
| text | string | Post text content |
| author | string | Author display name |
| handle | string | Author Bluesky handle |
| likesCount | number | Number of likes |
| repostsCount | number | Number of reposts |
| repliesCount | number | Number of replies |
| createdAt | string | Post creation timestamp (ISO 8601) |
| images | array | Image URLs attached to the post |
| quotedPost | object | Quoted/embedded post data |
| authorAvatar | string | Author avatar image URL |
| postId | string | AT Protocol post record key |
| scrapedAt | string | ISO 8601 scrape timestamp |
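To show how these fields might be consumed downstream, the sketch below turns one dataset item into a small summary record with a parsed timestamp and a combined engagement count. `PostSummary` and `summarize` are hypothetical names, and treating missing counters as zero is an assumption about how incomplete items should be handled.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class PostSummary:
    url: str
    handle: str
    created_at: datetime
    engagement: int  # likes + reposts + replies


def summarize(item: dict) -> PostSummary:
    """Build a summary from one dataset item using the documented field names."""
    # Older Python versions reject a trailing "Z" in fromisoformat(),
    # so replace it with an explicit UTC offset first.
    created = datetime.fromisoformat(item["createdAt"].replace("Z", "+00:00"))
    engagement = (
        item.get("likesCount", 0)
        + item.get("repostsCount", 0)
        + item.get("repliesCount", 0)
    )
    return PostSummary(item["url"], item["handle"], created, engagement)


sample = {
    "url": "https://bsky.app/profile/bsky.app/post/3abc123",
    "handle": "bsky.app",
    "likesCount": 1500,
    "repostsCount": 300,
    "repliesCount": 85,
    "createdAt": "2024-06-15T10:30:00.000Z",
}
summary = summarize(sample)
```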
Integrations
Connect Bluesky Scraper with other tools:
- Apify API -- REST API for programmatic access
- Webhooks -- get notified when a run finishes
- Zapier / Make -- connect to 5,000+ apps
- Google Sheets -- export directly to spreadsheets
API Example (Node.js)
```javascript
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_TOKEN' });

const run = await client.actor('YOUR_USERNAME/bluesky-scraper').call({
    handles: ['bsky.app'],
    maxItems: 100,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
```
API Example (Python)
```python
from apify_client import ApifyClient

client = ApifyClient('YOUR_TOKEN')

run = client.actor('YOUR_USERNAME/bluesky-scraper').call(run_input={
    'handles': ['bsky.app'],
    'maxItems': 100,
})

items = client.dataset(run['defaultDatasetId']).list_items().items
```
API Example (cURL)
```bash
curl "https://api.apify.com/v2/acts/YOUR_USERNAME~bluesky-scraper/runs" \
  -X POST \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -d '{"handles": ["bsky.app"], "maxItems": 100}'
```
Tips and tricks
- Start with a small `maxItems` (10-20) to test before running large scrapes
- Use handles directly instead of URLs for convenience
- Combine profile scraping with search terms to get comprehensive data
- The public API has generous rate limits, but very large scrapes may benefit from proxy rotation
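If a list of profile URLs is already at hand, converting them to handles is straightforward. The sketch below assumes profile URLs follow the `https://bsky.app/profile/<handle>` shape seen in the sample output; `handle_from_profile_url` is a hypothetical helper, not part of the actor.

```python
from urllib.parse import urlparse


def handle_from_profile_url(value: str) -> str:
    """Return the handle from a profile URL, or the input if it is already a handle."""
    value = value.strip()
    if "://" not in value:
        return value.lstrip("@")  # already a handle, possibly with a leading "@"
    parts = [p for p in urlparse(value).path.split("/") if p]
    if len(parts) >= 2 and parts[0] == "profile":
        return parts[1]  # path looks like /profile/<handle>[/post/<id>]
    raise ValueError(f"not a Bluesky profile URL: {value!r}")


handles = [
    handle_from_profile_url(v)
    for v in ("https://bsky.app/profile/bsky.app", "@author.bsky.social")
]
```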
FAQ
Q: Does this actor require login credentials? A: No. Bluesky Scraper uses the public AT Protocol API which does not require authentication for public data.
Q: How fast is the scraping? A: Approximately 500-1000 posts per minute depending on pagination and network conditions.
Q: What should I do if I get blocked? A: Enable proxy rotation in the Proxy Configuration settings. Datacenter proxies are usually sufficient for Bluesky.
Q: Can I scrape private/protected accounts? A: No. This actor only accesses publicly available data through the AT Protocol public API.
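For planning purposes, the throughput figure quoted above can be turned into a rough run-time range. This is only arithmetic on the stated 500-1000 posts per minute; `estimate_minutes` is a hypothetical helper, and real runs will vary with network conditions.

```python
def estimate_minutes(posts: int, min_rate: int = 500, max_rate: int = 1000) -> tuple:
    """Return (best_case, worst_case) run time in minutes for a number of posts."""
    if posts < 0:
        raise ValueError("posts must be non-negative")
    return posts / max_rate, posts / min_rate


best, worst = estimate_minutes(10_000)
print(f"10,000 posts: roughly {best:.0f} to {worst:.0f} minutes")
```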
Is it legal to scrape Bluesky?
Web scraping of publicly available data is generally considered legal, and precedents such as the hiQ Labs v. LinkedIn case are often cited in support. Bluesky is built on the open AT Protocol, which is designed for data interoperability. This actor only accesses publicly available data through the official public API. Always review and respect the target site's Terms of Service. For more information, see Apify's blog on web scraping legality.
Limitations
- Only public posts are accessible (no private/protected content)
- The public API may have rate limits for very high-volume requests
- Historical search results may be limited by Bluesky's search index depth
- Deleted posts cannot be retrieved
Changelog
- v0.1 (2026-04-23) -- Initial release