Bluesky Scraper avatar

Bluesky Scraper

Pricing

Pay per usage

Go to Apify Store
Bluesky Scraper

Bluesky Scraper

Scrape posts from Bluesky. Extract text, author, likes, reposts, replies, images, quoted posts. Search posts or scrape profiles. No login needed.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Glass Ventures

Glass Ventures

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Scrape posts and profiles from Bluesky (bsky.app) using the public AT Protocol API. Extract text, author info, likes, reposts, replies, images, and quoted posts.

What does Bluesky Scraper do?

Bluesky Scraper extracts public post data from the Bluesky social network. It uses the public AT Protocol API (no authentication required) to scrape user profiles and search for posts by keyword.

Whether you need to monitor brand mentions, analyze social trends, or collect public discourse data, this actor provides structured output with engagement metrics, media attachments, and quoted post content. It handles pagination automatically to collect large datasets efficiently.

Bluesky is a decentralized social network built on the AT Protocol with a rapidly growing user base. This scraper provides an easy way to access public data without needing to set up API credentials.

Use Cases

  • Market researchers -- Monitor brand mentions and sentiment on Bluesky
  • Data analysts -- Analyze engagement patterns, posting frequency, and content trends
  • Journalists -- Track public discourse and find sources on trending topics
  • Developers -- Collect training data or build dashboards from Bluesky content

Features

  • Scrape posts from any public Bluesky profile
  • Search posts by keywords or phrases
  • Extract full engagement metrics (likes, reposts, replies)
  • Capture images and quoted posts
  • Automatic pagination for large datasets
  • No authentication or API keys required
  • Proxy support with automatic rotation
  • Handles pagination and large datasets automatically
  • Exports to JSON, CSV, Excel, or connect via API

How much will it cost?

ResultsEstimated Cost
100~$0.01
1,000~$0.05
10,000~$0.50
Cost ComponentPer 1,000 Results
Platform compute~$0.05
Proxy (datacenter)~$0.00
Total~$0.05

Bluesky's public API is very fast and lightweight, making this one of the most cost-effective social media scrapers available.

How to use

  1. Go to the Bluesky Scraper page on Apify Store
  2. Click "Start" or "Try for free"
  3. Enter Bluesky profile URLs, handles, or search terms
  4. Set the maximum number of posts to scrape
  5. Click "Start" and wait for the results

Input parameters

ParameterTypeDescriptionDefault
startUrlsarrayBluesky profile URLs to scrape-
handlesarrayBluesky handles (e.g. bsky.app)-
searchTermsarraySearch queries to find posts-
maxItemsnumberMax posts to return100
proxyConfigobjectProxy settingsApify Proxy

Output

The actor produces a dataset with the following fields:

{
"url": "https://bsky.app/profile/bsky.app/post/3abc123",
"text": "Welcome to Bluesky!",
"author": "Bluesky",
"handle": "bsky.app",
"likesCount": 1500,
"repostsCount": 300,
"repliesCount": 85,
"createdAt": "2024-06-15T10:30:00.000Z",
"images": ["https://cdn.bsky.app/img/feed_fullsize/..."],
"quotedPost": {
"text": "Original post text",
"author": "Original Author",
"handle": "author.bsky.social",
"url": "https://bsky.app/profile/author.bsky.social/post/xyz789"
},
"authorAvatar": "https://cdn.bsky.app/img/avatar/...",
"postId": "3abc123",
"scrapedAt": "2026-04-23T12:00:00.000Z"
}
FieldTypeDescription
urlstringPost URL on Bluesky
textstringPost text content
authorstringAuthor display name
handlestringAuthor Bluesky handle
likesCountnumberNumber of likes
repostsCountnumberNumber of reposts
repliesCountnumberNumber of replies
createdAtstringPost creation timestamp (ISO 8601)
imagesarrayImage URLs attached to the post
quotedPostobjectQuoted/embedded post data
authorAvatarstringAuthor avatar image URL
postIdstringAT Protocol post record key
scrapedAtstringISO 8601 scrape timestamp

Integrations

Connect Bluesky Scraper with other tools:

  • Apify API -- REST API for programmatic access
  • Webhooks -- get notified when a run finishes
  • Zapier / Make -- connect to 5,000+ apps
  • Google Sheets -- export directly to spreadsheets

API Example (Node.js)

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_TOKEN' });
const run = await client.actor('YOUR_USERNAME/bluesky-scraper').call({
handles: ['bsky.app'],
maxItems: 100,
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();

API Example (Python)

from apify_client import ApifyClient
client = ApifyClient('YOUR_TOKEN')
run = client.actor('YOUR_USERNAME/bluesky-scraper').call(run_input={
'handles': ['bsky.app'],
'maxItems': 100,
})
items = client.dataset(run['defaultDatasetId']).list_items().items

API Example (cURL)

curl "https://api.apify.com/v2/acts/YOUR_USERNAME~bluesky-scraper/runs" \
-X POST \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_TOKEN" \
-d '{"handles": ["bsky.app"], "maxItems": 100}'

Tips and tricks

  • Start with a small maxItems (10-20) to test before running large scrapes
  • Use handles directly instead of URLs for convenience
  • Combine profile scraping with search terms to get comprehensive data
  • The public API has generous rate limits but very large scrapes may benefit from proxy rotation

FAQ

Q: Does this actor require login credentials? A: No. Bluesky Scraper uses the public AT Protocol API which does not require authentication for public data.

Q: How fast is the scraping? A: Approximately 500-1000 posts per minute depending on pagination and network conditions.

Q: What should I do if I get blocked? A: Enable proxy rotation in the Proxy Configuration settings. Datacenter proxies are usually sufficient for Bluesky.

Q: Can I scrape private/protected accounts? A: No. This actor only accesses publicly available data through the AT Protocol public API.

Web scraping of publicly available data is generally legal based on precedents like the LinkedIn v. HiQ Labs case. Bluesky is built on the open AT Protocol which is designed for data interoperability. This actor only accesses publicly available data through the official public API. Always review and respect the target site's Terms of Service. For more information, see Apify's blog on web scraping legality.

Limitations

  • Only public posts are accessible (no private/protected content)
  • The public API may have rate limits for very high-volume requests
  • Historical search results may be limited by Bluesky's search index depth
  • Deleted posts cannot be retrieved

Changelog

  • v0.1 (2026-04-23) -- Initial release