Bluesky Scraper avatar

Bluesky Scraper

Pricing

from $0.50 / 1,000 results

Go to Apify Store
Bluesky Scraper

Bluesky Scraper

Scrape Bluesky (AT Protocol) posts by keyword, hashtag, author handle, or custom feed. Export likes, reposts, replies, hashtags, mentions, embeds, and full metadata as JSON or CSV. Built for AI training datasets, social analytics, brand monitoring, and trend tracking.

Pricing

from $0.50 / 1,000 results

Rating

0.0

(0)

Developer

NIJ KANANI

NIJ KANANI

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

3 days ago

Last modified

Categories

Share

๐Ÿฆ‹ Bluesky Scraper

Scrape posts from Bluesky (the AT Protocol social network) โ€” by keyword, hashtag, author handle, or any custom feed. No coding, no rate-limit headaches. Export to JSON, CSV, Excel, or push directly into your stack via API.

โšก Built for AI dataset builders, social media analysts, brand-monitoring teams, and trend hunters who need clean, structured Bluesky data at scale.


โœจ What you can do

  • ๐Ÿ”Ž Search posts by keyword or hashtag (#ai, bitcoin, climate change)
  • ๐Ÿ‘ค Pull any user's full feed by handle (jay.bsky.team)
  • ๐Ÿ“ก Subscribe to custom feeds by AT URI
  • ๐Ÿ“… Filter by date range and language
  • ๐Ÿ’ฌ Optionally include replies
  • ๐Ÿ“ค Get rich post metadata: text, likes, reposts, replies, embeds, hashtags, mentions, links

๐ŸŽฏ Use cases

WhoWhy
๐Ÿค– AI / LLM teamsBuild clean training datasets from a fast-growing social network
๐Ÿ“Š Social analyticsTrack hashtags, brand mentions, sentiment over time
๐Ÿ•ต๏ธ Competitor monitoringWatch what competitors post, what gets engagement
๐Ÿ“ฐ Journalists & researchersArchive public discourse around news events
๐Ÿ“ˆ Trend huntersFind rising topics & influencers before they hit mainstream

๐Ÿš€ Quick start

  1. Click Try for free
  2. Choose a mode: search, author, or feed
  3. Enter your search terms / handles / feed URIs
  4. (Optional but recommended) Add your Bluesky handle + an App Password for higher rate limits and full reliability
  5. Click Start

That's it. Your data appears in the Dataset tab in seconds.


๐Ÿ“ฅ Input

FieldTypeDescription
modeenumsearch / author / feed
searchTermsarrayKeywords or hashtags (search mode)
authorsarrayBluesky handles (author mode)
feedUrisarrayAT URIs (feed mode)
maxItemsintCap per target (default 1000)
since / untilISO dateDate range filter
languagestring2-letter code (e.g. en)
includeRepliesboolInclude replies in author mode
bskyHandlestringOptional โ€” your Bluesky handle
bskyAppPasswordsecretOptional โ€” your App Password

Example input

{
"mode": "search",
"searchTerms": ["#ai", "llm"],
"maxItems": 5000,
"since": "2026-04-01",
"language": "en",
"bskyHandle": "yourname.bsky.social",
"bskyAppPassword": "xxxx-xxxx-xxxx-xxxx"
}

๐Ÿ“ค Output (per post)

{
"uri": "at://did:plc:.../app.bsky.feed.post/...",
"cid": "bafyrei...",
"author": {
"did": "did:plc:...",
"handle": "username.bsky.social",
"displayName": "Display Name",
"avatar": "https://..."
},
"text": "Full post text",
"createdAt": "2026-04-15T12:34:56.000Z",
"indexedAt": "2026-04-15T12:34:57.000Z",
"langs": ["en"],
"likeCount": 42,
"repostCount": 7,
"replyCount": 3,
"quoteCount": 1,
"embed": { /* images, video, quoted posts */ },
"tags": ["ai"],
"mentions": ["did:plc:..."],
"links": ["https://..."],
"isReply": false,
"replyParent": null,
"replyRoot": null,
"bskyUrl": "https://bsky.app/profile/username.bsky.social/post/abc"
}

๐Ÿ’ก Why authentication?

Bluesky's public API rate-limits anonymous requests aggressively from datacenter IPs. Adding your own free Bluesky App Password (NOT your main password โ€” App Passwords are revocable single-purpose tokens) lifts limits and gives reliable, full-speed scraping. Your credentials are never stored โ€” they're passed only to Bluesky's official servers per run.


โ“ FAQ

Is this legal? Bluesky's public API is open by design โ€” the AT Protocol is built around portable, public data. This Actor uses official endpoints only.

Will my account get banned? No. App Passwords are intended for read access. Treat normal rate limits and you'll be fine.

Can I run this on a schedule? Yes โ€” use Apify's Schedule feature to run hourly/daily.

How fast is it? Authenticated runs typically pull ~3,000 posts/minute.


Got questions or feature requests? Open an issue or message us.