
BlueSky Feed Scraper
Pay $5.00 for 1,000 results

BlueSky Feed Scraper
Pay $5.00 for 1,000 results
Scrapes data from a specified BlueSky feed URL and outputs detailed information about the posts, including metadata, authors, embedded media, and statistics such as likes, replies, and reposts.
Actor Metrics
5 monthly users
No reviews yet
1 bookmark
>99% runs succeeded
Created in Nov 2024
Modified 2 months ago
Bluesky Feed Scraper for Apify
This is an Apify actor that scrapes data from a specified Bluesky feed URL and outputs detailed information about the posts, including metadata, authors, embedded media, and statistics such as likes, replies, and reposts.
Features
- Scrapes Bluesky feed posts from a given feed URL.
- Extracts detailed post data, including:
- Author details (DID, handle, display name, avatar URL, etc.).
- Post text, tags, and languages.
- Embedded images, with metadata (alt text, aspect ratio, URLs).
- Engagement statistics (likes, replies, reposts, quotes).
- Thread and reply information.
- Record metadata, including creation and indexing timestamps.
Input
The actor requires the following input:
Field | Type | Description |
---|---|---|
url | String | The URL of the Bluesky feed you want to scrape. Example: https://bsky.app/profile/username/feed . |
Example Input
1{ 2 "url": "https://bsky.app/profile/c3rmen.bsky.social/feed" 3}
Output
The actor produces a JSON array where each object represents a post from the feed. The structure includes:
uri
andcid
: Unique identifiers for the post.author
: Details about the author (DID, handle, avatar, etc.).record
: Post text, tags, languages, and embedded media.embed
: View-ready image metadata (e.g., thumbnails, full-size URLs).- Engagement metrics (
replyCount
,repostCount
,likeCount
,quoteCount
). - Thread and reply-related data.
- Timestamps (
createdAt
,indexedAt
).
Example Output
1[ 2 { 3 "uri": "at://did:plc:z72i7hdynmk6r22z27h6tvur/app.bsky.feed.post/3lbsizxfxa22r", 4 "cid": "bafyreifohcetdw6e5mudaz6anigzsm5ssjpm3oreyxu4a2l665k7hpxo4q", 5 "author": { 6 "did": "did:plc:z72i7hdynmk6r22z27h6tvur", 7 "handle": "bsky.app", 8 "displayName": "Bluesky", 9 "avatar": "https://cdn.bsky.app/img/avatar/plain/did:plc:z72i7hdynmk6r22z27h6tvur/bafkreihagr2cmvl2jt4mgx3sppwe2it3fwolkrbtjrhcnwjk4jdijhsoze@jpeg", 10 "associated": { 11 "chat": { 12 "allowIncoming": "none" 13 } 14 }, 15 "labels": [], 16 "createdAt": "2023-04-12T04:53:57.057Z" 17 }, 18 "record": { 19 "createdAt": "2024-11-25T21:52:30.840Z", 20 "embed": { 21 "external": { 22 "description": "Bluesky is social media as it should be. Find your community among millions of users, unleash your creativity, and have some fun again. https://bsky.app", 23 "thumb": { 24 "ref": { 25 "$link": "bafkreihh7dthuxfqel6zwcmxapcu47tr34rat7thjtxlfmrwidvxfsmqne" 26 }, 27 "mimeType": "image/jpeg", 28 "size": 384236, 29 "$type": "blob" 30 }, 31 "title": "BlueskySocial - Twitch", 32 "uri": "https://www.twitch.tv/blueskysocial" 33 }, 34 "$type": "app.bsky.embed.external" 35 }, 36 "facets": [ 37 { 38 "features": [ 39 { 40 "did": "did:plc:qjeavhlw222ppsre4rscd3n2", 41 "$type": "app.bsky.richtext.facet#mention" 42 } 43 ], 44 "index": { 45 "byteEnd": 55, 46 "byteStart": 40 47 }, 48 "$type": "app.bsky.richtext.facet" 49 }, 50 { 51 "features": [ 52 { 53 "did": "did:plc:ragtjsm2j2vknwkz3zp4oxrd", 54 "$type": "app.bsky.richtext.facet#mention" 55 } 56 ], 57 "index": { 58 "byteEnd": 76, 59 "byteStart": 64 60 }, 61 "$type": "app.bsky.richtext.facet" 62 }, 63 { 64 "features": [ 65 { 66 "did": "did:plc:4ewnpnebeh7zuk5pbardaxqz", 67 "$type": "app.bsky.richtext.facet#mention" 68 } 69 ], 70 "index": { 71 "byteEnd": 226, 72 "byteStart": 203 73 }, 74 "$type": "app.bsky.richtext.facet" 75 } 76 ], 77 "langs": [ 78 "en" 79 ], 80 "text": "Join us for another livestream with COO @rose.bsky.team and CTO @pfrazee.com, where they'll share team updates, the story of how Bluesky began, and what’s next. \n\nPlus, a special guest appearance from @flavorflav.bsky.social! 🎉\n\nToday 11/25 @ 5 pm PT / 8 pm ET / 1 am GMT / 10am JST", 81 "$type": "app.bsky.feed.post" 82 }, 83 "embed": { 84 "external": { 85 "uri": "https://www.twitch.tv/blueskysocial", 86 "title": "BlueskySocial - Twitch", 87 "description": "Bluesky is social media as it should be. Find your community among millions of users, unleash your creativity, and have some fun again. https://bsky.app", 88 "thumb": "https://cdn.bsky.app/img/feed_thumbnail/plain/did:plc:z72i7hdynmk6r22z27h6tvur/bafkreihh7dthuxfqel6zwcmxapcu47tr34rat7thjtxlfmrwidvxfsmqne@jpeg" 89 }, 90 "$type": "app.bsky.embed.external#view" 91 }, 92 "replyCount": 324, 93 "repostCount": 1041, 94 "likeCount": 9147, 95 "quoteCount": 84, 96 "indexedAt": "2024-11-25T21:52:35.058Z", 97 "labels": [] 98 }, 99 // ...more posts 100]
Usage
- Deploy the Actor: Use the Apify console to set up and deploy this actor.
- Provide Input: Supply the
url
in the input configuration. - Run the Actor: Start the actor, and it will scrape the feed URL and return the posts as JSON.
Notes
- Ensure the
url
is publicly accessible. - The actor fetches only visible posts; private or restricted feeds will not be included.
Feel free to suggest additional features or report any issues! 🚀