# Bluesky Scraper - Posts, Profiles & Followers (`ambitious_door/bluesky-scraper`) Actor

Extract posts, profiles, followers, and search results from Bluesky. Uses the open AT Protocol API — no proxy needed, no anti-bot issues.

- **URL**: https://apify.com/ambitious_door/bluesky-scraper.md
- **Developed by:** [C. K.](https://apify.com/ambitious_door) (community)
- **Categories:** Social media
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor itself is free to use; you pay only for the Apify platform usage, which gets cheaper on higher subscription plans.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action that can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with a capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Bluesky Scraper — Extract Posts, Profiles & Followers

Extract posts, profiles, followers, following lists, and search results from Bluesky via the open AT Protocol. No proxy needed, no anti-bot issues — Bluesky's protocol is fully open. Pay per item scraped.

### What it does

Bluesky runs on the AT Protocol, a fully open and decentralized social networking protocol. This means all public data (posts, profiles, follower graphs) is accessible through documented API endpoints without authentication barriers or anti-bot measures.

Give it a Bluesky handle. It pulls the data you need — posts with full engagement metrics, detailed profiles, or complete follower/following lists — and returns clean, structured JSON ready for your pipeline.
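
As a rough illustration of what "documented API endpoints" means in practice, the sketch below builds the URL for a public, unauthenticated XRPC call that lists an account's posts. `app.bsky.feed.getAuthorFeed` and the `public.api.bsky.app` host belong to Bluesky's public API; whether this Actor calls that exact host is an assumption, and `author_feed_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Public AppView host. Assumption: the Actor's real host is not documented here.
PUBLIC_API = "https://public.api.bsky.app/xrpc"


def author_feed_url(handle: str, limit: int = 50) -> str:
    """Build the URL for app.bsky.feed.getAuthorFeed for one handle."""
    # The endpoint accepts between 1 and 100 posts per request.
    if not 1 <= limit <= 100:
        raise ValueError("limit must be between 1 and 100 per request")
    query = urlencode({"actor": handle, "limit": limit})
    return f"{PUBLIC_API}/app.bsky.feed.getAuthorFeed?{query}"


print(author_feed_url("bsky.app", limit=50))
```

Fetching that URL with any HTTP client returns a JSON page of posts plus a cursor for the next page, with no login step.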

### Output format

#### Posts

| Field | Type | Description |
|-------|------|-------------|
| `type` | string | Always `"post"` |
| `uri` | string | AT Protocol URI |
| `author_handle` | string | Author's Bluesky handle |
| `author_display_name` | string | Author's display name |
| `text` | string | Post text content |
| `created_at` | string | ISO 8601 timestamp |
| `likes` | integer | Like count |
| `reposts` | integer | Repost count |
| `replies` | integer | Reply count |
| `quotes` | integer | Quote count |
| `images` | array | Image URLs |
| `links` | array | External link URLs |
| `labels` | array | Content labels |
| `language` | string | Post language code |
| `source_url` | string | Web URL on bsky.app |
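
The `uri` and `source_url` fields are related: a post's AT URI ends in a record key that also appears in its bsky.app link. The sketch below shows one way to derive the web URL from the other two fields. It is an illustration only; how the Actor actually fills `source_url` is an assumption, `post_web_url` is a hypothetical helper, and the DID and record key in the example are made up.

```python
def post_web_url(uri: str, author_handle: str) -> str:
    """Turn at://<did>/app.bsky.feed.post/<rkey> into a bsky.app link."""
    prefix = "at://"
    if not uri.startswith(prefix):
        raise ValueError(f"not an AT URI: {uri!r}")
    # Expected parts: repository (DID or handle), collection, record key.
    parts = uri[len(prefix):].split("/")
    if len(parts) != 3 or parts[1] != "app.bsky.feed.post":
        raise ValueError(f"not a post URI: {uri!r}")
    rkey = parts[2]
    return f"https://bsky.app/profile/{author_handle}/post/{rkey}"


# Placeholder DID and record key, for illustration only.
print(post_web_url("at://did:plc:example/app.bsky.feed.post/3kexample", "bsky.app"))
```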

#### Profiles

| Field | Type | Description |
|-------|------|-------------|
| `type` | string | Always `"profile"` |
| `handle` | string | Bluesky handle |
| `display_name` | string | Display name |
| `description` | string | Bio text |
| `followers_count` | integer | Number of followers |
| `following_count` | integer | Number following |
| `posts_count` | integer | Total posts |
| `avatar` | string | Avatar image URL |
| `source_url` | string | Profile URL on bsky.app |
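
Because every record carries a `type` field, a downstream script can split a mixed list of results without inspecting the other fields. The sketch below is a minimal example of that; `group_by_type` is a hypothetical helper, the sample values are invented, and the `type` value used for follower records is not documented above, so records without a recognized `type` simply land under their own key.

```python
from collections import defaultdict
from typing import Any


def group_by_type(items: list[dict[str, Any]]) -> dict[str, list[dict[str, Any]]]:
    """Group dataset items by their `type` field ("post", "profile", ...)."""
    grouped: dict[str, list[dict[str, Any]]] = defaultdict(list)
    for item in items:
        # Items without a `type` field are collected under "unknown".
        grouped[item.get("type", "unknown")].append(item)
    return dict(grouped)


# Invented sample records using field names from the tables above.
sample = [
    {"type": "post", "author_handle": "bsky.app", "likes": 3},
    {"type": "profile", "handle": "bsky.app", "followers_count": 10},
    {"type": "post", "author_handle": "bsky.app", "likes": 7},
]
groups = group_by_type(sample)
print({name: len(records) for name, records in groups.items()})
```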

### Input parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| `scrapeType` | string | `"posts"` | What to extract: `posts`, `profiles`, `followers`, `following`, `search`, or `thread` |
| `handles` | array | — | Bluesky handles to scrape (e.g. `["bsky.app"]`). For `thread`, provide AT URIs instead |
| `searchQuery` | string | — | Search keyword (only for `search` type, requires auth) |
| `maxItems` | integer | `100` | Maximum items to return (1 to 10,000) |
| `includeReplies` | boolean | `true` | Include replies in post feeds |
| `includeReposts` | boolean | `false` | Include reposts in post feeds |
| `blueskyHandle` | string | — | Your Bluesky handle (only needed for search) |
| `blueskyAppPassword` | string | — | App password (only needed for search) |
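
Checking an input object locally before starting a run can catch mistakes early. The sketch below encodes the rules from the table and from the input schema further down (allowed `scrapeType` values, `maxItems` between 1 and 10,000, credentials for search). `validate_input` is a hypothetical helper, and the rule that non-search runs need at least one entry in `handles` is an assumption rather than something the schema enforces.

```python
from typing import Any

SCRAPE_TYPES = {"posts", "profiles", "followers", "following", "search", "thread"}


def validate_input(actor_input: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the input looks usable."""
    problems: list[str] = []
    scrape_type = actor_input.get("scrapeType", "posts")
    if scrape_type not in SCRAPE_TYPES:
        problems.append(f"unknown scrapeType: {scrape_type!r}")
    max_items = actor_input.get("maxItems", 100)
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_items, bool) or not isinstance(max_items, int) or not 1 <= max_items <= 10000:
        problems.append("maxItems must be an integer between 1 and 10000")
    if scrape_type == "search":
        for key in ("searchQuery", "blueskyHandle", "blueskyAppPassword"):
            if not actor_input.get(key):
                problems.append(f"{key} is required for search")
    elif not actor_input.get("handles"):
        # Assumption: every non-search scrape type needs at least one handle or AT URI.
        problems.append("handles must contain at least one entry")
    return problems


print(validate_input({"scrapeType": "posts", "handles": ["bsky.app"], "maxItems": 50}))
```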

### Example usage

#### Get recent posts from a user

```json
{
    "scrapeType": "posts",
    "handles": ["bsky.app"],
    "maxItems": 50
}
````

#### Get profiles of multiple users

```json
{
    "scrapeType": "profiles",
    "handles": ["jay.bsky.team", "pfrazee.com", "why.bsky.team"]
}
```

#### Get followers of an account

```json
{
    "scrapeType": "followers",
    "handles": ["bsky.app"],
    "maxItems": 500
}
```

#### Search posts (requires authentication)

```json
{
    "scrapeType": "search",
    "searchQuery": "artificial intelligence",
    "maxItems": 100,
    "blueskyHandle": "yourname.bsky.social",
    "blueskyAppPassword": "your-app-password"
}
```

### Pricing

This Actor uses the **pay-per-event** model. You are charged per item (post, profile, or follower record) successfully scraped. No charge for failed or empty requests.

### Authentication

Most features work **without any authentication**. Bluesky's AT Protocol makes profiles, posts, and follower data publicly accessible.

**Search requires authentication.** To use the search feature:

1. Go to Bluesky Settings > App Passwords
2. Create a new app password
3. Enter your handle and app password in the input fields

Your credentials are used only for the current session and are never stored or logged.
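
On the calling side, it is usually safer to keep the app password out of source files. The sketch below builds a `search` input from environment variables; the variable names `BLUESKY_HANDLE` and `BLUESKY_APP_PASSWORD` and the helper `search_input` are hypothetical choices for this example, not something the Actor reads itself.

```python
import os
from typing import Mapping


def search_input(query: str, env: Mapping[str, str] = os.environ, max_items: int = 100) -> dict:
    """Build a `search` input, reading credentials from environment variables."""
    required = ("BLUESKY_HANDLE", "BLUESKY_APP_PASSWORD")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return {
        "scrapeType": "search",
        "searchQuery": query,
        "maxItems": max_items,
        "blueskyHandle": env["BLUESKY_HANDLE"],
        "blueskyAppPassword": env["BLUESKY_APP_PASSWORD"],
    }


# A literal mapping stands in for the real environment in this demo.
demo_env = {"BLUESKY_HANDLE": "yourname.bsky.social", "BLUESKY_APP_PASSWORD": "your-app-password"}
print(search_input("artificial intelligence", env=demo_env)["scrapeType"])
```

The resulting dictionary can be passed as the run input to any of the clients shown in the API section.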

### Responsible use

- Bluesky's AT Protocol is **designed for open access** — all public data is accessible by design.
- This Actor respects Bluesky's API rate limits (3,000 requests per 5 minutes).
- It does **not** bypass any authentication or access controls.
- **You are responsible** for ensuring your use complies with Bluesky's [Terms of Service](https://bsky.social/about/support/tos) and applicable laws. Only collect data you have the right to access and process.
- Be mindful of privacy — even though data is public, use it responsibly.
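
For readers who call Bluesky's API directly, the rate limit quoted above (3,000 requests per 5 minutes) can be respected with a sliding-window counter. The sketch below is one generic way to do that; it is not the Actor's actual throttling code, and `SlidingWindowLimiter` is a hypothetical class name.

```python
import time
from collections import deque
from typing import Callable


class SlidingWindowLimiter:
    """Allow at most `max_requests` calls in any `window_seconds` span."""

    def __init__(
        self,
        max_requests: int = 3000,
        window_seconds: float = 300.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock
        self._stamps: deque[float] = deque()

    def try_acquire(self) -> bool:
        """Record a request and return True, or return False if the window is full."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._stamps and now - self._stamps[0] >= self.window_seconds:
            self._stamps.popleft()
        if len(self._stamps) >= self.max_requests:
            return False
        self._stamps.append(now)
        return True


limiter = SlidingWindowLimiter()
if limiter.try_acquire():
    print("request allowed")
```

A caller that gets `False` back would typically sleep briefly and try again rather than drop the request.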

### Built with

- [AT Protocol](https://atproto.com/) — the open protocol behind Bluesky
- [httpx](https://www.python-httpx.org/) — async HTTP client
- [Apify SDK](https://docs.apify.com/sdk/python/) — Actor framework and pay-per-event billing

# Actor input Schema

## `scrapeType` (type: `string`):

What type of data to extract.

## `handles` (type: `array`):

Bluesky handles to scrape (e.g. jay.bsky.team). For thread scraping, provide AT URIs instead.

## `searchQuery` (type: `string`):

Search keyword or phrase (only used when scrapeType is 'search'). Requires authentication.

## `maxItems` (type: `integer`):

Maximum number of items to scrape.

## `includeReplies` (type: `boolean`):

Include replies when scraping posts.

## `includeReposts` (type: `boolean`):

Include reposts when scraping posts.

## `blueskyHandle` (type: `string`):

Your Bluesky handle for authenticated requests (required for search). Example: yourname.bsky.social

## `blueskyAppPassword` (type: `string`):

App password for authentication (create one at Settings > App Passwords in Bluesky). Required for search.

## Actor input object example

```json
{
  "scrapeType": "posts",
  "handles": [
    "bsky.app"
  ],
  "maxItems": 100,
  "includeReplies": true,
  "includeReposts": false
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
// scrapeType is marked as required in the input schema
const input = {
    "scrapeType": "posts",
    "handles": [
        "bsky.app"
    ]
};

// Run the Actor and wait for it to finish
const run = await client.actor("ambitious_door/bluesky-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
# scrapeType is marked as required in the input schema
run_input = {"scrapeType": "posts", "handles": ["bsky.app"]}

# Run the Actor and wait for it to finish
run = client.actor("ambitious_door/bluesky-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "scrapeType": "posts",
  "handles": [
    "bsky.app"
  ]
}' |
apify call ambitious_door/bluesky-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=ambitious_door/bluesky-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Bluesky Scraper - Posts, Profiles & Followers",
        "description": "Extract posts, profiles, followers, and search results from Bluesky. Uses the open AT Protocol API — no proxy needed, no anti-bot issues.",
        "version": "0.1",
        "x-build-id": "UMmpZWCzNL7kNVDZf"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/ambitious_door~bluesky-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-ambitious_door-bluesky-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/ambitious_door~bluesky-scraper/runs": {
            "post": {
                "operationId": "runs-sync-ambitious_door-bluesky-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/ambitious_door~bluesky-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-ambitious_door-bluesky-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "scrapeType"
                ],
                "properties": {
                    "scrapeType": {
                        "title": "Scrape Type",
                        "enum": [
                            "posts",
                            "profiles",
                            "followers",
                            "following",
                            "search",
                            "thread"
                        ],
                        "type": "string",
                        "description": "What type of data to extract.",
                        "default": "posts"
                    },
                    "handles": {
                        "title": "Bluesky Handles",
                        "type": "array",
                        "description": "Bluesky handles to scrape (e.g. jay.bsky.team). For thread scraping, provide AT URIs instead.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "searchQuery": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search keyword or phrase (only used when scrapeType is 'search'). Requires authentication."
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Maximum number of items to scrape.",
                        "default": 100
                    },
                    "includeReplies": {
                        "title": "Include Replies",
                        "type": "boolean",
                        "description": "Include replies when scraping posts.",
                        "default": true
                    },
                    "includeReposts": {
                        "title": "Include Reposts",
                        "type": "boolean",
                        "description": "Include reposts when scraping posts.",
                        "default": false
                    },
                    "blueskyHandle": {
                        "title": "Bluesky Handle (for auth)",
                        "type": "string",
                        "description": "Your Bluesky handle for authenticated requests (required for search). Example: yourname.bsky.social"
                    },
                    "blueskyAppPassword": {
                        "title": "App Password",
                        "type": "string",
                        "description": "App password for authentication (create one at Settings > App Passwords in Bluesky). Required for search."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
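
For projects that do not use an Apify client library, the `run-sync-get-dataset-items` path from the specification above can be called with plain HTTP. The sketch below only constructs the request, using the standard library; `build_run_sync_request` is a hypothetical helper, and error handling, timeouts, and retries are left out.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "ambitious_door~bluesky-scraper"


def build_run_sync_request(token: str, actor_input: dict) -> Request:
    """Build a POST request that runs the Actor and returns its dataset items."""
    # The specification passes the Apify token as a `token` query parameter.
    query = urlencode({"token": token})
    url = f"{API_BASE}/acts/{ACTOR_PATH}/run-sync-get-dataset-items?{query}"
    body = json.dumps(actor_input).encode("utf-8")
    return Request(url, data=body, headers={"Content-Type": "application/json"}, method="POST")


request = build_run_sync_request("<YOUR_API_TOKEN>", {"scrapeType": "posts", "handles": ["bsky.app"]})
print(request.get_method(), request.full_url.split("?")[0])
```

Passing the request to `urllib.request.urlopen` would start a real run and block until it finishes, so long scrapes may be better served by the asynchronous `/runs` path.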
