# Glama MCP Registry Scraper - 23K Model Context Protocol Servers (`jungle_synthesizer/glama-mcp-registry-scraper`) Actor

Scrape Glama's MCP registry for 23K+ Model Context Protocol servers. Returns metadata, tool/resource/prompt schemas, install commands, attributes (official, remote-capable), source repos, and language. Supports full catalog, search, and single-server modes. Pairs with Smithery MCP scraper.

- **URL**: https://apify.com/jungle_synthesizer/glama-mcp-registry-scraper.md
- **Developed by:** [BowTiedRaccoon](https://apify.com/jungle_synthesizer) (community)
- **Categories:** AI, Developer tools, MCP servers
- **Stats:** 1 total users, 1 monthly users, 0.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per event

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action that can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Glama MCP Registry Scraper

Scrape the [Glama MCP registry](https://glama.ai/mcp/servers) — the largest open catalog of Model Context Protocol servers with 23,000+ entries — via the site's public REST API. Returns server metadata, tool schemas, attributes (official, remote-capable), namespaces, source repositories, licenses, and environment-variable schemas. Pairs cleanly with the Smithery MCP scraper for full ecosystem coverage.

---

### Glama MCP Scraper Features

- Crawls Glama's full MCP server catalog — 23,000+ servers indexed
- Three modes: full catalog dump, search-by-query, or single server detail
- Returns tool schemas exposed by each server, not just names
- Captures Glama attributes (`author:official`, `hosting:remote-capable`, etc.) for filtering
- Includes source repository URL, SPDX license, namespace, and slug per server
- Returns environment-variable JSON schemas as a JSON string, ready to drop into a config UI
- Cursor-paginated through the public API — runs to completion without manual pagination
- No API key, no proxy, no captcha

---

### Who Uses MCP Registry Data?

- **MCP client developers** — Build a server picker UI that lists every available MCP server with metadata
- **AI agent platforms** — Pre-index the MCP ecosystem and surface relevant servers to users at runtime
- **Security researchers** — Audit server attributes, source repos, and license distribution across the open MCP catalog
- **MCP framework maintainers** — Monitor namespace growth, attribute distribution, and tool-schema patterns
- **Developer-tool indexers** — Add MCP servers to broader AI-tooling search platforms with first-class metadata
- **Internal AI ops teams** — Maintain an allow-list of vetted MCP servers with full schemas for self-hosted deployment

---

### How the Glama MCP Scraper Works

1. **Pick a mode** — `catalog` for the full registry, `search` for a query-filtered slice, or `server_detail` for a single server by ID
2. **The scraper hits the Glama REST API** — Cursor-paginates 100 servers per page through the public endpoint
3. **Optionally filters by attribute or primary language** — Pass `author:official` or `Python` to narrow the result set
4. **Returns one record per server** with metadata, tools, attributes, repo URL, license, and env-var schema

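The API is public and unauthenticated. The scraper waits 200 ms between requests: polite enough not to draw attention, fast enough to walk the full catalog without setting up camp. At 100 servers per page, 23,000 servers is about 230 requests, so the delays add under a minute to a full listing pass.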

---

### Input

```json
{
  "mode": "catalog",
  "maxItems": 500
}
````

| Field | Type | Default | Description |
|-------|------|---------|-------------|
| mode | string | catalog | One of `catalog`, `search`, `server_detail`. |
| query | string | "" | Search query for `search` mode (server name, description, or tags). |
| serverId | string | "" | Glama server ID for `server_detail` mode. |
| attributeFilter | string | "" | Filter by Glama attribute (e.g. `author:official`, `hosting:remote-capable`). |
| languageFilter | string | "" | Filter by primary language (Python, TypeScript, Go, Rust). |
| maxItems | integer | 500 | Maximum servers to return. Set to 0 for unlimited. |
| proxyConfiguration | object | none | Proxy settings. Not required — the public API accepts datacenter IPs. |

#### Search by query

```json
{
  "mode": "search",
  "query": "github",
  "maxItems": 100
}
```

#### Single server detail

```json
{
  "mode": "server_detail",
  "serverId": "glama-server-id-here"
}
```
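The three input shapes above can be assembled by a small builder that catches mode mismatches before a run starts. This is a minimal sketch, not part of the Actor or the Apify client: `build_input` is a hypothetical helper, and it assumes that `search` needs a non-empty `query` and `server_detail` needs a non-empty `serverId`, which the field table implies but does not state as enforced. It also fills in `sp_intended_usage` and `sp_improvement_suggestions`, because the OpenAPI specification later in this document lists both as required.

```python
VALID_MODES = ("catalog", "search", "server_detail")


def build_input(mode="catalog", query="", server_id="", attribute_filter="",
                language_filter="", max_items=500,
                intended_usage="Describe your intended use...",
                improvement_suggestions="Share your suggestions here..."):
    """Hypothetical helper: assemble an Actor input object from the documented fields."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    # Assumption: each mode needs its own key field to be non-empty.
    if mode == "search" and not query:
        raise ValueError("search mode needs a non-empty query")
    if mode == "server_detail" and not server_id:
        raise ValueError("server_detail mode needs a serverId")
    if max_items < 0:
        raise ValueError("maxItems must be 0 (unlimited) or a positive integer")
    return {
        "sp_intended_usage": intended_usage,
        "sp_improvement_suggestions": improvement_suggestions,
        "mode": mode,
        "query": query,
        "serverId": server_id,
        "attributeFilter": attribute_filter,
        "languageFilter": language_filter,
        "maxItems": max_items,
    }


search_input = build_input(mode="search", query="github", max_items=100)
print(search_input["mode"], search_input["query"])  # search github
```

The returned dictionary can be passed as `run_input` to the Python client call shown in the API section.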

***

### Glama MCP Scraper Output Fields

```json
{
  "server_id": "abc123",
  "name": "GitHub MCP Server",
  "description": "Read-only MCP server for GitHub repositories, issues, and pull requests.",
  "attributes": ["author:official", "hosting:remote-capable"],
  "tools": ["search_repos|Search GitHub repositories", "get_issue|Fetch an issue by number"],
  "resources": [],
  "prompts": [],
  "install_command": null,
  "source_repo_url": "https://github.com/example/github-mcp",
  "homepage_url": null,
  "license": "MIT",
  "language": null,
  "star_count": null,
  "namespace": "github",
  "slug": "github-mcp",
  "env_schema_json": "{\"type\":\"object\",\"properties\":{\"GITHUB_TOKEN\":{\"type\":\"string\"}}}",
  "server_url": "https://glama.ai/mcp/servers/abc123"
}
```
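Two fields in the sample record are packed as strings: each `tools` entry joins a name and a description with a pipe, and `env_schema_json` holds a JSON document as text. The sketch below shows one way to unpack both. `parse_record` is a hypothetical helper, and treating a null or empty `env_schema_json` as "no schema" is an assumption, since the README does not say what the field contains when a server declares no variables.

```python
import json


def parse_record(record):
    """Split `tools` entries and decode `env_schema_json` from one output record."""
    tools = []
    for entry in record.get("tools") or []:
        # Split on the first pipe only, so a description may itself contain "|".
        name, _, description = entry.partition("|")
        tools.append({"name": name, "description": description})
    raw_schema = record.get("env_schema_json")
    # Assumption: the field may be null or empty when no variables are declared.
    env_schema = json.loads(raw_schema) if raw_schema else None
    return {"tools": tools, "env_schema": env_schema}


sample = {
    "tools": ["search_repos|Search GitHub repositories", "get_issue|Fetch an issue by number"],
    "env_schema_json": "{\"type\":\"object\",\"properties\":{\"GITHUB_TOKEN\":{\"type\":\"string\"}}}",
}
parsed = parse_record(sample)
print(parsed["tools"][0]["name"])                  # search_repos
print(sorted(parsed["env_schema"]["properties"]))  # ['GITHUB_TOKEN']
```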

| Field | Type | Description |
|-------|------|-------------|
| server\_id | string | Glama server ID. |
| name | string | Server name. |
| description | string | Server description. |
| attributes | array | Glama attributes (e.g. `hosting:remote-capable`, `author:official`). |
| tools | array | Tools exposed by this server, formatted as `name\|description`. |
| resources | array | Resources exposed by this server. |
| prompts | array | Prompts exposed by this server. |
| install\_command | string | Install command (when published by the server author). |
| source\_repo\_url | string | Source repository URL. |
| homepage\_url | string | Homepage URL. |
| license | string | SPDX license name. |
| language | string | Primary language inferred from the repository. |
| star\_count | integer | GitHub star count, when linked. |
| namespace | string | Glama namespace (author or org). |
| slug | string | Glama server slug. |
| env\_schema\_json | string | Environment-variable JSON schema as a JSON string. |
| server\_url | string | Glama server page URL. |

***

### FAQ

#### How do I scrape the Glama MCP registry?

Glama MCP Scraper hits the public REST API at `glama.ai/api/mcp/v1/servers` and cursor-paginates through the catalog. No auth, no proxy, no captcha. Pick a mode, set `maxItems`, and run.
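
For readers who want to see what cursor pagination with a `maxItems` cap looks like, here is a minimal sketch of the loop. It is not the Actor's source code. The README does not document the response shape of the Glama API, so the network call is left as a caller-supplied `fetch_page` function, and the page data and cursor values below are made up for illustration.

```python
import time


def crawl(fetch_page, max_items=500, delay_s=0.2, sleep=time.sleep):
    """Walk a cursor-paginated listing until it ends or max_items is reached.

    fetch_page(cursor) is supplied by the caller and returns (items, next_cursor);
    next_cursor is None on the last page. max_items=0 means unlimited.
    """
    collected = []
    cursor = None
    while True:
        items, cursor = fetch_page(cursor)
        collected.extend(items)
        if max_items and len(collected) >= max_items:
            return collected[:max_items]
        if cursor is None:
            return collected
        sleep(delay_s)  # pause between requests, mirroring the 200 ms delay


# Stand-in for the real API: three pages keyed by made-up cursors.
FAKE_PAGES = {
    None: (["server-a", "server-b"], "c1"),
    "c1": (["server-c", "server-d"], "c2"),
    "c2": (["server-e"], None),
}

print(len(crawl(FAKE_PAGES.get, max_items=0, delay_s=0)))  # 5
print(crawl(FAKE_PAGES.get, max_items=3, delay_s=0))       # ['server-a', 'server-b', 'server-c']
```

Keeping the fetch step injectable means the loop can be checked offline, and the cap is applied as soon as enough items arrive, so no further page is requested.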

#### How many MCP servers can I get?

Glama MCP Scraper covers 23,000+ servers — every entry in the Glama registry at crawl time. Set `maxItems` to 0 to pull the full catalog or pass a number to sample.

#### Can I filter to only official or remote-capable servers?

Glama MCP Scraper accepts an `attributeFilter` like `author:official` or `hosting:remote-capable`. The filter is applied during the crawl, so you don't pay for records you'll discard.
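
The `attributeFilter` input is a single string, and this README does not say whether two attributes can be combined in one run. One workaround, sketched below under that assumption, is to crawl with one attribute and then keep only the records whose `attributes` array contains every attribute you need. `has_all_attributes` is a hypothetical helper, and the records are made-up examples in the documented output shape.

```python
def has_all_attributes(record, required):
    """Return True when the record carries every attribute in `required`."""
    return set(required) <= set(record.get("attributes") or [])


records = [
    {"server_id": "a", "attributes": ["author:official", "hosting:remote-capable"]},
    {"server_id": "b", "attributes": ["author:official"]},
    {"server_id": "c", "attributes": []},
]
wanted = ["author:official", "hosting:remote-capable"]
matches = [r["server_id"] for r in records if has_all_attributes(r, wanted)]
print(matches)  # ['a']
```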

#### How much does the Glama MCP Scraper cost to run?

Glama MCP Scraper is priced per record returned via the pay-per-event model. A full-catalog run lands well under the cost of building and maintaining your own registry crawler.

#### Does this need proxies?

Glama MCP Scraper does not need proxies. The public API accepts datacenter IPs, and the only throttling is the 200 ms delay the scraper itself observes between requests.

***

### Need More Features?

Need additional fields, multi-source MCP registry merging (Smithery + Glama + others), or scheduled updates? [File an issue](https://console.apify.com/actors/issues) or get in touch.

### Why Use the Glama MCP Scraper?

- **Largest open registry** — Glama indexes 23K+ servers, well ahead of every other MCP catalog
- **Real tool schemas** — Returns the tools each server exposes with names and descriptions, not just a count. Useful when you want to surface capability summaries.
- **Clean JSON** — Normalized records with consistent field names, env-var schemas pre-stringified, ready to drop into a config UI or registry mirror

# Actor input Schema

## `sp_intended_usage` (type: `string`):

Please describe how you plan to use the data extracted by this crawler.

## `sp_improvement_suggestions` (type: `string`):

Provide any feedback or suggestions for improvements.

## `sp_contact` (type: `string`):

Provide your email address so we can get in touch with you.

## `mode` (type: `string`):

Catalog listing, query search, or single-server detail.

## `query` (type: `string`):

Search server name, description, or tags. Used in 'search' mode.

## `serverId` (type: `string`):

Single server ID. Used in 'server\_detail' mode.

## `attributeFilter` (type: `string`):

Filter by Glama attribute (e.g. 'author:official', 'hosting:remote-capable'). Leave blank for all.

## `languageFilter` (type: `string`):

Filter by primary language (Python, TypeScript, Go, Rust). Leave blank for all.

## `maxItems` (type: `integer`):

Maximum servers to return. 0 = unlimited.

## Actor input object example

```json
{
  "sp_intended_usage": "Describe your intended use...",
  "sp_improvement_suggestions": "Share your suggestions here...",
  "sp_contact": "Share your email here...",
  "mode": "catalog",
  "maxItems": 500
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "sp_intended_usage": "Describe your intended use...",
    "sp_improvement_suggestions": "Share your suggestions here...",
    "sp_contact": "Share your email here...",
    "mode": "catalog",
    "query": "",
    "serverId": "",
    "attributeFilter": "",
    "languageFilter": "",
    "maxItems": 500
};

// Run the Actor and wait for it to finish
const run = await client.actor("jungle_synthesizer/glama-mcp-registry-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "sp_intended_usage": "Describe your intended use...",
    "sp_improvement_suggestions": "Share your suggestions here...",
    "sp_contact": "Share your email here...",
    "mode": "catalog",
    "query": "",
    "serverId": "",
    "attributeFilter": "",
    "languageFilter": "",
    "maxItems": 500,
}

# Run the Actor and wait for it to finish
run = client.actor("jungle_synthesizer/glama-mcp-registry-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "sp_intended_usage": "Describe your intended use...",
  "sp_improvement_suggestions": "Share your suggestions here...",
  "sp_contact": "Share your email here...",
  "mode": "catalog",
  "query": "",
  "serverId": "",
  "attributeFilter": "",
  "languageFilter": "",
  "maxItems": 500
}' |
apify call jungle_synthesizer/glama-mcp-registry-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=jungle_synthesizer/glama-mcp-registry-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Glama MCP Registry Scraper - 23K Model Context Protocol Servers",
        "description": "Scrape Glama's MCP registry for 23K+ Model Context Protocol servers. Returns metadata, tool/resource/prompt schemas, install commands, attributes (official, remote-capable), source repos, and language. Supports full catalog, search, and single-server modes. Pairs with Smithery MCP scraper.",
        "version": "0.1",
        "x-build-id": "jyM07KukhfgyhVbhI"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/jungle_synthesizer~glama-mcp-registry-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-jungle_synthesizer-glama-mcp-registry-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/jungle_synthesizer~glama-mcp-registry-scraper/runs": {
            "post": {
                "operationId": "runs-sync-jungle_synthesizer-glama-mcp-registry-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/jungle_synthesizer~glama-mcp-registry-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-jungle_synthesizer-glama-mcp-registry-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "sp_intended_usage",
                    "sp_improvement_suggestions"
                ],
                "properties": {
                    "sp_intended_usage": {
                        "title": "What is the intended usage of this data?",
                        "minLength": 1,
                        "type": "string",
                        "description": "Please describe how you plan to use the data extracted by this crawler."
                    },
                    "sp_improvement_suggestions": {
                        "title": "How can we improve this crawler for you?",
                        "minLength": 1,
                        "type": "string",
                        "description": "Provide any feedback or suggestions for improvements."
                    },
                    "sp_contact": {
                        "title": "Contact Email",
                        "minLength": 1,
                        "type": "string",
                        "description": "Provide your email address so we can get in touch with you."
                    },
                    "mode": {
                        "title": "Mode",
                        "enum": [
                            "catalog",
                            "server_detail",
                            "search"
                        ],
                        "type": "string",
                        "description": "Catalog listing or single-server detail."
                    },
                    "query": {
                        "title": "Search Query",
                        "type": "string",
                        "description": "Search server name, description, or tags. Used in 'search' mode."
                    },
                    "serverId": {
                        "title": "Server ID",
                        "type": "string",
                        "description": "Single server ID. Used in 'server_detail' mode."
                    },
                    "attributeFilter": {
                        "title": "Attribute Filter",
                        "type": "string",
                        "description": "Filter by Glama attribute (e.g. 'author:official', 'hosting:remote-capable'). Leave blank for all."
                    },
                    "languageFilter": {
                        "title": "Primary Language",
                        "type": "string",
                        "description": "Filter by primary language (Python, TypeScript, Go, Rust). Leave blank for all."
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "type": "integer",
                        "description": "Maximum servers to return. 0 = unlimited."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
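For projects that call the REST API directly, the `run-sync-get-dataset-items` path in the specification above can be turned into a request with the Python standard library alone. The sketch below only builds the request and does not send it. `build_request` is a hypothetical helper, and the note about decoding the response as JSON assumes the endpoint's default output format.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/jungle_synthesizer~glama-mcp-registry-scraper/run-sync-get-dataset-items"


def build_request(token, actor_input):
    """Build (but do not send) the POST request described by the spec."""
    # The spec passes the token as a required query parameter.
    query = urllib.parse.urlencode({"token": token})
    body = json.dumps(actor_input).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}{ACTOR_PATH}?{query}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_request("<YOUR_API_TOKEN>", {
    "sp_intended_usage": "Describe your intended use...",
    "sp_improvement_suggestions": "Share your suggestions here...",
    "mode": "catalog",
    "maxItems": 500,
})
print(req.get_method())  # POST

# To send it (assumes the response body is a JSON array of dataset items):
# with urllib.request.urlopen(req) as resp:
#     items = json.load(resp)
```

Because this endpoint waits for the run to finish, it suits small `maxItems` values; for a full-catalog crawl, the asynchronous `/runs` path in the same specification is likely the safer choice.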
