PyPI Scraper
Extract Python package metadata from PyPI — names, versions, authors, licenses, dependencies, and release history.
Developer: Stas Persiianenko
Pricing: Pay per event
Extract Python package metadata from PyPI — the Python Package Index. Get versions, dependencies, download stats, descriptions, classifiers, and project URLs for any package.
What does PyPI Scraper do?
PyPI Scraper looks up packages on PyPI by name and extracts comprehensive metadata for each one. It fetches data directly from the PyPI JSON API and pypistats.org for download statistics. No browser needed — the actor uses fast HTTP requests for reliable, efficient extraction.
For each package you get: current version, summary, author info, license, Python version requirements, all dependencies, classifiers, keywords, release count, monthly download numbers, and direct URLs.
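Most of these fields correspond directly to what the PyPI JSON API returns at `https://pypi.org/pypi/<name>/json`. As an illustration only (this is not the actor's source code, and the field mapping is an assumption), extracting them from a response dict could look like this:

```python
# Hypothetical sketch of mapping a PyPI JSON API response onto a flat
# metadata record. The structure of the "info" and "releases" keys follows
# the public PyPI JSON API; the output field names mirror this actor's schema.

def extract_metadata(api_response: dict) -> dict:
    """Map a PyPI JSON API response onto a flat metadata record."""
    info = api_response.get("info", {})
    return {
        "name": info.get("name"),
        "version": info.get("version"),
        "summary": info.get("summary"),
        "license": info.get("license"),
        "requiresPython": info.get("requires_python"),
        "dependencies": info.get("requires_dist") or [],
        "releaseCount": len(api_response.get("releases", {})),
    }

# Minimal fake response for illustration (not a real API call)
sample = {
    "info": {
        "name": "requests",
        "version": "2.32.5",
        "summary": "Python HTTP for Humans.",
        "license": "Apache-2.0",
        "requires_python": ">=3.9",
        "requires_dist": ["idna<4,>=2.5"],
    },
    "releases": {"2.32.4": [], "2.32.5": []},
}

record = extract_metadata(sample)
print(record["name"], record["version"], record["releaseCount"])
```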
Why use PyPI Scraper?
- Fast and reliable — uses official PyPI JSON API, no HTML scraping or browser rendering
- Download statistics — includes monthly download counts from pypistats.org
- Complete metadata — versions, dependencies, classifiers, license, Python requirements
- Bulk lookup — process hundreds of packages in a single run
- Structured output — clean JSON ready for analysis, dashboards, or integration
Use cases
- Dependency auditing — check versions, licenses, and Python compatibility across your stack
- Package popularity tracking — monitor download trends for competing libraries
- Supply chain analysis — map dependency trees and identify widely-used packages
- Market research — analyze the Python ecosystem for trends and opportunities
- License compliance — verify licenses across all packages in your organization
- Developer tooling — feed package metadata into internal tools, dashboards, or reports
How to use PyPI Scraper
- Go to the PyPI Scraper input page.
- Add package names to the Package names list.
- Click Start and wait for the run to finish.
- Download your data in JSON, CSV, or Excel format.
Input parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| packageNames | array | Yes | List of PyPI package names to look up (e.g., requests, flask, django) |
Example input
```json
{
  "packageNames": ["requests", "flask", "django", "numpy", "pandas"]
}
```
Output example
Each package returns a structured object with full metadata:
```json
{
  "name": "requests",
  "version": "2.32.5",
  "summary": "Python HTTP for Humans.",
  "author": "Kenneth Reitz",
  "authorEmail": "me@kennethreitz.org",
  "license": "Apache-2.0",
  "homePage": "https://requests.readthedocs.io",
  "projectUrl": "https://pypi.org/project/requests/",
  "requiresPython": ">=3.9",
  "dependencies": [
    "charset_normalizer<4,>=2",
    "idna<4,>=2.5",
    "urllib3<3,>=1.21.1",
    "certifi>=2017.4.17"
  ],
  "classifiers": [
    "Development Status :: 5 - Production/Stable",
    "Programming Language :: Python :: 3"
  ],
  "keywords": [],
  "releaseCount": 157,
  "downloadsLastMonth": 1141725900,
  "packageUrl": "https://pypi.org/project/requests/",
  "scrapedAt": "2026-03-03T05:21:06.098Z"
}
```
Output fields
| Field | Type | Description |
|---|---|---|
| name | string | Official package name on PyPI |
| version | string | Latest release version |
| summary | string | Short package description |
| author | string | Package author name |
| authorEmail | string | Author contact email |
| license | string | License identifier |
| homePage | string | Project home page URL |
| projectUrl | string | PyPI project page URL |
| requiresPython | string | Minimum Python version required |
| dependencies | array | List of package dependencies (requires_dist) |
| classifiers | array | PyPI trove classifiers |
| keywords | array | Package keywords |
| releaseCount | number | Total number of releases on PyPI |
| downloadsLastMonth | number | Download count in the last 30 days |
| packageUrl | string | Direct link to the PyPI project page |
| scrapedAt | string | ISO 8601 timestamp of extraction |
Pricing
PyPI Scraper uses pay-per-event pricing:
| Event | Price |
|---|---|
| Run started | $0.001 |
| Package extracted | $0.001 per package |
Cost examples
| Packages | Cost |
|---|---|
| 10 packages | $0.011 |
| 100 packages | $0.101 |
| 1,000 packages | $1.001 |
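The cost scales linearly: one "Run started" event plus one "Package extracted" event per package. A quick sanity check of the table above, assuming the listed prices:

```python
# Pay-per-event cost model from the pricing tables above:
# one flat run-start fee plus a per-package extraction fee.

RUN_STARTED = 0.001   # $ per run
PER_PACKAGE = 0.001   # $ per package extracted

def run_cost(num_packages: int) -> float:
    """Total cost in dollars for a single run over num_packages packages."""
    return round(RUN_STARTED + PER_PACKAGE * num_packages, 3)

for n in (10, 100, 1000):
    print(f"{n} packages -> ${run_cost(n)}")  # 0.011, 0.101, 1.001
```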
API usage
Python
```python
from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")

run = client.actor("YOUR_USERNAME/pypi-scraper").call(run_input={
    "packageNames": ["requests", "flask", "django"]
})

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(f"{item['name']} v{item['version']} — {item['downloadsLastMonth']:,} downloads/month")
```
Node.js
```javascript
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

const run = await client.actor('YOUR_USERNAME/pypi-scraper').call({
    packageNames: ['requests', 'flask', 'django'],
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach(item => {
    console.log(`${item.name} v${item.version} — ${item.downloadsLastMonth.toLocaleString()} downloads/month`);
});
```
Integrations
Connect PyPI Scraper to your workflow with Apify integrations:
- Webhooks — trigger actions when a run finishes
- Google Sheets — export package data to spreadsheets automatically
- Slack — get notifications about new package versions
- Zapier / Make — connect to 5,000+ apps and services
- REST API — call the actor programmatically from any language
Tips and best practices
- Use exact package names as they appear on PyPI (e.g., scikit-learn, not sklearn)
- Download statistics come from pypistats.org and may be slightly delayed
- The actor handles packages that don't exist gracefully — they're skipped with a warning
- For very large batches (1,000+ packages), consider splitting into multiple runs
- Keywords may be empty for many packages — check classifiers for categorization instead
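To cut down on skipped "package not found" lookups, input names can be pre-normalized the way PyPI itself does (PEP 503: lowercase, with runs of `-`, `_`, and `.` collapsed to a single hyphen). A small helper along these lines, offered as a pre-processing sketch rather than part of the actor:

```python
import re

def normalize_name(name: str) -> str:
    """PEP 503 name normalization: lowercase, collapse runs of -, _, . to a hyphen."""
    return re.sub(r"[-_.]+", "-", name).lower()

print(normalize_name("Scikit_Learn"))    # scikit-learn
print(normalize_name("zope.interface"))  # zope-interface
```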
Changelog
- v0.1 — Initial release with package metadata and download stats extraction