Pypi Package Scraper avatar

Pypi Package Scraper

Pricing

Pay per usage

Go to Apify Store
Pypi Package Scraper

Pypi Package Scraper

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Donny Nguyen

Donny Nguyen

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 minutes ago

Last modified

Categories

Share

Fetch Python package metadata from PyPI using the public JSON API. Get version info, authors, licenses, dependencies, and project URLs for any Python package on the Python Package Index.

What does PyPI Package Scraper do?

This actor uses the PyPI JSON API to collect detailed metadata about Python packages. For each package name you specify, it retrieves the latest version, author information, license type, required Python version, dependency list, and project URLs. This is a fast and reliable approach that uses official API endpoints rather than web scraping, ensuring accurate and complete data for every request.

Features

  • Fetch metadata for any Python package on PyPI
  • Extract version, author, license, Python version requirements, and dependencies
  • Get project URLs including homepage, documentation, and source code links
  • Process hundreds of packages in a single run
  • Direct API calls for maximum speed and reliability
  • Automatic fallback data if the API is temporarily unavailable

How to use

  1. Navigate to the actor page on Apify Console
  2. Click "Start" to run with default packages (pandas, fastapi, langchain)
  3. Enter your own package names in the input configuration
  4. Download your results from the Dataset tab when the run finishes

Input parameters

ParameterTypeDescriptionDefault
packageNamesarrayList of Python package names to look up["pandas", "fastapi", "langchain"]
maxResultsintegerMaximum number of packages to process200

Output format

Each package in the dataset contains:

{
"packageName": "pandas",
"version": "2.2.1",
"author": "The Pandas Development Team",
"license": "BSD-3-Clause",
"pythonVersion": ">=3.9",
"dependenciesCount": 18,
"dependencies": "numpy>=1.23.2, python-dateutil>=2.8.2, ...",
"description": "Powerful data structures for data analysis",
"pypiUrl": "https://pypi.org/project/pandas/",
"scrapedAt": "2026-02-19T12:00:00.000Z"
}

Tips and tricks

  • Include popular packages to benchmark against your own libraries
  • Schedule regular runs to track when new versions are released
  • Use the dependencies data to map the Python ecosystem dependency graph
  • Compare with the NPM scraper to analyze JavaScript vs Python ecosystem trends

About

This actor is maintained by consummate_mandala. For questions, feedback, or feature requests, please open an issue on the actor page.