cdnjs Libraries Catalog Scraper
Pricing
from $27.00 / 1,000 results
cdnjs Libraries Catalog Scraper
Scrape the cdnjs library catalog. Browse 4,000+ open-source front-end JavaScript and CSS libraries with versions, file lists, SRI hashes, and repository links. Modes: full catalogue, keyword search, single-library detail.
Pricing
from $27.00 / 1,000 results
Rating
0.0
(0)
Developer
ParseForge
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
a day ago
Last modified
Categories
Share

☁️ cdnjs Libraries Scraper
🚀 Export the cdnjs front-end catalogue in seconds. Browse, search, or drill into any of 4,000+ open-source JavaScript and CSS libraries hosted on Cloudflare's free CDN, with versions, file lists, SRI hashes, and repository links in one run. No sign-up, no token plumbing, no manual paging.
🕒 Last updated: 2026-05-15 · 📊 13 fields per record · ☁️ 4,000+ libraries · 🌐 200B+ requests/month · 🔁 Live per run
The cdnjs Libraries Scraper queries the official cdnjs catalogue and returns 13 fields per library, including name, description, latest version, homepage, repository, license, author, keywords, versions, sample file paths, and SRI hashes. cdnjs is a free, fast, and reliable open-source CDN operated by Cloudflare, serving over 200 billion requests per month for front-end developers worldwide.
The catalogue spans 4,000+ libraries including React, Vue, jQuery, Bootstrap, Tailwind, Three.js, D3, Lodash, and thousands more. This Actor streams matching records as CSV, Excel, JSON, or XML in under five minutes. Three run modes (catalogue, search, library) let you sweep the whole catalogue, filter by keyword, or pull a single library with full file listing and per-version SRI hashes.
| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| Front-end developers, security engineers, build engineers, OSS researchers, technical writers, DevOps teams, marketing analysts | Asset inventories, SRI hash collection, version tracking, dependency discovery, license audits, library popularity research |
📋 What the cdnjs Libraries Scraper does
Three run modes in a single Actor:
- 🌐 Catalogue mode. Sweep the entire 4,000+ library catalogue with the most useful metadata fields.
- 🔍 Search mode. Free-text keyword filter (e.g.
react,chart,tailwind) across the catalogue. - 📦 Library mode. Pull a single library with full version history, file list, and per-version SRI hashes.
Each catalogue record includes the library name, description, latest version, homepage, repository, license, author, and keywords. Library mode adds the complete versions array, a sample of hosted files for the latest version, and SRI hashes ready to drop into a <script integrity="..."> attribute.
💡 Why it matters: every front-end stack pulls libraries from cdnjs or a similar CDN. Knowing exactly which versions, files, and hashes are available is the difference between a fast, reproducible build and a broken Subresource Integrity check in production.
🎬 Full Demo
🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded cdnjs dataset.
⚙️ Input
| Input | Type | Default | Behavior |
|---|---|---|---|
mode | string | "catalogue" | One of catalogue, search, or library. |
query | string | "" | Keyword used in search mode. Ignored in other modes. |
libraryName | string | "" | Exact library slug for library mode (e.g. jquery, bootstrap). |
fields | string | "name,description,version,homepage,repository,license,author,keywords" | Comma-separated catalogue fields to request. |
maxItems | integer | 10 | Records to return. Free plan caps at 10, paid plan at 1,000,000. |
Example: search for chart-related libraries.
{"mode": "search","query": "chart","maxItems": 25,"fields": "name,description,version,homepage,repository,license,author,keywords"}
Example: deep dive on jQuery with file list and SRI hashes.
{"mode": "library","libraryName": "jquery"}
⚠️ Good to Know: library mode returns the full versions array (often 100+ entries for popular libraries) and a sample of hosted files for the latest version (up to 25 files). SRI hashes use SHA-512 and are ready to drop into a Subresource Integrity attribute. Library slugs are case-sensitive (
jquery, notjQuery).
📊 Output
Each cdnjs library record contains up to 13 fields. Download the dataset as CSV, Excel, JSON, or XML.
🧾 Schema
| Field | Type | Example |
|---|---|---|
📦 name | string | "jquery" |
🔗 url | string | "https://cdnjs.com/libraries/jquery" |
📝 description | string | "JavaScript library for DOM operations" |
🏷️ latestVersion | string | "3.7.1" |
🔗 homepage | string | null | "https://jquery.com/" |
🔗 repository | object | null | {"type":"git","url":"git+https://github.com/jquery/jquery.git"} |
📜 license | string | null | "MIT" |
👤 author | string | null | "JS Foundation and other contributors" |
🏷️ keywords | string[] | ["jquery","javascript","library"] |
📦 versions | string[] | ["1.2.3","...","3.7.1"] |
📦 fileSamples | string[] | ["jquery.min.js","jquery.js","jquery.slim.min.js"] |
🔐 sri | object | string | {"jquery.min.js":"sha512-..."} |
🕒 scrapedAt | ISO 8601 | "2026-05-15T00:00:00.000Z" |
📦 Sample records
✨ Why choose this Actor
| Capability | |
|---|---|
| ☁️ | Catalogue-wide reach. 4,000+ libraries served by cdnjs, one of the world's largest open-source CDNs. |
| 🎯 | Three run modes. Catalogue sweep, keyword search, or single-library deep dive in one Actor. |
| 🔐 | SRI hashes included. Drop-in Subresource Integrity attributes for every file in library mode. |
| ⚡ | Fast. 10 libraries in under 5 seconds, full catalogue sweep in 1-2 minutes. |
| 🔁 | Always fresh. Every run hits the live catalogue, so versions and file lists reflect the current CDN state. |
| 🚫 | No authentication. Public catalogue access. No login, no token. |
| 📥 | Multi-format export. CSV, Excel, JSON, or XML, ready for spreadsheets, warehouses, or pipelines. |
📊 Up-to-date CDN metadata is the foundation of every static-site build, security policy, and front-end performance audit on the modern web.
📈 How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| ⭐ cdnjs Libraries Scraper (this Actor) | $5 free credit, then pay-per-use | 4,000+ libraries | Live per run | catalogue, search, library | ⚡ 2 min |
| Hand-rolled catalogue client | Free + dev time | Full | Per request | Custom code | 🐢 Days |
| Package registry exports | Free | Per registry | Daily | Registry-specific | 🕒 Variable |
| Manual catalogue browsing | Free | One library at a time | Live | UI only | 🕒 Tedious |
Pick this Actor when you want a single dataset of CDN-hosted libraries with SRI hashes and version history, ready for analytics or build automation.
🚀 How to use
- 📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
- 🌐 Open the Actor. Go to the cdnjs Libraries Scraper page on the Apify Store.
- 🎯 Set input. Pick a run mode, enter a search query or library slug, and set
maxItems. - 🚀 Run it. Click Start and let the Actor pull your matching libraries.
- 📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.
⏱️ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.
💼 Business use cases
🔌 Automating cdnjs Libraries Scraper
Control the scraper programmatically for scheduled runs and pipeline integrations:
- 🟢 Node.js. Install the
apify-clientNPM package. - 🐍 Python. Use the
apify-clientPyPI package. - 📚 See the Apify API documentation for full details.
The Apify Schedules feature lets you trigger this Actor on any cron interval. Hourly, daily, or weekly refreshes keep build manifests and security dashboards in sync.
🌟 Beyond business use cases
The same structured records support research, education, civic projects, and personal initiatives.
🤖 Ask an AI assistant about this scraper
Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:
- 💬 ChatGPT
- 🧠 Claude
- 🔍 Perplexity
- 🅒 Copilot
❓ Frequently Asked Questions
🧩 How does it work?
Pick a run mode, enter a query or library slug, and click Start. The Actor pulls the matching records from the official catalogue and returns one structured record per library. No browser automation, no captchas, no setup.
📏 How accurate is the data?
Every field reflects what the official catalogue serves at run time. Version lists, files, and SRI hashes are pulled directly from the CDN manifest, so they match exactly what your <script> tag would load.
🔁 How often is the catalogue refreshed?
The catalogue is updated whenever new versions are published by upstream maintainers. Every run of this Actor pulls the latest state.
🔐 What are SRI hashes and why do I need them?
Subresource Integrity hashes let your <script> and <link> tags verify the file hasn't been tampered with. Library mode returns SHA-512 hashes for each file, ready to drop into the integrity attribute.
📂 Can I get the file list for any library?
Yes. Run the Actor in library mode with the exact library slug (e.g. jquery) to get up to 25 sample file paths for the latest version, plus the SRI hash for each.
⏰ Can I schedule regular runs?
Yes. Use Apify Schedules to run this Actor on any cron interval (hourly, daily, weekly) and keep a downstream database in sync.
⚖️ Is this data legal to use?
cdnjs publishes library metadata under an open licence. Always honour the underlying library licences when redistributing the actual files (the Actor returns metadata only).
💼 Can I use this data commercially?
Yes. Catalogue metadata is publicly accessible for analytics and discovery use. Each individual library is governed by its own licence, which you should review before bundling.
💳 Do I need a paid Apify plan to use this Actor?
No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you scheduling, higher concurrency, and larger datasets.
🔁 What happens if a run fails or gets interrupted?
Apify automatically retries transient errors. If a run still fails, you can inspect the log in the Runs tab, fix the input, and re-run. Partial datasets from failed runs are preserved so you never lose progress.
🆘 What if I need help?
Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.
🔌 Integrate with any app
cdnjs Libraries Scraper connects to any cloud service via Apify integrations:
- Make - Automate multi-step workflows
- Zapier - Connect with 5,000+ apps
- Slack - Get run notifications in your channels
- Airbyte - Pipe catalogue data into your warehouse
- GitHub - Trigger runs from commits and releases
- Google Drive - Export datasets straight to Sheets
You can also use webhooks to trigger downstream actions when a run finishes. Push fresh library data into your build pipeline, or alert your team in Slack on new versions of watched libraries.
🔗 Recommended Actors
- 📦 NuGet Packages Scraper - .NET package catalogue and download history
- 🦀 Crates.io Scraper - Rust crate metadata and downloads
- 📦 Hex.pm Elixir Packages Scraper - Elixir/Erlang package catalogue
- 🎮 Modrinth Mods Scraper - Minecraft mods, modpacks, shaders, resource packs
- 🐳 Docker Hub Scraper - Container image metadata and pulls
💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.
🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.
⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by cdnjs, Cloudflare, or any of the catalogue's library maintainers. All trademarks mentioned are the property of their respective owners. Only publicly available catalogue metadata is collected.