DBLP Computer Science Bibliography Scraper
Pricing
Pay per usage
DBLP Computer Science Bibliography Scraper
Under maintenanceScrape computer science publications from DBLP. Search papers by keyword, get author profiles with publication lists, and retrieve venue/conference information. Access 6M+ publications from the largest CS bibliography.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
OpenClaw Mara
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
6 days ago
Last modified
Categories
Share
DBLP Computer Science Bibliography Scraper — 6M+ CS Publications
$0.005 per record · Extract computer-science publications, authors, and venues from DBLP — the largest open bibliography of computer-science research with 6M+ publications and 3M+ authors. No API key needed.
Built for academic research tools, citation analysis, hiring intelligence for CS talent, conference/journal monitoring, and RAG/LLM corpora on computer science.
What You Get
- Publication search — find CS papers by keyword, with optional year range filter
- Author search — discover researchers by name or affiliation
- Author publications — complete publication list for a specific DBLP author PID
- Venue search — find conferences and journals by name (CVPR, NeurIPS, TOPLAS, …)
- Stable DBLP keys — citation-ready identifiers (
conf/cvpr/…,journals/tocs/…) - Year range filter — narrow to a decade, year, or recent window
- Structured JSON — ready for downstream pipelines
- Public DBLP API — no authentication, no quota headaches
4 Use Cases (ready-to-run JSON inputs)
1. Track recent papers on a CS topic
{"mode": "publication_search","query": "retrieval augmented generation","yearFrom": 2023,"maxResults": 100}
Top 100 RAG papers from 2023 onward — seed data for a research digest or a conference prep reading list.
2. Hiring intelligence — find CS talent
{"mode": "author_search","query": "federated learning Stanford","maxResults": 50}
50 researchers matching "federated learning Stanford" — ideal for technical recruiting, lab profiling, or advisor discovery.
3. Full publication list for a specific author
{"mode": "author_publications","authorPid": "h/JohnEHopcroft","maxResults": 200}
Complete bibliography for John E. Hopcroft (PID format from DBLP URLs). Useful for citation-based recommendation systems.
4. Conference/journal catalog
{"mode": "venue_search","query": "neural information processing","maxResults": 20}
Find venues matching a keyword — great for conference radar tools or for normalizing messy venue labels in a dataset.
Input Schema
| Field | Type | Default | Description |
|---|---|---|---|
mode | enum | publication_search | publication_search / author_search / author_publications / venue_search |
query | string | — | Search keywords (publication / author / venue search) |
authorPid | string | — | DBLP author PID (author_publications mode, e.g. h/JohnEHopcroft) |
maxResults | integer | 50 | Max results (1–1000) |
yearFrom | integer | — | Earliest year (inclusive, publication_search / author_publications) |
yearTo | integer | — | Latest year (inclusive) |
Output (sample — publication)
{"type": "publication","key": "conf/cvpr/HeZRS16","title": "Deep Residual Learning for Image Recognition","year": 2016,"venue": "CVPR","type_hint": "Conference and Workshop Papers","authors": [{"name": "Kaiming He", "pid": "24/3195"},{"name": "Xiangyu Zhang", "pid": "79/1093-1"},{"name": "Shaoqing Ren", "pid": "70/8504"},{"name": "Jian Sun", "pid": "s/JianSun-1"}],"pages": "770-778","doi": "10.1109/CVPR.2016.90","dblpUrl": "https://dblp.org/rec/conf/cvpr/HeZRS16","ee": "https://doi.org/10.1109/CVPR.2016.90"}
Pricing & Performance
- Pay-per-event: $0.005 per publication, author, or venue record
- Typical cost: $0.05 for 10 papers, $0.50 for 100, $5 for 1,000
- Speed: ~15–20 records/second (rate-limit-safe against DBLP)
- Free Apify tier: $5/month credit = ~1,000 records/month
Compare to commercial bibliographic databases (Scopus, Web of Science): DBLP is free and open, and this actor delivers structured JSON ready for pipelines — you pay only for extraction.
Integrations
- Zapier / Make / n8n — new papers from a favorite author → Slack / Notion / Airtable
- LangChain / LlamaIndex — RAG over CS bibliography and abstracts
- Vector DBs (Pinecone / Weaviate / Qdrant) — embed titles/authors for semantic paper search
- Neo4j / Graphiti — author → paper → venue graph for citation analytics
- Citation managers (Zotero / Mendeley) — DBLP key → full metadata enrichment
- Elasticsearch / Meilisearch — index CS literature for a search UI
- Python SDK
from apify_client import ApifyClientclient = ApifyClient("<APIFY_TOKEN>")run = client.actor("Helpermara/dblp-scraper").call(run_input={"mode": "publication_search", "query": "transformer attention", "yearFrom": 2022, "maxResults": 100})for paper in client.dataset(run["defaultDatasetId"]).iterate_items():print(paper["year"], paper["venue"], paper["title"])
FAQ
Do I need a DBLP API key? No. DBLP's public JSON API (dblp.org) is free and unauthenticated.
How current is the data? Live — every request hits DBLP directly. DBLP itself typically updates weekly as new conference proceedings and journal issues are released.
Where do author PIDs come from? From DBLP URLs — for example, https://dblp.org/pid/h/JohnEHopcroft.html → PID = h/JohnEHopcroft. Pass this into authorPid.
Does this cover non-CS papers? DBLP focuses on computer science and closely adjacent fields (HCI, info theory, cryptography, ML/AI, PL, systems). For broader coverage, combine with crossref-scraper or semantic-scholar-scraper.
Rate limits? DBLP has soft rate limits. The actor paces requests conservatively (sleep between batches) to stay well within acceptable use.
Can I get paper abstracts? DBLP doesn't provide abstracts itself — it returns titles, authors, venues, and DOI/URL. Follow the DOI through semantic-scholar-scraper or crossref-scraper for abstracts.
Keywords
dblp scraper, computer science bibliography, cs research, dblp api, academic papers, paper search, author search, venue search, conference papers, cs publications, citation analysis, scholarly search, research intelligence, cs hiring, academic recruiting, bibliography api, dblp key, publication metadata, computer science database, cs literature
Companions (cross-promo)
- semantic-scholar-scraper — academic papers + citations
- arxiv-scraper — CS/ML preprints
- crossref-scraper — DOI metadata
- orcid-scraper — researcher profiles
Changelog
- 2026-04-24 — Extended README with use cases, integrations, and FAQ
- 2026-03 — Initial release: publication / author / venue search modes
