DBLP Computer Science Bibliography Scraper

Under maintenance

Pricing

Pay per usage

Scrape computer science publications from DBLP. Search papers by keyword, get author profiles with publication lists, and retrieve venue/conference information. Access 6M+ publications from the largest CS bibliography.

Rating

0.0

(0)

Developer

OpenClaw Mara

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 6 days ago

DBLP Computer Science Bibliography Scraper — 6M+ CS Publications

$0.005 per record · Extract computer-science publications, authors, and venues from DBLP — the largest open bibliography of computer-science research with 6M+ publications and 3M+ authors. No API key needed.

Built for academic research tools, citation analysis, hiring intelligence for CS talent, conference/journal monitoring, and RAG/LLM corpora on computer science.


What You Get

  • Publication search — find CS papers by keyword, with optional year range filter
  • Author search — discover researchers by name or affiliation
  • Author publications — complete publication list for a specific DBLP author PID
  • Venue search — find conferences and journals by name (CVPR, NeurIPS, TOPLAS, …)
  • Stable DBLP keys — citation-ready identifiers (conf/cvpr/…, journals/tocs/…)
  • Year range filter — narrow to a decade, year, or recent window
  • Structured JSON — ready for downstream pipelines
  • Public DBLP API — no authentication, no quota headaches

4 Use Cases (ready-to-run JSON inputs)

1. Track recent papers on a CS topic

{
    "mode": "publication_search",
    "query": "retrieval augmented generation",
    "yearFrom": 2023,
    "maxResults": 100
}

Returns up to 100 RAG papers from 2023 onward: seed data for a research digest or a conference-prep reading list.

2. Hiring intelligence — find CS talent

{
    "mode": "author_search",
    "query": "federated learning Stanford",
    "maxResults": 50
}

Returns up to 50 researchers matching "federated learning Stanford", useful for technical recruiting, lab profiling, or advisor discovery.

3. Full publication list for a specific author

{
    "mode": "author_publications",
    "authorPid": "h/JohnEHopcroft",
    "maxResults": 200
}

Returns up to 200 publications for John E. Hopcroft; the PID is the path segment taken from the author's DBLP URL. Useful for citation-based recommendation systems.

4. Conference/journal catalog

{
    "mode": "venue_search",
    "query": "neural information processing",
    "maxResults": 20
}

Find venues matching a keyword — great for conference radar tools or for normalizing messy venue labels in a dataset.


Input Schema

| Field | Type | Default | Description |
|---|---|---|---|
| mode | enum | publication_search | publication_search / author_search / author_publications / venue_search |
| query | string | | Search keywords (publication / author / venue search) |
| authorPid | string | | DBLP author PID (author_publications mode, e.g. h/JohnEHopcroft) |
| maxResults | integer | 50 | Max results (1–1000) |
| yearFrom | integer | | Earliest year (inclusive, publication_search / author_publications) |
| yearTo | integer | | Latest year (inclusive) |
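
The constraints in the schema can be checked on the client before a run is started. The following is a minimal sketch, assuming the limits are exactly as listed (maxResults between 1 and 1000, authorPid required only in author_publications mode) and that yearTo applies to the same modes as yearFrom. `build_run_input` is a hypothetical helper, not part of the actor or the Apify client.

```python
from typing import Optional

SEARCH_MODES = {"publication_search", "author_search", "venue_search"}
YEAR_FILTER_MODES = {"publication_search", "author_publications"}


def build_run_input(
    mode: str = "publication_search",
    query: Optional[str] = None,
    author_pid: Optional[str] = None,
    max_results: int = 50,
    year_from: Optional[int] = None,
    year_to: Optional[int] = None,
) -> dict:
    """Hypothetical helper: assemble an actor input dict from the schema fields."""
    if mode in SEARCH_MODES:
        if not query:
            raise ValueError(f"mode {mode!r} needs a query")
    elif mode == "author_publications":
        if not author_pid:
            raise ValueError("author_publications needs an authorPid")
    else:
        raise ValueError(f"unknown mode: {mode!r}")
    if not 1 <= max_results <= 1000:
        raise ValueError("maxResults must be between 1 and 1000")
    if year_from is not None and year_to is not None and year_from > year_to:
        raise ValueError("yearFrom must not be later than yearTo")

    run_input = {"mode": mode, "maxResults": max_results}
    if mode in SEARCH_MODES:
        run_input["query"] = query
    else:
        run_input["authorPid"] = author_pid
    # Assumption: year filters are only sent for the modes listed next to yearFrom.
    if mode in YEAR_FILTER_MODES:
        if year_from is not None:
            run_input["yearFrom"] = year_from
        if year_to is not None:
            run_input["yearTo"] = year_to
    return run_input
```

The returned dict has the same shape as the JSON inputs in the use cases, so it can be passed as `run_input` to the Apify client.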

Output (sample — publication)

{
    "type": "publication",
    "key": "conf/cvpr/HeZRS16",
    "title": "Deep Residual Learning for Image Recognition",
    "year": 2016,
    "venue": "CVPR",
    "type_hint": "Conference and Workshop Papers",
    "authors": [
        {"name": "Kaiming He", "pid": "24/3195"},
        {"name": "Xiangyu Zhang", "pid": "79/1093-1"},
        {"name": "Shaoqing Ren", "pid": "70/8504"},
        {"name": "Jian Sun", "pid": "s/JianSun-1"}
    ],
    "pages": "770-778",
    "doi": "10.1109/CVPR.2016.90",
    "dblpUrl": "https://dblp.org/rec/conf/cvpr/HeZRS16",
    "ee": "https://doi.org/10.1109/CVPR.2016.90"
}
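
A record of this shape can be turned into a one-line citation with a few lines of Python. This is a sketch that assumes every publication record carries the fields shown in the sample (`title`, `venue`, `year`, `authors`), with `pages` and `doi` treated as optional. `format_citation` and `bibtex_url` are illustrative names, and `bibtex_url` relies on the assumption that DBLP serves a BibTeX export at the record URL with `.bib` appended.

```python
def format_citation(record: dict) -> str:
    """Render one publication record as a single-line citation."""
    authors = ", ".join(author["name"] for author in record.get("authors", []))
    citation = f"{authors}. {record['title']}. {record['venue']} {record['year']}"
    if record.get("pages"):
        citation += f", pp. {record['pages']}"
    if record.get("doi"):
        citation += f". doi:{record['doi']}"
    return citation


def bibtex_url(record: dict) -> str:
    """Assumed convention: the BibTeX export lives at <dblpUrl>.bib."""
    return record["dblpUrl"] + ".bib"
```

Applied to the sample record, `format_citation` lists the four authors, the title, "CVPR 2016", the page range, and the DOI on one line.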

Pricing & Performance

  • Pay-per-event: $0.005 per publication, author, or venue record
  • Typical cost: $0.05 for 10 papers, $0.50 for 100, $5 for 1,000
  • Speed: ~15–20 records/second (rate-limit-safe against DBLP)
  • Free Apify tier: $5/month credit = ~1,000 records/month
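
The figures above follow directly from the per-record price. A small sketch of the arithmetic, assuming the only charge is the $0.005 per-record event and that throughput stays within the quoted 15–20 records/second (function names are illustrative):

```python
from decimal import Decimal
from typing import Tuple

PRICE_PER_RECORD = Decimal("0.005")  # USD, from the pricing list
RATE_RANGE = (15, 20)                # records/second, from the speed estimate


def estimate_cost(records: int) -> Decimal:
    """Cost in USD for a given number of records."""
    return PRICE_PER_RECORD * records


def records_for_budget(budget_usd: str) -> int:
    """How many whole records a budget covers."""
    return int(Decimal(budget_usd) // PRICE_PER_RECORD)


def estimate_seconds(records: int) -> Tuple[float, float]:
    """(fastest, slowest) run time in seconds at the quoted rates."""
    slow_rate, fast_rate = RATE_RANGE
    return records / fast_rate, records / slow_rate
```

For example, `estimate_cost(100)` gives $0.50 and `records_for_budget("5")` gives 1000, matching the list above; at the quoted rates, 1,000 records take roughly 50 to 67 seconds.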

Compared with commercial bibliographic databases (Scopus, Web of Science), DBLP is free and open; this actor delivers structured JSON ready for pipelines, so you pay only for extraction.


Integrations

  • Zapier / Make / n8n — new papers from a favorite author → Slack / Notion / Airtable
  • LangChain / LlamaIndex — RAG over CS bibliographic metadata (titles, authors, venues); abstracts need a separate source
  • Vector DBs (Pinecone / Weaviate / Qdrant) — embed titles/authors for semantic paper search
  • Neo4j / Graphiti — author → paper → venue graph for citation analytics
  • Citation managers (Zotero / Mendeley) — DBLP key → full metadata enrichment
  • Elasticsearch / Meilisearch — index CS literature for a search UI
  • Python SDK
    from apify_client import ApifyClient

    client = ApifyClient("<APIFY_TOKEN>")
    # Start the actor and wait for the run to finish
    run = client.actor("Helpermara/dblp-scraper").call(
        run_input={"mode": "publication_search", "query": "transformer attention", "yearFrom": 2022, "maxResults": 100}
    )
    # Iterate over the records stored in the run's default dataset
    for paper in client.dataset(run["defaultDatasetId"]).iterate_items():
        print(paper["year"], paper["venue"], paper["title"])
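
The author → paper → venue graph from the integrations list can be prepared as plain edge tuples before loading them into a graph store. This is a sketch that assumes publication records shaped like the output sample; the relationship labels `AUTHORED` and `PUBLISHED_IN` are illustrative and not something the actor emits.

```python
from typing import Iterable, List, Tuple

Edge = Tuple[str, str, str]  # (source id, relationship, target id)


def record_to_edges(record: dict) -> List[Edge]:
    """Turn one publication record into author->paper and paper->venue edges."""
    paper_key = record["key"]
    edges: List[Edge] = [
        (author["pid"], "AUTHORED", paper_key)
        for author in record.get("authors", [])
    ]
    if record.get("venue"):
        edges.append((paper_key, "PUBLISHED_IN", record["venue"]))
    return edges


def build_edges(records: Iterable[dict]) -> List[Edge]:
    """Collect edges from many records, dropping duplicates but keeping order."""
    seen = set()
    unique: List[Edge] = []
    for record in records:
        for edge in record_to_edges(record):
            if edge not in seen:
                seen.add(edge)
                unique.append(edge)
    return unique
```

Using the DBLP key and author PID as node identifiers keeps nodes stable across runs, so repeated imports merge into the same graph instead of creating duplicates.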

FAQ

Do I need a DBLP API key? No. DBLP's public JSON API (dblp.org) is free and unauthenticated.

How current is the data? Live — every request hits DBLP directly. DBLP itself typically updates weekly as new conference proceedings and journal issues are released.

Where do author PIDs come from? From DBLP URLs — for example, https://dblp.org/pid/h/JohnEHopcroft.html → PID = h/JohnEHopcroft. Pass this into authorPid.
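
Extracting the PID from a profile URL can be automated. A minimal sketch, assuming author URLs always follow the `/pid/<PID>.html` pattern shown above (`pid_from_url` is a hypothetical helper):

```python
import re
from urllib.parse import urlparse

# Matches "/pid/<PID>" with an optional ".html" suffix.
_PID_PATH = re.compile(r"^/pid/(?P<pid>.+?)(?:\.html)?$")


def pid_from_url(url: str) -> str:
    """Return the DBLP author PID embedded in an author profile URL."""
    match = _PID_PATH.match(urlparse(url).path)
    if match is None:
        raise ValueError(f"not a DBLP author URL: {url!r}")
    return match.group("pid")
```

The result can be passed straight into `authorPid`; URLs that do not contain a `/pid/` path raise a `ValueError` instead of returning a guess.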

Does this cover non-CS papers? DBLP focuses on computer science and closely adjacent fields (HCI, information theory, cryptography, ML/AI, PL, systems). For broader coverage, combine with crossref-scraper or semantic-scholar-scraper.

Rate limits? DBLP has soft rate limits. The actor paces requests conservatively (sleep between batches) to stay well within acceptable use.
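
The general idea of pausing between batches can be sketched as follows. This is not the actor's actual implementation: the batch size and pause length are made-up values, and `paced_map` is a hypothetical helper that shows the pattern for anyone calling DBLP from their own code.

```python
import time
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def paced_map(
    items: Sequence[T],
    fetch: Callable[[T], R],
    batch_size: int = 20,
    pause_seconds: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[R]:
    """Apply fetch to every item, sleeping between batches to limit request rate."""
    results: List[R] = []
    for start in range(0, len(items), batch_size):
        if start > 0:
            sleep(pause_seconds)  # pause between batches, not before the first one
        for item in items[start:start + batch_size]:
            results.append(fetch(item))
    return results
```

The `sleep` parameter is injectable so the pacing can be tested without real delays.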

Can I get paper abstracts? DBLP doesn't provide abstracts itself — it returns titles, authors, venues, and DOI/URL. Follow the DOI through semantic-scholar-scraper or crossref-scraper for abstracts.


Keywords

dblp scraper, computer science bibliography, cs research, dblp api, academic papers, paper search, author search, venue search, conference papers, cs publications, citation analysis, scholarly search, research intelligence, cs hiring, academic recruiting, bibliography api, dblp key, publication metadata, computer science database, cs literature


Changelog

  • 2026-04-24 — Extended README with use cases, integrations, and FAQ
  • 2026-03 — Initial release: publication / author / venue search modes