USPTO Patent Scraper

Pricing

Pay per event

Search US issued patents via the official USPTO Patents Public Search API — patent number, title, abstract, inventors, assignee, classifications, priority, filing, and grant dates — bulk export to JSON or CSV. Free, no key, no login.

Developer

DevilScrapes

Maintained by Community

20 hours ago

Last modified


🎯 What this scrapes

The USPTO Patents Public Search API (developer.uspto.gov/ds-api/patents) is technically free but punishing — quirky XML responses, an under-documented throttle, and a query syntax (TI/<text>, AN/<assignee>, IN/<inventor>) that bites first-timers. This Actor wraps the search endpoint, handles pagination safely, and writes one structured row per patent. No account, no key, no quota to manage on your end.

Every request runs through our fingerprint-rotating HTTP stack so the API sees consistent, well-behaved traffic — the kind that keeps your runs off the block-list.

🔥 Features

  • 🛡️ Browser fingerprint rotation: curl-cffi replays real Chrome / Firefox / Safari TLS handshakes, so the target sees a browser-like TLS fingerprint rather than a default Python client.
  • 🌐 Residential proxy rotation via Apify Proxy — fresh session and exit IP whenever the endpoint pushes back.
  • 🔁 Retries with exponential backoff on 408 / 429 / 5xx — up to 5 attempts per page; Retry-After headers are honoured.
  • 🧱 Rate-limit-aware pacing — we slow down before the API cuts us off, not after.
  • 🧊 Clean, typed dataset rows — Pydantic-validated fields, ISO-8601 timestamps, stable IDs. Export to JSON, CSV, or Excel straight from Apify Console.
  • 💰 Pay-Per-Event pricing — you pay only when results land in your dataset. No data, no charge (only the small actor-start warm-up fee).
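
The retry behaviour in the feature list can be sketched roughly as follows. This is not the Actor's actual code: the function names, the 1-second base delay, the 60-second cap, and the 10% jitter are assumptions for illustration. Only the retryable status codes, the five-attempt limit, and honouring Retry-After come from the list above.

```python
import random
from typing import Optional

RETRYABLE_STATUSES = {408, 429}  # plus any 5xx, per the feature list
MAX_ATTEMPTS = 5                 # "up to 5 attempts per page"


def is_retryable(status: int) -> bool:
    """Return True for 408, 429, and any 5xx status."""
    return status in RETRYABLE_STATUSES or 500 <= status <= 599


def backoff_delay(attempt: int, retry_after: Optional[str] = None,
                  base: float = 1.0, cap: float = 60.0) -> float:
    """Seconds to wait before retry number `attempt` (1-based).

    A numeric Retry-After header value takes precedence. Otherwise the
    delay doubles on each attempt (base, 2*base, 4*base, ...) up to `cap`,
    with up to 10% random jitter added. `base` and `cap` are assumed values.
    """
    if retry_after is not None:
        try:
            return max(0.0, float(retry_after))
        except ValueError:
            pass  # the HTTP-date form of Retry-After is not handled here
    delay = min(cap, base * 2 ** (attempt - 1))
    return delay + random.uniform(0, delay * 0.1)
```

A caller would loop at most `MAX_ATTEMPTS` times, sleeping for `backoff_delay(attempt, response_header)` whenever `is_retryable(status)` is true.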

💡 Use cases

  • IP-landscape monitoring — schedule weekly runs against a competitor's assignee name and diff the output to catch new filings before they publish in trade press.
  • Freedom-to-operate (FTO) research — query by CPC class to surface all patents relevant to a new product before you build.
  • Patent assignee scraper workflow — pull a company's full portfolio, pivot by inventor, assignee, or classification, and feed downstream analysis pipelines.
  • Inventor tracking — follow a key researcher or prolific inventor across employers and publication windows.
  • M&A / acquisition diligence — quantify and categorise a target's patent assets in minutes instead of hours inside expensive IP databases.
  • Academic patent-statistics research — feed a corpus of patent abstracts, classification codes, and grant dates into NLP / citation-analysis workflows.
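
For the monitoring use case, diffing two runs can be as simple as comparing patent numbers. The sketch below assumes each run has been exported as a list of dataset rows keyed by the patent_number field from the output schema; the function name is hypothetical.

```python
def new_patents(previous_rows, current_rows):
    """Return rows in current_rows whose patent_number is absent from previous_rows."""
    seen = {row["patent_number"] for row in previous_rows}
    return [row for row in current_rows if row["patent_number"] not in seen]
```

Feeding it last week's export and this week's export yields only the rows that appeared in between, which can then be passed on to an alert or report.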

⚙️ How to use it

  1. Click Try for free at the top of the page.
  2. Fill in the input form — searchQuery is the only required field; all others have sensible defaults.
  3. Click Start. Results stream into the run's dataset in real time.
  4. When the run finishes, open Storage → Dataset to export as JSON, CSV, or Excel — or pull rows via the Apify API.
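
The same steps can also be driven from code. The following is a minimal sketch using the apify-client Python package, assuming `client` is an `apify_client.ApifyClient` created with your API token; the helper name `fetch_rows` and the `<ACTOR_ID>` placeholder are hypothetical, so substitute the ID shown on the Actor's API tab.

```python
def fetch_rows(client, actor_id, run_input):
    """Start the Actor, wait for the run to finish, and return its dataset rows.

    `client` is expected to be an apify_client.ApifyClient instance.
    """
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return run details")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (requires `pip install apify-client` and a valid token):
#
#   from apify_client import ApifyClient
#   client = ApifyClient("<YOUR_APIFY_TOKEN>")
#   rows = fetch_rows(client, "<ACTOR_ID>",
#                     {"searchQuery": "TI/(neural network)", "maxResults": 3})
```

Passing the client in as an argument keeps the helper easy to test with a stub and leaves token handling to the caller.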

Query syntax tips:

  • TI/(neural network) — title field only
  • AN/Apple — assignee name
  • IN/Hinton — inventor surname
  • Combine with AND / OR: TI/(transformer) AND AN/Google
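
If you build queries programmatically, a small helper keeps the qualifiers consistent. This is a sketch with hypothetical helper names. It assumes that multi-word values are wrapped in parentheses (as in the title example above) and that single words can be left bare (as in the assignee example). The qualifier list is taken from the FAQ further down.

```python
KNOWN_QUALIFIERS = {"TI", "AB", "AN", "IN", "CPC"}


def field(qualifier: str, text: str) -> str:
    """Format one field-qualified term, e.g. TI/(neural network) or AN/Apple."""
    if qualifier not in KNOWN_QUALIFIERS:
        raise ValueError(f"unknown qualifier: {qualifier}")
    text = text.strip()
    # Assumption: parentheses are only needed when the value contains spaces.
    return f"{qualifier}/({text})" if " " in text else f"{qualifier}/{text}"


def all_of(*terms: str) -> str:
    """Join terms with AND."""
    return " AND ".join(terms)


def any_of(*terms: str) -> str:
    """Join terms with OR."""
    return " OR ".join(terms)
```

For example, `all_of(field("TI", "neural network"), field("AN", "Google"))` produces `TI/(neural network) AND AN/Google`, ready to pass as searchQuery.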

📥 Input

| Field | Type | Required | Default | Notes |
| --- | --- | --- | --- | --- |
| searchQuery | string | yes | neural network training | Free-text or field-qualified query. Use TI/, AN/, IN/ qualifiers to narrow scope. |
| maxResults | integer | no | 30 | Total patents to return across all pages. Max 2 000 per run. |
| sortBy | string | no | relevance | relevance, publication_date_desc, or publication_date_asc. |
| proxyConfiguration | object | no | {"useApifyProxy": false} | Apify Proxy configuration. Direct calls work fine; enable proxy if you hit rate limits. |

Example input

{
  "searchQuery": "TI/(neural network)",
  "maxResults": 3,
  "sortBy": "publication_date_desc",
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
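
When the input is generated by a script, it can help to check it against the documented constraints before starting a run. The sketch below is a hypothetical client-side helper, not part of the Actor. The sort values and the 2 000 cap come from the input table, while the lower bound of 1 for maxResults is an assumption.

```python
ALLOWED_SORT = {"relevance", "publication_date_desc", "publication_date_asc"}
MAX_RESULTS_LIMIT = 2000  # "Max 2 000 per run"


def build_input(search_query, max_results=30, sort_by="relevance", use_proxy=False):
    """Return a run-input dict, raising ValueError on out-of-range values."""
    if not search_query.strip():
        raise ValueError("searchQuery is required")
    if not 1 <= max_results <= MAX_RESULTS_LIMIT:  # lower bound is assumed
        raise ValueError(f"maxResults must be between 1 and {MAX_RESULTS_LIMIT}")
    if sort_by not in ALLOWED_SORT:
        raise ValueError(f"sortBy must be one of {sorted(ALLOWED_SORT)}")
    return {
        "searchQuery": search_query,
        "maxResults": max_results,
        "sortBy": sort_by,
        "proxyConfiguration": {"useApifyProxy": use_proxy},
    }
```

Calling `build_input("TI/(neural network)", 3, "publication_date_desc")` reproduces the example input shown above.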

📤 Output

Every row is one dataset item.

| Field | Type | Notes |
| --- | --- | --- |
| patent_number | string | Patent or publication number (without country prefix). |
| publication_id | string | Internal Patents Public Search publication ID. |
| title | string | Patent title. |
| abstract | string or null | Patent abstract text. |
| inventors | array | Inventor names. |
| assignees | array | Assignee organisations. |
| applicants | array | Applicants listed on the filing. |
| cpc_classifications | array | CPC classification codes. |
| ipc_classifications | array | IPC classification codes. |
| publication_date | string or null | Publication date (YYYY-MM-DD). |
| grant_date | string or null | Grant date if granted. |
| filing_date | string or null | Filing date. |
| priority_date | string or null | Priority date. |
| kind_code | string or null | Kind code (B1, A1, etc.). |
| patent_url | string | USPTO Patents Public Search URL for the full document. |
| scraped_at | string | ISO-8601 timestamp when this row was recorded. |

Example output

{
  "patent_number": "11,948,025",
  "title": "System and method for training neural networks",
  "abstract": "A method for training a neural network comprising...",
  "inventors": ["Jane Smith", "John Doe"],
  "assignees": ["Example Corp"],
  "cpc_classifications": ["G06N 3/08", "G06N 20/00"],
  "publication_date": "2024-04-02",
  "grant_date": "2024-04-02",
  "filing_date": "2021-09-15",
  "priority_date": "2021-09-15",
  "kind_code": "B2",
  "patent_url": "https://ppubs.uspto.gov/pubwebapp/external.html?db=USPAT&docId=11948025",
  "scraped_at": "2026-06-01T12:00:00Z"
}
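
In the example above, patent_number carries thousands separators while the docId in patent_url does not. If you join rows against other sources, a small normalisation step may help. The sketch below uses hypothetical helper names and also shows a simple pivot by assignee. It assumes the separators are always commas, which the example suggests but the schema does not state.

```python
from collections import defaultdict


def doc_id(patent_number: str) -> str:
    """Strip comma separators, e.g. '11,948,025' becomes '11948025'."""
    return patent_number.replace(",", "")


def group_by_assignee(rows):
    """Map each assignee name to the list of normalised patent numbers it holds."""
    groups = defaultdict(list)
    for row in rows:
        # Rows with no assignee are collected under a placeholder key.
        for assignee in row.get("assignees") or ["(unassigned)"]:
            groups[assignee].append(doc_id(row["patent_number"]))
    return dict(groups)
```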

💰 Pricing

Pay-Per-Event — you pay only when these events fire:

| Event | USD | What it is |
| --- | --- | --- |
| actor-start | $0.005 | One-off warm-up charge per run |
| result | $0.003 | Per dataset item written |

Example: 1 000 results at the rates above cost 1 000 × $0.003 = $3.00, plus the $0.005 actor-start fee, for $3.005 in total. No subscription, no minimum; Apify gives every new account $5 of free credit to start. Commercial IP databases charge $10 000+/seat/year for the same underlying public data.
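
To budget a batch of runs, the arithmetic can be wrapped in a small function. This is an illustrative sketch with a hypothetical function name, using the two event prices from the table above. Decimal avoids floating-point rounding on currency values.

```python
from decimal import Decimal

ACTOR_START_USD = Decimal("0.005")  # charged once per run
RESULT_USD = Decimal("0.003")       # charged per dataset item


def estimate_cost(results: int, runs: int = 1) -> Decimal:
    """Estimated USD cost of `results` dataset items spread over `runs` runs."""
    return ACTOR_START_USD * runs + RESULT_USD * results
```

For instance, 52 weekly runs returning 30 results each come to `estimate_cost(52 * 30, runs=52)`.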

🚧 Limitations

  • Result ordering is controlled by USPTO Patents Public Search internally; we pass your sortBy preference but the ranking algorithm is theirs.
  • Inventor address details (city, state) and full claims text require follow-up document calls and are not included in this Actor. The patent_url field links directly to the full patent document.
  • Patent family members (foreign equivalents, continuations) are not surfaced — USPTO's cross-reference data requires a separate API call sequence.
  • Publication cadence — the USPTO typically publishes new grants on Tuesdays; new filings appear in the Patents Public Search API within approximately one week of publication. This is not a real-time stream.
  • Legal disclaimer — this tool retrieves public patent metadata for research purposes. It is not a substitute for professional patent counsel. Always consult a registered patent attorney for FTO opinions, invalidity analysis, or litigation strategy.

❓ FAQ

What is the USPTO Patents Public Search API?

The Patents Public Search API (developer.uspto.gov/ds-api/patents) is the USPTO's free, publicly documented REST endpoint for querying the full US patent corpus — issued patents back to 1976 and published applications from 2001. This Actor handles the pagination, query encoding, and field normalisation so you get clean rows instead of raw XML.

Is a patent search API key required?

No. The USPTO Patents Public Search API is open — no registration, no key, no quota letter. We still run our fingerprint-rotation and backoff stack to keep your runs clean and consistent.

Is the data real-time?

USPTO publishes new grants weekly (typically Tuesday issue dates). New patent applications appear in the search index within approximately one week of their publication date. For monitoring workflows, daily or weekly scheduled runs are appropriate.

Can I get full patent claims or drawings?

We surface the patent_url field — click it to view the full document in the USPTO Patents Public Search viewer. Full-text claims extraction and drawing download are roadmap items. For bulk full-text needs, the USPTO also publishes weekly XML bulk data files.

What query syntax does the Actor support?

The Actor passes your searchQuery directly to the Patents Public Search API. Field qualifiers: TI/ (title), AB/ (abstract), AN/ (assignee name), IN/ (inventor name), CPC/ (CPC class). Boolean operators: AND, OR, NOT. Proximity: ADJ. See the USPTO query guide for the full reference.

Does this cover WIPO or EPO patents?

No — this Actor covers US patents and published applications only. WIPO (PCT applications) and EPO patents require separate endpoints. We can build dedicated Actors for those on request.

Why is assignees sometimes empty?

Pre-grant publications (kind code A1) often don't carry an assignee — the assignment may not be recorded until grant. Check applicants as a fallback; it is populated on most pre-grant records.
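
In post-processing, that fallback is a one-liner. The sketch below uses a hypothetical helper name and assumes rows are plain dicts as exported from the dataset.

```python
def owners(row):
    """Return assignees when present, otherwise fall back to applicants."""
    return row.get("assignees") or row.get("applicants") or []
```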

How does the pricing compare to PatSnap or Derwent?

Commercial IP platforms start at roughly $10 000/seat/year. This Actor charges $3.00 per 1 000 results, billed only for what lands in your dataset. It's a wrapper around the same publicly available USPTO data — not a replacement for the analytics layers those platforms provide, but orders of magnitude cheaper for raw patent data retrieval.

💬 Your feedback

Spotted a bug, hit an edge case, or need a new field (inventor address, claim text, family members)? Open an issue on the Actor's Issues tab in Apify Console — we ship fixes weekly and read every report.