China CNIPA Patent Scraper
Pricing
Pay per event
Scrape Chinese patent data from Google Patents. Search by keyword, IPC classification, assignee, or date range. Returns bilingual (Chinese + English) patent records including title, abstract, inventors, assignees, CPC/IPC codes, legal status, and patent family information.
Developer
BowTiedRaccoon
Maintained by Community
Scrape Chinese patent data from Google Patents. Search by keyword, IPC/CPC classification code, assignee name, or date range. Returns bilingual (Chinese + English) patent records with full bibliographic data.
What it does
Two operating modes:
- Search mode (default): Queries Google Patents for Chinese patents using the provided keyword, classification code, or assignee name. Pages through results and fetches each patent's detail page.
- Direct URL mode: Provide specific Google Patents URLs (e.g. https://patents.google.com/patent/CN115982415A/en) and the actor scrapes only those patents, skipping the search step entirely.
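If you already hold a list of publication numbers, the direct URL mode input can be assembled programmatically. The following is a minimal sketch, assuming the URL pattern of the example above (`https://patents.google.com/patent/<number>/en`) applies to every CN publication and that a publication number looks like `CN`, digits, then a kind code. The helper names `build_start_urls` and `publication_number_from_url` are hypothetical and not part of the actor.

```python
import re

# Assumed URL pattern, inferred from the example URL in this README.
URL_TEMPLATE = "https://patents.google.com/patent/{number}/en"
# Assumed shape of a CN publication number: "CN", digits, kind code (e.g. A, B, U).
CN_NUMBER = re.compile(r"^CN\d+[A-Z]\d?$")


def build_start_urls(numbers):
    """Turn publication numbers into Google Patents URLs, skipping malformed ones."""
    urls = []
    for raw in numbers:
        number = raw.strip().upper()
        if CN_NUMBER.match(number):
            urls.append(URL_TEMPLATE.format(number=number))
    return urls


def publication_number_from_url(url):
    """Extract the publication number from a detail-page URL, or return None."""
    match = re.search(r"/patent/([A-Z]{2}\d+[A-Z]\d?)(?:/|$)", url)
    return match.group(1) if match else None


urls = build_start_urls(["CN115982415A", " cn108081978b ", "not-a-number"])
actor_input = {"startUrls": urls, "maxItems": len(urls)}
```

Setting `maxItems` to the number of URLs mirrors the direct URL example input, where two URLs are paired with `"maxItems": 2`.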
Output fields
Each scraped patent produces one dataset record with 20 fields:
| Field | Description |
|---|---|
| publication_number | CNIPA publication number (e.g. CN114547329B) |
| application_number | Application number (e.g. CN:202310093204.3A) |
| title_en | Patent title in English (Google-translated) |
| abstract_en | Full abstract in English (Google-translated) |
| abstract_cn | Full abstract in Chinese (original text) |
| filing_date | Filing date (YYYY-MM-DD) |
| publication_date | Publication date (YYYY-MM-DD) |
| grant_date | Grant date (YYYY-MM-DD), empty if not yet granted |
| priority_date | Priority date (YYYY-MM-DD) |
| status | Legal status (Pending, Active, Expired, etc.) |
| inventors_cn | Inventors in Chinese characters, comma-separated |
| assignees_en | Current assignees in English, comma-separated |
| assignees_cn | Original assignees in Chinese, comma-separated |
| cpc_codes | CPC classification codes, comma-separated (leaf codes only) |
| ipc_codes | IPC classification codes, comma-separated |
| legal_events | Legal event timeline as JSON string |
| family_members | Patent family members, comma-separated |
| pdf_url | URL to patent PDF |
| google_patents_url | Google Patents detail page URL |
| scraped_at | ISO timestamp of when the record was scraped |
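Because several fields arrive as comma-separated strings and the dates as `YYYY-MM-DD` text, downstream code usually wants them converted first. The sketch below shows one way to do that on the consumer side. `normalize_record`, `split_list`, and `parse_date` are hypothetical helper names, and the sample record is a trimmed illustration rather than real actor output.

```python
from datetime import date

# Fields the table above describes as comma-separated strings.
LIST_FIELDS = (
    "inventors_cn", "assignees_en", "assignees_cn",
    "cpc_codes", "ipc_codes", "family_members",
)
# Fields the table above describes as YYYY-MM-DD dates.
DATE_FIELDS = ("filing_date", "publication_date", "grant_date", "priority_date")


def split_list(value):
    """Split a comma-separated string into trimmed items; empty input gives []."""
    return [item.strip() for item in (value or "").split(",") if item.strip()]


def parse_date(value):
    """Parse a YYYY-MM-DD string; empty values (e.g. no grant yet) become None."""
    return date.fromisoformat(value) if value else None


def normalize_record(record):
    """Return a copy of the record with list and date fields converted."""
    out = dict(record)
    for field in LIST_FIELDS:
        out[field] = split_list(record.get(field))
    for field in DATE_FIELDS:
        out[field] = parse_date(record.get(field))
    return out


sample = {
    "publication_number": "CN115982415A",
    "filing_date": "2023-02-06",
    "grant_date": "",
    "inventors_cn": "汤韬, 高鹏飞, 孙权",
    "cpc_codes": "G06F16/174, G06F16/901",
    "ipc_codes": "",
}
clean = normalize_record(sample)
```

Splitting on commas is an assumption that holds for classification codes and patent numbers; an assignee name that itself contains a comma would be split incorrectly, so treat the assignee lists with care.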
Input parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| maxItems | integer | Yes | Maximum patents to scrape (default: 10, Google Patents caps at 1,000 per query) |
| query | string | No | Search keywords, IPC/CPC codes, or assignee name. Examples: battery electric vehicle, H04L, Huawei |
| dateFrom | string | No | Filter patents filed on or after this date (YYYY-MM-DD) |
| dateTo | string | No | Filter patents filed on or before this date (YYYY-MM-DD) |
| startUrls | array | No | Direct Google Patents URLs to scrape. When provided, skips search entirely |
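An input can be sanity-checked on the client side before a run is started. The sketch below is an assumption about what is worth checking, derived only from the parameter table and the two operating modes described earlier; it does not reproduce the actor's own validation, and `validate_input` is a hypothetical name.

```python
from datetime import date

MAX_RESULTS_PER_QUERY = 1000  # cap stated in the parameter table


def validate_input(actor_input):
    """Return (mode, problems) for an input dict. Client-side sketch only."""
    problems = []

    max_items = actor_input.get("maxItems")
    valid_int = isinstance(max_items, int) and not isinstance(max_items, bool)
    if not valid_int or max_items < 1:
        problems.append("maxItems must be a positive integer")

    # startUrls takes precedence: when provided, the search step is skipped.
    mode = "direct" if actor_input.get("startUrls") else "search"

    parsed = {}
    for key in ("dateFrom", "dateTo"):
        value = actor_input.get(key)
        if value:
            try:
                # Newer Python versions also accept other ISO 8601 spellings.
                parsed[key] = date.fromisoformat(value)
            except ValueError:
                problems.append(f"{key} must be YYYY-MM-DD, got {value!r}")
    if len(parsed) == 2 and parsed["dateFrom"] > parsed["dateTo"]:
        problems.append("dateFrom is after dateTo")

    if mode == "search" and valid_int and max_items > MAX_RESULTS_PER_QUERY:
        problems.append("maxItems exceeds the 1,000-result cap per query")
    return mode, problems


mode, problems = validate_input(
    {"query": "H04L", "dateFrom": "2024-12-31", "dateTo": "2022-01-01", "maxItems": 5000}
)
```

Here `mode` is `"search"` and `problems` lists the reversed date range and the oversized `maxItems`.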
Example inputs
Search by keyword with date range:

    {
      "query": "artificial intelligence medical",
      "dateFrom": "2022-01-01",
      "dateTo": "2024-12-31",
      "maxItems": 50
    }

Search by IPC classification code:

    {
      "query": "H04L",
      "maxItems": 100
    }

Search by assignee:

    {
      "query": "Huawei",
      "maxItems": 200
    }

Direct URL mode:

    {
      "startUrls": [
        "https://patents.google.com/patent/CN115982415A/en",
        "https://patents.google.com/patent/CN108081978B/en"
      ],
      "maxItems": 2
    }
Example output

    {
      "publication_number": "CN115982415A",
      "application_number": "CN:202310093204.3A",
      "title_en": "Incremental graph division method, device, equipment, medium and product",
      "abstract_en": "The application discloses an incremental graph division method...",
      "abstract_cn": "本申请公开了一种增量图划分方法、装置、设备、介质及产品...",
      "filing_date": "2023-02-06",
      "publication_date": "2023-04-18",
      "grant_date": "",
      "priority_date": "2023-02-06",
      "status": "",
      "inventors_cn": "汤韬, 高鹏飞, 孙权, 潘婧, 赵金涛, 郑建宾, 艾博轩, 庞悦",
      "assignees_en": "China Unionpay Co Ltd",
      "assignees_cn": "China Unionpay Co Ltd",
      "cpc_codes": "G06F16/174, G06F16/901, G06F16/9536, G06Q50/00, Y02D10/00",
      "ipc_codes": "",
      "legal_events": "[{\"date\":\"2023-02-06\",\"title\":\"Application filed by China Unionpay Co Ltd\",\"type\":\"filed\"},...]",
      "family_members": "CN115982415A, WO2024164667A1, ...",
      "pdf_url": "https://patentimages.storage.googleapis.com/64/48/49/800ddc17fec1fc/CN115982415A.pdf",
      "google_patents_url": "https://patents.google.com/patent/CN115982415A/en",
      "scraped_at": "2026-05-11T04:22:20.537Z"
    }

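A record like the one above can be reduced to a few convenience values: whether the patent is granted, which legal event types occurred, and which family members lie outside China. The sketch below is one possible reading of the fields, assuming an empty `grant_date` means "not yet granted" (as the field table states) and that CN family members start with `CN`. `summarize` is a hypothetical helper, and the sample is a trimmed copy of the example with the elided parts left out.

```python
import json


def summarize(record):
    """Derive a few convenience values from one output record."""
    # legal_events is a JSON string, so it needs a second decoding step.
    events = json.loads(record.get("legal_events") or "[]")
    family = [
        member.strip()
        for member in (record.get("family_members") or "").split(",")
        if member.strip()
    ]
    return {
        "granted": bool(record.get("grant_date")),
        "event_types": [event.get("type") for event in events],
        "foreign_family": [m for m in family if not m.startswith("CN")],
    }


record = {
    "publication_number": "CN115982415A",
    "grant_date": "",
    "legal_events": '[{"date":"2023-02-06",'
                    '"title":"Application filed by China Unionpay Co Ltd",'
                    '"type":"filed"}]',
    "family_members": "CN115982415A, WO2024164667A1",
}
summary = summarize(record)
```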
Notes
- Google Patents caps search results at 1,000 patents per query. Use dateFrom/dateTo to narrow large result sets.
- The search API returns patents sorted by Google's relevance ranking. Results are not guaranteed to be exhaustive.
- Chinese abstracts (abstract_cn) are extracted when available in the original-language source text. Not all patents have the original Chinese text displayed.
- Legal events are returned as a JSON string containing an array of {date, title, type} objects.
- Family members include all patent numbers in the global patent family (not limited to CN).
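One way to work within the 1,000-result cap is to split a long date range into consecutive windows and run the same query once per window. The sketch below only builds the inputs and merges results; it does not call the actor. The function names and the 31-day window are illustrative assumptions, and a busy window can still exceed the cap, in which case a smaller `days` value is needed.

```python
from datetime import date, timedelta


def date_windows(start, end, days):
    """Yield inclusive (from, to) ISO date pairs covering start..end without overlap."""
    cursor = start
    while cursor <= end:
        window_end = min(cursor + timedelta(days=days - 1), end)
        yield cursor.isoformat(), window_end.isoformat()
        cursor = window_end + timedelta(days=1)


def windowed_inputs(query, start, end, days=31, max_items=1000):
    """Build one actor input per window, using the documented input parameters."""
    return [
        {"query": query, "dateFrom": lo, "dateTo": hi, "maxItems": max_items}
        for lo, hi in date_windows(start, end, days)
    ]


def dedupe(records):
    """Keep the first record per publication_number when merging several runs."""
    seen = {}
    for record in records:
        seen.setdefault(record["publication_number"], record)
    return list(seen.values())


inputs = windowed_inputs("H04L", date(2024, 1, 1), date(2024, 3, 31))
```

Since both date filters are inclusive, each window starts the day after the previous one ends, so a patent should fall into exactly one window; `dedupe` is kept as a safeguard when results from several runs are combined.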