Google Patents Scraper — Claims, Inventors & Citations

Pricing

$8.99/month + usage

Scrape complete patent data from Google Patents. Returns full claims text, abstract, inventors, applicants, IPC/CPC classifications, filing dates, legal status and citation counts. Search by patent ID, keyword, assignee or inventor. RESIDENTIAL proxy included.

Rating: 0.0 (0)

Developer: Scrape Pilot (maintained by Community)

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 15 days ago

🔬 Google Patents Scraper — Extract Patent Data, Claims, Inventors & Classifications

The most complete Google Patents Scraper on Apify. Extract full patent records from Google Patents — title, abstract, claims, inventors, assignees, filing dates, IPC/CPC classifications, legal status, citation counts, and patent family data. Search by keyword, assignee, inventor, or country. No API key required.


🆓 Try FREE for 2 hours — no credit card needed. Then just $8.99/month for unlimited Google Patents scraping — by patent ID, keyword search, assignee, inventor, or country.


🔍 What Is This Actor?

Google Patents Scraper is a production-grade Apify actor that extracts complete patent records from Google Patents — the world's largest free patent search engine covering patents from the USPTO, EPO, WIPO, and dozens of national patent offices worldwide.

This Google Patents scraper supports two input modes: provide specific patent IDs (US10000000B2, EP1234567A1, etc.) for full detail extraction, or provide a search query with optional filters (assignee, inventor, country, date range) to discover and extract matching patents at scale.

Every patent record returned by this Google Patents scraper includes the complete set of structured fields: patent title, abstract, full claims text, description excerpt, inventors list, applicants and assignees, filing date, publication date, priority date, legal status (active/expired/pending), IPC and CPC classification codes, citation counts, patent family members, and country of origin.

Whether you are an IP attorney building a prior art database, a corporate IP team monitoring competitor patent activity, a researcher studying technology trends, or a developer integrating patent data into a product — this Google Patents scraper delivers enterprise-grade data with no API key and no per-query fees.


🚀 Why This Is the Best Google Patents Scraper

FeatureThis ActorGoogle Patents API (via BigQuery)Lens.orgDerwent / Clarivate
No API key required❌ GCP account needed
Full claims text⚠️ Limited
Full abstract
IPC + CPC classifications✅ Both
Legal status
Patent family data
Citation counts
Assignee search✅ Built-in
Inventor search✅ Built-in
Country filter✅ Built-in
Bulk patent ID lookup⚠️ Complex⚠️
Price$8.99/moFree (complex setup)Free (limited)$5,000+/yr

This Google Patents scraper gives you enterprise-grade patent data at $8.99/month — compared to $5,000+/year for professional patent intelligence platforms.


💰 Pricing

🆓 Free Trial — 2 Hours, No Credit Card

Start using this Google Patents scraper immediately with a full 2-hour free trial. No credit card. Click Try for free, enter a patent ID or search query, and get your first patent records in under 2 minutes.

During the free trial you get:

  • ✅ Both scraping modes — Patent ID lookup + Search query
  • ✅ All output fields — claims, abstract, inventors, classifications, legal status, citations
  • ✅ Assignee, inventor, country, and date filters
  • ✅ Full detail mode — visits each patent's detail page for complete data
  • ✅ JSON output ready for export or API integration

After the free trial, continue with $8.99/month for unlimited Google Patents scraping:

  • Unlimited patent ID lookups — extract full records for any number of patents
  • Unlimited search runs — search by keyword, assignee, inventor, country, date
  • Full claims text — the most valuable part of any patent, fully extracted
  • IPC + CPC classification codes — for technology landscape mapping
  • Legal status — know instantly if a patent is active, expired, or pending
  • Apify scheduling — automate weekly competitor patent monitoring
  • API access — integrate patent data into your IP platform or research tool

💡 $8.99/Month vs Patent Intelligence Alternatives

ToolPriceClaimsClassificationsBulk LookupAPI
This Google Patents Scraper$8.99/mo✅ Full text✅ IPC + CPC✅ Apify
Google Patents (manual)Free❌ One-by-one
Lens.orgFree⚠️ Limited⚠️✅ Limited
PatSnap$500+/mo
Derwent Innovation$5,000+/yr
Questel Orbit$3,000+/yr

🎯 $8.99/month delivers the same core patent data as tools costing $3,000–$5,000 per year. Try it free for 2 hours first.


🎬 2 Scraping Modes

Mode 1: Patent ID Lookup (Direct)

Provide one or many patent IDs and extract the complete detail record for each — title, abstract, full claims, inventors, assignees, dates, classifications, legal status, citation counts, and patent family.

Supported patent ID formats:

US10000000B2 ← US patent, granted
EP1234567A1 ← European patent application
WO2023123456A1 ← PCT international application
CN112345678A ← China patent
JP2023123456A ← Japan patent
KR102345678B1 ← South Korea patent
DE102021123456A1 ← Germany patent
GB2601234A ← UK patent

Mode 2: Search Query (Discovery)

Provide a keyword search query — with optional filters for assignee company, inventor name, country, and date range — and extract all matching patents. When fetch_full_details: true, the scraper visits each result's detail page for the complete record.

Search query examples:

machine learning drug discovery
electric vehicle battery management system
quantum computing error correction
CRISPR gene editing
natural language processing transformer
solar panel efficiency improvement

🎯 Use Cases

Use this Google Patents scraper to perform prior art searches for patent applications, freedom-to-operate analyses, and patent validity assessments. Extract full claims text for any patent portfolio in minutes rather than hours.

🏢 Corporate IP Strategy & Competitor Monitoring

Monitor competitor patent filings by scraping all patents assigned to a specific company (assignee: "Tesla", assignee: "Samsung"). Track new filings weekly to understand competitor R&D direction and identify technology whitespace.

🔬 Technology & Innovation Research

Map technology landscapes by extracting patents across a technology domain. Analyze IPC/CPC classification distributions to identify which subtechnologies are most active, which inventors are most prolific, and how filing rates are trending over time.

🎓 Academic & Policy Research

Build structured patent datasets for academic research on innovation, technology transfer, patent thickets, or industrial policy. Extract inventor networks, assignee histories, and citation graphs from Google Patents at scale without manual data collection.

💹 Investment & Market Intelligence

Identify technology leaders in emerging sectors by analyzing patent filing velocity, citation impact, and patent family breadth. Track which startups and research institutions are most active in your investment thesis areas.

🤝 Licensing & Partnership Research

Find patent holders in a specific technology area by searching Google Patents by IPC classification or technology keyword. Identify potential licensing partners or acquisition targets based on their patent portfolio.

🛡️ Patent Infringement Analysis

Extract claims text from competitor patents to identify potential infringement risk areas. The full claims text returned by this Google Patents scraper is the primary legal document defining patent scope.

💻 Building IP Intelligence Platforms

Developers building patent analytics tools, IP management platforms, or technology intelligence products can use this Google Patents scraper as their data pipeline — clean JSON with all structured fields ready for any database.


📋 Output Fields (Full Reference)

Every record from this Google Patents scraper is labeled "ENTERPRISE" grade and contains:

📄 Core Patent Identity Fields

Field | Type | Description | Example
Patent_ID | string | Normalized patent number | "US10000000B2"
Title | string | Patent title (max 500 chars) | "System and method for neural network..."
Country_Code | string | 2-letter country code | "US", "EP", "WO"
Country | string | Full country / office name | "United States", "World (PCT)"
Source_URL | string | Direct Google Patents URL | "https://patents.google.com/patent/US10000000B2"

📝 Document Content Fields

Field | Type | Description | Example
Abstract | string | Patent abstract (max 3000 chars) | "A system comprising..."
Claims | string | Full claims text (max 5000 chars) | "1. A method comprising the steps of..."
Claims_Count | integer | Number of claims in the patent | 20
Description | string | Description excerpt (max 3000 chars) | "The present invention relates to..."

👥 People & Organization Fields

Field | Type | Description | Example
Inventors | array | List of inventor names | ["John Smith", "Jane Doe"]
Applicants | array | List of applicant/assignee names | ["Apple Inc."]
Assignee | string | Primary assignee (first in list) | "Apple Inc."
Agent | string | Patent agent or attorney | "Kilpatrick Townsend & Stockton LLP"

📅 Date Fields

Field | Type | Description | Example
Publication_Date | string | Publication date (YYYY-MM-DD) | "2024-06-18"
Filing_Date | string | Application filing date | "2022-03-15"
Priority_Date | string | Priority / earliest claim date | "2021-09-30"

🏷️ Classification Fields

Field | Type | Description | Example
IPC_Classification | array | International Patent Classification codes | ["G06N3/04", "G06N20/00"]
CPC_Classification | array | Cooperative Patent Classification codes (max 10) | ["G06N3/044", "G06N3/08"]

⚖️ Status & Citation Fields

Field | Type | Description | Example
Legal_Status | string | Patent legal status | "Active", "Expired", "Pending"
Cited_Patents_Count | integer | Number of patents cited by this patent | 45
Cited_By_Count | integer | Number of patents that cite this patent | 312

👨‍👩‍👧‍👦 Patent Family Fields

Field | Type | Description | Example
Patent_Family | array | Related patents in the same family (max 10) | ["US10000001B2", "EP3456789A1"]
Patent_Family_Count | integer | Total family member count | 8

🔧 Meta Fields

Field | Type | Description | Example
Extraction_Status | string | Extraction result indicator | "Verified ✅" or "Failed ❌"
Data_Grade | string | Quality label | "ENTERPRISE"
Timestamp | string | ISO extraction timestamp | "2024-11-01T10:30:00Z"
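
As an illustration of how the fields above might be consumed downstream, here is a minimal Python sketch that drops failed extractions and groups active patents by assignee. The helper names (`verified`, `active_ids_by_assignee`) are hypothetical and not part of the actor; only the field names and example values come from the tables above.

```python
from collections import defaultdict
from typing import Any

Record = dict[str, Any]


def verified(records: list[Record]) -> list[Record]:
    """Keep records whose Extraction_Status starts with 'Verified'."""
    return [
        r for r in records
        if str(r.get("Extraction_Status", "")).startswith("Verified")
    ]


def active_ids_by_assignee(records: list[Record]) -> dict[str, list[str]]:
    """Group the Patent_ID of verified, active patents by primary assignee."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for record in verified(records):
        if record.get("Legal_Status") == "Active":
            # Records without an Assignee are collected under "Unknown".
            grouped[record.get("Assignee") or "Unknown"].append(record["Patent_ID"])
    return dict(grouped)
```

The input is assumed to be the list of JSON records exported from a run, already loaded with `json.load`.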

⚙️ Input Parameters

{
"patent_ids": ["US10000000B2", "EP1234567A1"],
"search_query": "electric vehicle battery",
"assignee": "Tesla",
"inventor": "",
"country": "US",
"date_from": "2022-01-01",
"max_results": 20,
"fetch_full_details": true,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}
Parameter | Type | Default | Description
patent_ids | array or string | [] | Specific patent IDs to look up. Accepts an array or a newline/comma-separated string. IDs are normalized automatically (e.g. 10000000 → US10000000).
search_query | string | "" | Keyword search query for Google Patents search. Used when no patent_ids are provided.
assignee | string | "" | Filter search results by assignee (company) name, e.g. "Apple", "Samsung", "MIT"
inventor | string | "" | Filter search results by inventor name, e.g. "Elon Musk", "Jane Smith"
country | string | "" | Filter by country code, e.g. "US", "EP", "CN", "JP"
date_from | string | "" | Filter to patents published on or after this date (YYYY-MM-DD format)
max_results | integer | 20 | Maximum number of patent records to extract
fetch_full_details | boolean | true | When true, visits each patent's detail page for complete data (recommended). When false, returns search-result-level data only (faster).
proxyConfiguration | object | Off | Apify proxy config; RESIDENTIAL is required for Google Patents
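
The sketch below shows one way a caller could pre-process input before starting a run: it splits a newline/comma-separated `patent_ids` string and decides which mode the input implies. The function names are hypothetical, and the rule that `patent_ids` takes precedence is an assumption based on the note that `search_query` is used when no `patent_ids` are provided; the actor's real validation may differ.

```python
import re
from typing import Any


def parse_patent_ids(value: Any) -> list[str]:
    """Accept a list or a newline/comma-separated string of patent IDs."""
    if value is None:
        return []
    if isinstance(value, str):
        parts = re.split(r"[,\n]+", value)
    else:
        parts = [str(item) for item in value]
    # Drop surrounding whitespace and empty entries.
    return [part.strip() for part in parts if part.strip()]


def select_mode(run_input: dict[str, Any]) -> str:
    """Return 'patent_id' or 'search' (assumed precedence: IDs first)."""
    if parse_patent_ids(run_input.get("patent_ids")):
        return "patent_id"
    if any(run_input.get(key) for key in ("search_query", "assignee", "inventor")):
        return "search"
    raise ValueError("Provide patent_ids or at least one search field")
```

Treating `assignee` or `inventor` alone as a valid search mirrors the inventor-only example input later in this document.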

📦 Example Inputs & Outputs

Example 1: Specific Patent ID Lookup

Input:

{
"patent_ids": ["US10000000B2"],
"fetch_full_details": true
}

Output:

{
"Patent_ID": "US10000000B2",
"Title": "Neuromorphic Computing System with Adaptive Learning",
"Abstract": "A neuromorphic computing system comprising adaptive synaptic weights...",
"Claims": "1. A neuromorphic computing system comprising: a plurality of artificial neurons...",
"Claims_Count": 24,
"Description": "The present invention relates to neuromorphic computing architectures...",
"Inventors": ["John A. Smith", "Maria L. Rodriguez"],
"Applicants": ["IBM Corporation"],
"Assignee": "IBM Corporation",
"Agent": "Eschweiler & Associates LLC",
"Publication_Date": "2018-06-19",
"Filing_Date": "2016-11-15",
"Priority_Date": "2015-08-22",
"IPC_Classification": ["G06N3/04", "G06N20/00"],
"CPC_Classification": ["G06N3/044", "G06N3/08", "G06N20/00"],
"Legal_Status": "Active",
"Cited_Patents_Count": 45,
"Cited_By_Count": 312,
"Patent_Family": ["US10000001B2", "EP3234567A1", "CN107851096A"],
"Patent_Family_Count": 8,
"Country_Code": "US",
"Country": "United States",
"Source_URL": "https://patents.google.com/patent/US10000000B2",
"Extraction_Status": "Verified ✅",
"Data_Grade": "ENTERPRISE",
"Timestamp": "2024-11-01T10:30:00Z"
}

Example 2: Bulk Patent ID List

Input:

{
"patent_ids": [
"US10000000B2",
"EP1234567A1",
"WO2023123456A1",
"CN112345678A"
],
"fetch_full_details": true,
"max_results": 4
}

Output: 4 complete patent records — one per ID — each with full claims, inventors, classifications, legal status, and citation data. The country is auto-detected from the patent ID prefix.


Example 3: Search by Keyword

Input:

{
"search_query": "quantum computing error correction",
"max_results": 10,
"fetch_full_details": true
}

Output: Up to 10 complete patent records for quantum computing error correction patents — with full claims, inventor lists, assignee companies, IPC/CPC codes, and legal status for each.


Example 4: Keyword + Assignee + Country Filter

Input:

{
"search_query": "electric vehicle",
"assignee": "Tesla",
"country": "US",
"max_results": 20
}

Output: Up to 20 US patents assigned to Tesla in the electric vehicle technology area — useful for competitor IP monitoring and technology landscape analysis.


Example 5: Search by Inventor

Input:

{
"inventor": "Elon Musk",
"max_results": 15,
"fetch_full_details": true
}

Output: Up to 15 patents where Elon Musk is listed as an inventor — with full patent details, assignees, dates, and classifications.


Example 6: Technology + Country + Date Filter

Input:

{
"search_query": "CRISPR gene editing",
"country": "EP",
"date_from": "2020-01-01",
"max_results": 15
}

Output: Up to 15 European patents for CRISPR gene editing technology published since January 2020.


⚙️ How the Google Patents Scraper Works

This Google Patents scraper uses a smart two-phase extraction strategy optimized for Google Patents' specific page structure:

Patent ID Mode — Direct Detail Extraction

Step 1 — ID Normalization Patent IDs are automatically normalized to the correct format. 10000000 becomes US10000000, missing country prefixes are added for US patents, and formatting inconsistencies (spaces, dashes) are cleaned.

Step 2 — Detail Page Fetch The scraper requests the Google Patents detail page for each patent ID using Chrome browser impersonation and a residential proxy. The full HTML page is returned — including rendered structured data markup.

Step 3 — Multi-Layer HTML Parsing Patent data is extracted using a priority system: first from itemprop schema.org attributes (most reliable), then from CSS class selectors for Google Patents' specific layout, then from structured data tables as fallback.

Step 4 — Classification Extraction IPC and CPC classification codes are extracted from dedicated code elements. Both systems are extracted independently — giving you the international (IPC) and cooperative (CPC) classification in separate fields.

Step 5 — Claims Analysis The full claims section is extracted as text, and the number of individual claims is counted by detecting numbered claim openings in the text.
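
Steps 3 and 5 can be pictured with a short Python sketch. It is not the actor's code: the extractor helpers, the HTML snippets they would match, and the claim-counting heuristic are all assumptions chosen to illustrate a priority-ordered fallback chain and sequential detection of numbered claim openings.

```python
import re
from typing import Callable, Optional

Extractor = Callable[[str], Optional[str]]


def by_itemprop(name: str) -> Extractor:
    """Build an extractor for the text of an element carrying itemprop=name."""
    pattern = re.compile(r'itemprop="' + re.escape(name) + r'"[^>]*>([^<]+)<')

    def extract(html: str) -> Optional[str]:
        match = pattern.search(html)
        return match.group(1) if match else None

    return extract


def by_title_tag(html: str) -> Optional[str]:
    """Fallback extractor: text of the <title> element."""
    match = re.search(r"<title>([^<]+)</title>", html)
    return match.group(1) if match else None


def first_non_empty(html: str, extractors: list[Extractor]) -> Optional[str]:
    """Try extractors in priority order; return the first non-blank value."""
    for extract in extractors:
        value = extract(html)
        if value and value.strip():
            return value.strip()
    return None


def count_claims(claims_text: str) -> int:
    """Count claims by finding '1. X', then '2. X', ... in order."""
    count, position = 0, 0
    while True:
        # Next expected claim number, followed by a period and a capital letter.
        pattern = re.compile(rf"(?<![\d.]){count + 1}\s*\.\s+(?=[A-Z])")
        match = pattern.search(claims_text, position)
        if match is None:
            return count
        count += 1
        position = match.end()
```

Looking for claim numbers in ascending order avoids miscounting references such as "the method of claim 1" inside later claims.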

Search Mode — Discovery + Full Detail

Step 1 — Search Query Construction Keyword, assignee, inventor, country, and date filters are combined into a Google Patents search URL. The search is executed against Google Patents' results page.

Step 2 — Results Parsing Search results are parsed — first from any embedded JSON data in the page, then from HTML card elements. Patent IDs, titles, assignees, and publication dates are extracted from the results list.

Step 3 — Full Detail Fetch (when enabled) When fetch_full_details: true, each result's patent ID is used to fetch its full detail page — providing complete claims, classifications, legal status, citation counts, and patent family data beyond what appears in search results.
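
To make Step 1 of search mode concrete, the following Python sketch combines the filters into a query string. The parameter names (`q`, `assignee`, `inventor`, `country`, `after=publication:YYYYMMDD`) are assumptions about the Google Patents search URL and are not confirmed by this document; the actor may build its URL differently.

```python
from urllib.parse import urlencode


def build_search_url(
    search_query: str = "",
    assignee: str = "",
    inventor: str = "",
    country: str = "",
    date_from: str = "",
) -> str:
    """Combine the non-empty filters into one search URL (assumed format)."""
    params: dict[str, str] = {}
    if search_query:
        params["q"] = search_query
    if assignee:
        params["assignee"] = assignee
    if inventor:
        params["inventor"] = inventor
    if country:
        params["country"] = country
    if date_from:
        # Assumed date syntax: YYYY-MM-DD becomes publication:YYYYMMDD.
        params["after"] = "publication:" + date_from.replace("-", "")
    return "https://patents.google.com/?" + urlencode(params)
```

Empty filters are simply left out, so an inventor-only input produces a URL with a single `inventor` parameter.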


🔍 Search Filters Guide

Keyword Search (search_query)

Full-text search across patent titles, abstracts, and claims:

machine learning image recognition
lithium ion battery thermal management
autonomous vehicle lidar sensor fusion
mRNA vaccine delivery mechanism

Assignee Filter (assignee)

Narrow results to patents filed by a specific company or organization:

assignee: "Apple"
assignee: "Samsung Electronics"
assignee: "MIT"
assignee: "Pfizer"
assignee: "Toyota Motor"

Inventor Filter (inventor)

Find all patents listing a specific person as inventor:

inventor: "Jensen Huang"
inventor: "Demis Hassabis"
inventor: "Frances Arnold"

Country Filter (country)

Filter by patent office using standard 2-letter codes:

Code | Patent Office
US | United States Patent and Trademark Office
EP | European Patent Office
WO | World Intellectual Property Organization (PCT)
CN | China National Intellectual Property Administration
JP | Japan Patent Office
KR | Korean Intellectual Property Office
DE | German Patent and Trade Mark Office
GB | UK Intellectual Property Office
AU | IP Australia
CA | Canadian Intellectual Property Office

📚 Patent ID Formats

This Google Patents scraper accepts and auto-normalizes all major patent ID formats:

Format | Example | Country
US granted patent | US10000000B2 | United States
US application | US20230123456A1 | United States
European patent | EP1234567A1 | European Patent Office
PCT application | WO2023123456A1 | International (WIPO)
China patent | CN112345678A | China
Japan patent | JP2023123456A | Japan
Korea patent | KR102345678B1 | South Korea
Germany patent | DE102021123456A1 | Germany
UK patent | GB2601234A | United Kingdom
Australia patent | AU2023123456A1 | Australia
Bare US number | 10000000 | Auto-prefixed as US10000000
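
The normalization described here can be approximated in a few lines of Python. This is a sketch under stated assumptions, not the actor's implementation: the function names are hypothetical, only spaces and dashes are stripped, and a bare number is assumed to be a US patent, as the last table row indicates. The URL pattern follows the Source_URL example in the output reference.

```python
import re


def normalize_patent_id(raw: str) -> str:
    """Strip spaces and dashes, uppercase, and prefix bare numbers with US."""
    cleaned = re.sub(r"[\s\-]+", "", raw).upper()
    if cleaned.isdigit():
        return "US" + cleaned
    return cleaned


def country_code(patent_id: str) -> str:
    """Read the 2-letter office code from a normalized patent ID."""
    prefix = patent_id[:2]
    if len(prefix) != 2 or not prefix.isalpha():
        raise ValueError(f"No country prefix in {patent_id!r}")
    return prefix


def source_url(patent_id: str) -> str:
    """Build the detail-page URL in the format shown for Source_URL."""
    return "https://patents.google.com/patent/" + patent_id
```

For example, `normalize_patent_id("10000000")` yields `US10000000`, matching the auto-prefix behavior listed in the table.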

🗂️ IPC & CPC Classification Guide

International Patent Classification (IPC)

The IPC is a hierarchical classification system used by patent offices worldwide. Each code identifies the technology field of a patent:

Code Prefix | Technology Domain
A | Human necessities (food, clothing, personal items, health)
B | Performing operations; transporting
C | Chemistry; metallurgy
D | Textiles; paper
E | Fixed constructions
F | Mechanical engineering; lighting; heating; weapons
G | Physics
H | Electricity

Example: G06N3/04 = Physics → Computing → Neural networks → Architecture

Cooperative Patent Classification (CPC)

The CPC is a more granular extension of the IPC used by the USPTO and EPO. It has finer subdivision levels and is particularly useful for precise technology landscape mapping.

Both IPC and CPC codes are extracted separately by this Google Patents scraper, giving you flexibility to use either system for your research or analytics.
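
For landscape mapping, the first letter of each code is often enough to bucket patents by broad domain. The Python sketch below maps IPC codes to the section names from the table above and tallies them across records; the helper names are hypothetical, and the records are assumed to carry the `IPC_Classification` array described in the output reference.

```python
from collections import Counter
from typing import Any

# Section letters and names as listed in the IPC table above (shortened).
IPC_SECTIONS = {
    "A": "Human necessities",
    "B": "Performing operations; transporting",
    "C": "Chemistry; metallurgy",
    "D": "Textiles; paper",
    "E": "Fixed constructions",
    "F": "Mechanical engineering; lighting; heating; weapons",
    "G": "Physics",
    "H": "Electricity",
}


def ipc_section(code: str) -> str:
    """Return the section name for an IPC code such as 'G06N3/04'."""
    return IPC_SECTIONS.get(code.strip()[:1].upper(), "Unknown")


def section_counts(records: list[dict[str, Any]]) -> Counter:
    """Count IPC codes per section across all records."""
    counts: Counter = Counter()
    for record in records:
        for code in record.get("IPC_Classification") or []:
            counts[ipc_section(code)] += 1
    return counts
```

The same approach works for CPC codes whose first letter is one of A to H; other CPC prefixes fall into the "Unknown" bucket in this sketch.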


🌐 Proxy Configuration

⚠️ Residential Proxy Is Required for Google Patents

Google aggressively blocks datacenter IP addresses from automated access. A residential proxy is mandatory for this Google Patents scraper to work reliably.

{
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Without a residential proxy, Google Patents will return 403 Forbidden or CAPTCHA pages after just 1–2 requests. With a residential proxy, the scraper can reliably fetch dozens or hundreds of patent records per run.

The actor applies a 2–4 second random delay between requests to Google Patents to respect rate limits and avoid triggering additional blocking.
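
A randomized delay of this kind is straightforward to reproduce in client code. The Python sketch below is illustrative only: `fetch_with_delay` and its parameters are hypothetical, and the `fetch` callable stands in for whatever performs the actual request.

```python
import random
import time
from typing import Callable, Iterable, TypeVar

T = TypeVar("T")


def fetch_with_delay(
    patent_ids: Iterable[str],
    fetch: Callable[[str], T],
    sleep: Callable[[float], None] = time.sleep,
    min_delay: float = 2.0,
    max_delay: float = 4.0,
) -> list[T]:
    """Call fetch for each ID, pausing a random 2-4 s between requests."""
    results: list[T] = []
    for index, patent_id in enumerate(patent_ids):
        if index > 0:
            # No pause before the first request, only between requests.
            sleep(random.uniform(min_delay, max_delay))
        results.append(fetch(patent_id))
    return results
```

Passing `sleep` as a parameter keeps the function testable without real waiting.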


⚡ Performance & Speed

Speed Benchmarks (with residential proxy)

Mode | Patents | Full Details | Estimated Time
Patent ID lookup | 1 | Yes | ~15–30 seconds
Patent ID lookup | 10 | Yes | ~3–5 minutes
Patent ID lookup | 20 | Yes | ~6–10 minutes
Search query | 10 results | No | ~30–60 seconds
Search query | 10 results | Yes (full detail) | ~4–6 minutes
Search query | 20 results | Yes (full detail) | ~8–12 minutes

fetch_full_details Impact

Setting fetch_full_details: true (recommended) visits each patent's individual page, which adds claims, description, legal status, citation counts, and classification codes that are not available in search results. Per the benchmarks above, this lengthens a 10-result search from roughly 30–60 seconds to 4–6 minutes, in exchange for far more complete records.

Set fetch_full_details: false for faster runs when you only need basic fields (title, abstract, assignee, dates) from search results.


❓ FAQ

Q: Do I need a Google account or API key? A: No. Google Patents is a public search engine — no account, no API key, and no authentication is required. This Google Patents scraper accesses only public patent information.

Q: Why is a residential proxy required? A: Google blocks datacenter IP addresses from automated access. Residential proxies use real home internet connections that Google cannot distinguish from normal user traffic. Without a residential proxy, Google returns 403 or CAPTCHA after just a few requests.

Q: Can I extract the full claims text for a patent? A: Yes. The Claims field contains the claims text, capped at 5,000 characters, so very long claim sets may be truncated. Claims are the legally binding part of a patent and define exactly what is protected. The Claims_Count field tells you how many individual claims the patent has.

Q: What is the difference between IPC and CPC classifications? A: IPC (International Patent Classification) is the global standard system used by patent offices worldwide. CPC (Cooperative Patent Classification) is a finer-grained extension of the IPC used by the USPTO and EPO. Both are extracted by this scraper; use whichever system your research or database requires.

Q: Can I monitor a company's patent filings automatically? A: Yes — set assignee: "CompanyName" and schedule this Google Patents scraper to run weekly. Use Apify webhooks to send new patent alerts to Slack or email automatically.

Q: What does fetch_full_details: false do? A: In search mode, it returns only the data available on the search results page — title, abstract snippet, assignee, publication date. It is much faster but does not include claims, legal status, citation counts, or classification codes. Use true for complete records.

Q: Can I look up European (EP) or PCT (WO) patents? A: Yes — provide the full patent ID including the country prefix: EP1234567A1, WO2023123456A1. The country is auto-detected from the prefix and the Google Patents URL is constructed automatically.

Q: How many patents can I extract per run? A: There is no hard limit. For large runs (100+ patents), we recommend batching into groups of 20–30 per run for reliability. The 2–4 second delay between requests helps avoid rate limiting.

Q: What is the 2-hour free trial? A: The free trial gives you complete access to all features — both modes, all filters, all output fields, full claims and classification extraction — for 2 hours with no credit card required.

Q: Can I use this for freedom-to-operate (FTO) analysis? A: This scraper extracts the claims text which is the starting point for FTO analysis. However, FTO analysis requires legal interpretation of claims by a qualified patent attorney — this tool provides the data, not the legal analysis.
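
The batching advice for large runs can be scripted. The Python sketch below splits a long ID list into groups and builds one run input per group using the documented parameters; the helper names and the default batch size of 25 (inside the recommended 20–30 range) are assumptions.

```python
from typing import Any


def batches(patent_ids: list[str], size: int = 25) -> list[list[str]]:
    """Split a list of patent IDs into consecutive groups of at most size."""
    if size < 1:
        raise ValueError("size must be at least 1")
    return [patent_ids[i:i + size] for i in range(0, len(patent_ids), size)]


def run_inputs(patent_ids: list[str], size: int = 25) -> list[dict[str, Any]]:
    """Build one actor input per batch, mirroring the documented parameters."""
    return [
        {
            "patent_ids": chunk,
            "fetch_full_details": True,
            "max_results": len(chunk),
            "proxyConfiguration": {
                "useApifyProxy": True,
                "apifyProxyGroups": ["RESIDENTIAL"],
            },
        }
        for chunk in batches(patent_ids, size)
    ]
```

Each dictionary can then be submitted as the input of a separate run.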


📜 Changelog

v1.0.0 (Current)

  • ✅ Patent ID lookup mode — full detail extraction for any patent ID (US, EP, WO, CN, JP, KR, DE, GB)
  • ✅ Patent ID auto-normalization — bare numbers, missing prefixes, formatting variants all handled
  • ✅ Search mode — keyword, assignee, inventor, country, and date filters
  • ✅ Full claims text extraction (up to 5,000 characters)
  • ✅ Claims count — number of individual claims counted automatically
  • ✅ Abstract extraction (up to 3,000 characters)
  • ✅ Description excerpt (up to 3,000 characters)
  • ✅ IPC and CPC classification codes — both systems extracted independently
  • ✅ Legal status extraction (Active / Expired / Pending)
  • ✅ Citation count: patents cited + times cited by others
  • ✅ Patent family members list and count
  • ✅ Multi-layer fallback parsing — itemprop schema → CSS selectors → data tables
  • ✅ fetch_full_details mode for search results
  • ✅ 2–4 second rate-limit-safe delay between requests
  • ✅ "ENTERPRISE" data grade label on all records
  • ✅ Extraction_Status flag: "Verified ✅" or "Failed ❌"

This Google Patents scraper accesses publicly available patent information from Google Patents — the same data visible to anyone visiting the site in a browser. Patent documents are public domain in most jurisdictions once published.

Usage guidelines:

  • Use extracted patent data for legitimate IP research, legal due diligence, academic research, and business intelligence
  • Patent documents (text, claims, drawings) are generally public domain once published — consult a patent attorney for jurisdiction-specific guidance
  • Do not use this tool to scrape at volumes that impact Google Patents' services or circumvent access controls
  • This tool extracts publicly available bibliographic data — it does not provide legal advice on patent validity, infringement, or freedom-to-operate

🤝 Support & Feedback

  • Bug or extraction issue? Contact via the Apify actor page — we respond fast
  • Need a new field or filter? Request via the Apify Community forum
  • Loving it? Please leave a ⭐ review — it helps other IP professionals find this Google Patents scraper!

🆓 2-hour free trial → 💳 $8.99/month after — enterprise-grade patent data from Google Patents at the lowest price available.


Built with ❤️ on Apify · Google Patents Scraper — Claims, Classifications & Full Patent Records
Extract complete patent data from Google Patents — no API key, no quotas, no limits