Rockwell Datasheet PDF Parser — Specs JSON
Pricing
from $13.00 / 1,000 parsed datasheet pdfs
Rockwell Datasheet PDF Parser — Specs JSON
Parse Allen-Bradley technical PDFs into structured JSON: specification key-values, technical tables, limit values, and catalog-number row matches. Browser-free; chain after Document Downloader or parse direct literature URLs.
Pricing
from $13.00 / 1,000 parsed datasheet pdfs
Rating
0.0
(0)
Developer
Andrej Kiva
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
6 hours ago
Last modified
Categories
Share
Crawloop Rockwell Automation Suite — Structured data extraction for Rockwell Automation and Allen-Bradley hardware catalog. Built for procurement teams, system integrators, and BOM engineering workflows.
Suite hub: github.com/PLCSPS-DEV/rockwell-automation
Product site: crawloop.com/rockwell-automation
| Discovery | Enrichment | Documents | PDF parsing |
|---|---|---|---|
| Full Catalog Crawler | Product Scraper | Document Downloader | Datasheet Parser |
| Lifecycle Tracker |
Disclaimer: This is an unofficial integration developed independently of Rockwell Automation Inc. It is not affiliated with, sponsored by, or endorsed by Rockwell Automation Inc. or any of its subsidiaries.
Rockwell Automation, Allen-Bradley, and related names are trademarks of Rockwell Automation Inc. Product data is read from publicly accessible Rockwell web sources only; no proprietary databases are redistributed.
This Actor is provided for informational and research purposes only (e.g. procurement research, BOM audits, internal engineering workflows). You are solely responsible for ensuring your use complies with applicable laws, Rockwell website terms of use, and your organization's policies.
No warranty is given as to accuracy, completeness, or continued availability of third-party data. Use at your own risk.
Parse-only extractor for Allen-Bradley technical PDFs from literature.rockwellautomation.com. Reads PDF bytes from a Key-Value Store or direct URLs and outputs structured specifications, technical tables, and catalog-number row matches. No browser, no Rockwell session, no proxy.
Pair with the Document Downloader in a two-step pipeline: download TD PDFs first, then parse with this Actor.
For web PDP specifications without PDF parsing, use the Product Scraper instead.
When to use this Actor
Use the Datasheet Parser when you have Allen-Bradley technical PDFs in Key-Value Store (from the Document Downloader) or as direct literature URLs, and need structured JSON specs extracted from the PDF content — especially catalog-number row matches in multi-SKU TD documents.
Use the Product Scraper for web API specifications without PDF download.
Rockwell Automation Pipeline
Phase 1 — Discover SKUs Phase 2 — Screen & enrich Phase 3 — Documents & specs───────────────────────── ───────────────────────── ─────────────────────────────Full Catalog Crawler ──┐├──► catalogNumber list ──► Lifecycle Tracker ──► Product Scraper│┌──────┴──────────────────────────────────────────────────────────────────┐│ │▼ ▼Document Downloader Datasheet Parser ◄── you are herePDFs to Key-Value Store specs from TD PDFs
Key Features
- Parse-only — No browser; reads PDF bytes directly from Key-Value Store or URL.
- TD datasheet focus — Optimized for Allen-Bradley technical data (TD) PDFs.
- Specification extraction — Key-value pairs, technical tables, and limit values.
- Catalog row matching — Finds selection-table rows for a specific catalog number in multi-SKU TD PDFs.
- Optional diagrams — Extract large page images to Key-Value Store (
pymupdf). - Parallel processing — Configurable concurrency for bulk parsing runs.
Input Parameters
| Parameter | Description | Default |
|---|---|---|
keyValueStoreId | Required when using items or keyValueStoreKeys. Store ID from the downloader run. | — |
items | Downloader dataset rows (catalogNumber, keyValueStoreKey, fileName). | — |
keyValueStoreKeys | PDF keys only (alternative to items). | — |
catalogNumber | Default SKU for row matching in multi-SKU TD PDFs. | — |
extractDiagrams | Save large page images to Key-Value Store. | false |
maxConcurrency | Parallel PDF parsers. | 3 |
Input Example — from Document Downloader
{"keyValueStoreId": "YOUR_DOWNLOADER_STORE_ID","items": [{"catalogNumber": "140MT-F9E-C38","documentType": "TECHNICAL_DATA","keyValueStoreKey": "140MT-F9E-C38_140-td005_-en-p_140-td005_-en-p.pdf","fileName": "140-td005_-en-p.pdf"}],"extractDiagrams": false,"maxConcurrency": 3}
Input Example — direct literature URL
{"items": [{"catalogNumber": "140MT-F9E-C38","url": "https://literature.rockwellautomation.com/idc/groups/literature/documents/td/140-td005_-en-p.pdf"}]}
Output Format
{"catalogNumber": "140MT-F9E-C38","fileName": "140-td005_-en-p.pdf","documentType": "TECHNICAL_DATA","status": "PARSED","specificationCount": 42,"tableCount": 88,"catalogMatchCount": 1,"pageCount": 110,"specifications": {"Catalog Number": "140MT-F9E-C38","Bulletin": "140MT"},"catalogMatches": [{"page": 29,"tableRow": 8,"catalogNumber": "140MT-F9E-C38","cells": ["38", "30-38", "570", "...", "140MT-F9E-C38"]}],"parsedAt": "2026-06-19T12:00:00+00:00"}
Status values
status | Meaning |
|---|---|
PARSED | Specs and tables extracted successfully |
PARTIAL | Some content extracted |
METADATA_ONLY | PDF had little extractable structure |
FAILED | Invalid PDF or Key-Value Store key missing |
Typical Workflow
Document Downloader → PDFs in Key-Value Store ($5/1k)│▼Datasheet Parser → structured specs ($10/1k)
Actor Comparison
| Task | Datasheet Parser | Document Downloader | Product Scraper |
|---|---|---|---|
| Parse TD catalog PDFs | Yes | No | No |
| Download PDFs | No | Yes | No |
| Web API specifications | No | No | Yes |
| Catalog row matching in TD PDFs | Yes | No | No |
| Browser required | No | No | No |
Pricing
Pay-per-event billing. Charged on PARSED and PARTIAL only; FAILED rows are not billed.
Current Store setup (live — verify PPE events after 18 July 2026 if adjusting prices):
| Event | Price |
|---|---|
| Actor start | $0.05 per run |
Parsed datasheet PDF (parsed-datasheet-pdf) | $0.01 per PDF ($10.00 / 1,000) |
Recommended chain cost: ~$15/1,000 fully parsed technical datasheets (download + parse).
See ../../docs/rockwell_ppe_july2026.md for Store setup and code verification steps.
Learn more: Product page · Suite hub · GitHub docs
Also from Crawloop Industrial: Siemens SiePortal Suite · GitHub docs