Rockwell Datasheet PDF Parser — Specs JSON avatar

Rockwell Datasheet PDF Parser — Specs JSON

Pricing

from $13.00 / 1,000 parsed datasheet pdfs

Go to Apify Store
Rockwell Datasheet PDF Parser — Specs JSON

Rockwell Datasheet PDF Parser — Specs JSON

Parse Allen-Bradley technical PDFs into structured JSON: specification key-values, technical tables, limit values, and catalog-number row matches. Browser-free; chain after Document Downloader or parse direct literature URLs.

Pricing

from $13.00 / 1,000 parsed datasheet pdfs

Rating

0.0

(0)

Developer

Andrej Kiva

Andrej Kiva

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

7 hours ago

Last modified

Share

Crawloop Rockwell Automation Suite — Structured data extraction for Rockwell Automation and Allen-Bradley hardware catalog. Built for procurement teams, system integrators, and BOM engineering workflows.

Suite hub: github.com/PLCSPS-DEV/rockwell-automation

Product site: crawloop.com/rockwell-automation

DiscoveryEnrichmentDocumentsPDF parsing
Full Catalog CrawlerProduct ScraperDocument DownloaderDatasheet Parser
Lifecycle Tracker

Disclaimer: This is an unofficial integration developed independently of Rockwell Automation Inc. It is not affiliated with, sponsored by, or endorsed by Rockwell Automation Inc. or any of its subsidiaries.

Rockwell Automation, Allen-Bradley, and related names are trademarks of Rockwell Automation Inc. Product data is read from publicly accessible Rockwell web sources only; no proprietary databases are redistributed.

This Actor is provided for informational and research purposes only (e.g. procurement research, BOM audits, internal engineering workflows). You are solely responsible for ensuring your use complies with applicable laws, Rockwell website terms of use, and your organization's policies.

No warranty is given as to accuracy, completeness, or continued availability of third-party data. Use at your own risk.

Parse-only extractor for Allen-Bradley technical PDFs from literature.rockwellautomation.com. Reads PDF bytes from a Key-Value Store or direct URLs and outputs structured specifications, technical tables, and catalog-number row matches. No browser, no Rockwell session, no proxy.

Pair with the Document Downloader in a two-step pipeline: download TD PDFs first, then parse with this Actor.

For web PDP specifications without PDF parsing, use the Product Scraper instead.

When to use this Actor

Use the Datasheet Parser when you have Allen-Bradley technical PDFs in Key-Value Store (from the Document Downloader) or as direct literature URLs, and need structured JSON specs extracted from the PDF content — especially catalog-number row matches in multi-SKU TD documents.

Use the Product Scraper for web API specifications without PDF download.

Rockwell Automation Pipeline

Phase 1 — Discover SKUs Phase 2 — Screen & enrich Phase 3 — Documents & specs
───────────────────────── ───────────────────────── ─────────────────────────────
Full Catalog Crawler ──┐
├──► catalogNumber list ──► Lifecycle Tracker ──► Product Scraper
┌──────┴──────────────────────────────────────────────────────────────────┐
│ │
▼ ▼
Document Downloader Datasheet Parser ◄── you are here
PDFs to Key-Value Store specs from TD PDFs

Key Features

  • Parse-only — No browser; reads PDF bytes directly from Key-Value Store or URL.
  • TD datasheet focus — Optimized for Allen-Bradley technical data (TD) PDFs.
  • Specification extraction — Key-value pairs, technical tables, and limit values.
  • Catalog row matching — Finds selection-table rows for a specific catalog number in multi-SKU TD PDFs.
  • Optional diagrams — Extract large page images to Key-Value Store (pymupdf).
  • Parallel processing — Configurable concurrency for bulk parsing runs.

Input Parameters

ParameterDescriptionDefault
keyValueStoreIdRequired when using items or keyValueStoreKeys. Store ID from the downloader run.
itemsDownloader dataset rows (catalogNumber, keyValueStoreKey, fileName).
keyValueStoreKeysPDF keys only (alternative to items).
catalogNumberDefault SKU for row matching in multi-SKU TD PDFs.
extractDiagramsSave large page images to Key-Value Store.false
maxConcurrencyParallel PDF parsers.3

Input Example — from Document Downloader

{
"keyValueStoreId": "YOUR_DOWNLOADER_STORE_ID",
"items": [
{
"catalogNumber": "140MT-F9E-C38",
"documentType": "TECHNICAL_DATA",
"keyValueStoreKey": "140MT-F9E-C38_140-td005_-en-p_140-td005_-en-p.pdf",
"fileName": "140-td005_-en-p.pdf"
}
],
"extractDiagrams": false,
"maxConcurrency": 3
}

Input Example — direct literature URL

{
"items": [
{
"catalogNumber": "140MT-F9E-C38",
"url": "https://literature.rockwellautomation.com/idc/groups/literature/documents/td/140-td005_-en-p.pdf"
}
]
}

Output Format

{
"catalogNumber": "140MT-F9E-C38",
"fileName": "140-td005_-en-p.pdf",
"documentType": "TECHNICAL_DATA",
"status": "PARSED",
"specificationCount": 42,
"tableCount": 88,
"catalogMatchCount": 1,
"pageCount": 110,
"specifications": {
"Catalog Number": "140MT-F9E-C38",
"Bulletin": "140MT"
},
"catalogMatches": [
{
"page": 29,
"tableRow": 8,
"catalogNumber": "140MT-F9E-C38",
"cells": ["38", "30-38", "570", "...", "140MT-F9E-C38"]
}
],
"parsedAt": "2026-06-19T12:00:00+00:00"
}

Status values

statusMeaning
PARSEDSpecs and tables extracted successfully
PARTIALSome content extracted
METADATA_ONLYPDF had little extractable structure
FAILEDInvalid PDF or Key-Value Store key missing

Typical Workflow

Document Downloader → PDFs in Key-Value Store ($5/1k)
Datasheet Parser → structured specs ($10/1k)

Actor Comparison

TaskDatasheet ParserDocument DownloaderProduct Scraper
Parse TD catalog PDFsYesNoNo
Download PDFsNoYesNo
Web API specificationsNoNoYes
Catalog row matching in TD PDFsYesNoNo
Browser requiredNoNoNo

Pricing

Pay-per-event billing. Charged on PARSED and PARTIAL only; FAILED rows are not billed.

Current Store setup (live — verify PPE events after 18 July 2026 if adjusting prices):

EventPrice
Actor start$0.05 per run
Parsed datasheet PDF (parsed-datasheet-pdf)$0.01 per PDF ($10.00 / 1,000)

Recommended chain cost: ~$15/1,000 fully parsed technical datasheets (download + parse).

See ../../docs/rockwell_ppe_july2026.md for Store setup and code verification steps.


Learn more: Product page · Suite hub · GitHub docs

Also from Crawloop Industrial: Siemens SiePortal Suite · GitHub docs