Siemens Datasheet PDF Parser — TED Specs
Pricing
from $10.00 / 1,000 parsed product datasheets
Parse-only: extracts structured specifications, technical tables, and limit values from official Siemens TED product datasheet PDFs stored in a Key-Value Store. No browser required. Chain it after the TED Datasheet Downloader.
Developer
Andrej Kiva
Maintained by Community
Crawloop Siemens Automation Suite — Structured data extraction for Siemens SiePortal (Industry Mall), SIOS, and TED product datasheets. Built for procurement teams, system integrators, and BOM engineering workflows.
Suite hub: github.com/PLCSPS-DEV/siemens-sieportal-automation
| Discovery | Enrichment | SIOS documents | TED datasheets |
|---|---|---|---|
| Catalog Crawler | SiePortal Scraper | Document Downloader | TED Datasheet Downloader |
| Category Scraper | Lifecycle Tracker | Document PDF Parser | TED Datasheet Parser |
Disclaimer: This is an unofficial integration developed independently of Siemens AG. It is not affiliated with, sponsored by, or endorsed by Siemens AG or any of its subsidiaries.
Siemens, SiePortal, SIMATIC, and related names are trademarks of Siemens AG. Product data is read from publicly accessible Siemens web sources only; no proprietary databases are redistributed.
This Actor is provided for informational and research purposes only (e.g. procurement research, BOM audits, internal engineering workflows). You are solely responsible for ensuring your use complies with applicable laws, Siemens website terms of use, and your organization's policies.
No warranty is given as to accuracy, completeness, or continued availability of third-party data. Use at your own risk.
Parse-only extractor for official Siemens TED product data sheet PDFs. Reads PDF bytes from a Key-Value Store and outputs structured specifications, technical tables, and limit values. No browser, no SiePortal session, no proxy.
Pair with the TED Datasheet Downloader in a two-step pipeline: download TED PDFs first, then parse with this Actor.
For SIOS equipment manuals and certificates, use the Document PDF Parser instead. For combined discover-and-parse of SIOS documents, see the legacy Document PDF Parser discover mode.
When to use this Actor
Use the TED Datasheet Parser when you already have TED product datasheet PDFs in a Key-Value Store (from the TED Datasheet Downloader) and need structured JSON specs extracted from them.
Use the Document PDF Parser for SIOS manuals, certificates, and brochures.
Siemens Automation Pipeline
    Phase 1: Discover MPNs       Phase 2: Screen & enrich     Phase 3: Documents & specs
    ─────────────────────────    ─────────────────────────    ─────────────────────────────
    Catalog Crawler  ──┐
                       ├──► MPN list ──► Lifecycle Tracker ──► SiePortal Scraper
    Category Scraper ──┘                                               │
                                                                       │
                                          ┌────────────────────────────┴────────────────────────────┐
                                          │                                                         │
                                          ▼                                                         ▼
                             Document Downloader (SIOS)                                 TED Datasheet Downloader
                             certificates, manuals, CAD                                 compact catalog PDFs
                                          │                                                         │
                                          ▼                                                         ▼
                             Document PDF Parser                                        TED Datasheet Parser ◄── you are here
                             specs from SIOS PDFs                                       specs from TED PDFs
Key Features
- Parse-only — No browser; reads PDF bytes directly from Key-Value Store.
- TED datasheet focus — Optimized for compact Industry Mall product data sheet PDFs.
- Specification extraction — Key-value pairs, technical tables, and limit values.
- Wiring diagrams — Optional image extraction from PDF pages to Key-Value Store.
- Parallel processing — Configurable concurrency for bulk parsing runs.
Input Parameters
| Parameter | Description | Default |
|---|---|---|
| `keyValueStoreId` | Required when using `items` or `keyValueStoreKeys`. Store ID from the downloader run. | — |
| `items` | Downloader dataset rows (`partNumber`, `keyValueStoreKey`, `fileName`). | — |
| `keyValueStoreKeys` | PDF keys only (alternative to `items`). | — |
| `extractDiagrams` | Save large page images to Key-Value Store. | `false` |
| `maxConcurrency` | Parallel PDF parsers. | `3` |
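The constraints in the table can be checked before a run is started. The following is a minimal sketch, not the Actor's own validation: `validate_parser_input` is a hypothetical helper, and the rules that `maxConcurrency` must be a positive integer and that at least one of `items` or `keyValueStoreKeys` must be present are assumptions rather than documented behavior.

```python
from typing import Any


def validate_parser_input(payload: dict[str, Any]) -> list[str]:
    """Return a list of problems found in a parser input payload (empty if none)."""
    problems: list[str] = []
    has_items = bool(payload.get("items"))
    has_keys = bool(payload.get("keyValueStoreKeys"))

    # Documented rule: keyValueStoreId is required with items or keyValueStoreKeys.
    if (has_items or has_keys) and not payload.get("keyValueStoreId"):
        problems.append("keyValueStoreId is required when items or keyValueStoreKeys is set")

    # Assumption: a payload with neither list has nothing to parse.
    if not has_items and not has_keys:
        problems.append("provide either items or keyValueStoreKeys")

    # Assumption: maxConcurrency must be a positive integer (documented default: 3).
    concurrency = payload.get("maxConcurrency", 3)
    if isinstance(concurrency, bool) or not isinstance(concurrency, int) or concurrency < 1:
        problems.append("maxConcurrency must be a positive integer")

    # Each items row is documented as carrying partNumber, keyValueStoreKey, fileName;
    # only the key is checked here because it is what locates the PDF.
    for index, row in enumerate(payload.get("items") or []):
        if not row.get("keyValueStoreKey"):
            problems.append(f"items[{index}] is missing keyValueStoreKey")

    return problems
```

An empty list means the payload passed these checks; the Actor itself may apply stricter or different rules.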
Input Example
    {
      "keyValueStoreId": "YOUR_TED_DOWNLOADER_STORE_ID",
      "items": [
        {
          "partNumber": "3SU1950-0FP80-0AA0-Z Y05",
          "keyValueStoreKey": "3SU1950-0FP80-0AA0-Z Y05_en_datasheet.pdf",
          "fileName": "3SU1950-0FP80-0AA0-Z Y05_en_datasheet.pdf"
        }
      ],
      "extractDiagrams": false,
      "maxConcurrency": 3
    }
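An input like the one above can be assembled from the downloader's dataset rows. This is a sketch under assumptions: `build_parser_input` is a hypothetical helper, the rows are taken to be plain dictionaries with the three documented fields, and skipping rows that lack a `keyValueStoreKey` (for example, failed downloads) is an assumed policy, not documented Actor behavior.

```python
from typing import Any, Iterable


def build_parser_input(
    store_id: str,
    downloader_rows: Iterable[dict[str, Any]],
    extract_diagrams: bool = False,
    max_concurrency: int = 3,
) -> dict[str, Any]:
    """Turn downloader dataset rows into a parser input payload."""
    items: list[dict[str, Any]] = []
    seen_keys: set[str] = set()

    for row in downloader_rows:
        key = row.get("keyValueStoreKey")
        # Assumption: a row without a key has no stored PDF, so it is skipped.
        # Duplicate keys are dropped so the same PDF is not parsed twice.
        if not key or key in seen_keys:
            continue
        seen_keys.add(key)
        items.append(
            {
                "partNumber": row.get("partNumber"),
                "keyValueStoreKey": key,
                # Assumption: fall back to the key when fileName is absent.
                "fileName": row.get("fileName", key),
            }
        )

    return {
        "keyValueStoreId": store_id,
        "items": items,
        "extractDiagrams": extract_diagrams,
        "maxConcurrency": max_concurrency,
    }
```

Serializing the returned dictionary with `json.dumps` yields a payload in the same shape as the example.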
Output Format
Status values
| `status` | Meaning |
|---|---|
PARSED | Specs and tables extracted successfully |
PARTIAL | Some content extracted |
METADATA_ONLY | PDF had little extractable structure |
FAILED | Invalid PDF or Key-Value Store key missing |
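After a run, the status values can be tallied to see which datasheets need a second look. The sketch below assumes each output row is a dictionary with a `status` field and a `partNumber` field; `partNumber` in the output is an assumption, and `summarize_statuses` is a hypothetical helper.

```python
from collections import Counter
from typing import Any


def summarize_statuses(rows: list[dict[str, Any]]) -> tuple[Counter, list[Any]]:
    """Count rows per status and list part numbers that were not fully parsed."""
    # Rows without a status field are counted under "UNKNOWN" (assumption).
    counts = Counter(row.get("status", "UNKNOWN") for row in rows)
    # Anything other than PARSED (PARTIAL, METADATA_ONLY, FAILED) is flagged for review.
    needs_review = [row.get("partNumber") for row in rows if row.get("status") != "PARSED"]
    return counts, needs_review
```

`FAILED` rows usually point to a missing key or an invalid PDF, so re-running the downloader for those part numbers is a reasonable first step.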
Typical Workflow
    TED Datasheet Downloader → PDFs in Key-Value Store ($2/1k)
                │
                ▼
    TED Datasheet Parser → structured specs ($2/1k)
Actor Comparison
| Task | TED Datasheet Parser | TED Datasheet Downloader | Document PDF Parser |
|---|---|---|---|
| Parse TED catalog PDFs | Yes | No | No |
| Parse SIOS/manual PDFs | No | No | Yes |
| Download PDFs | No | Yes | No |
| Browser required | No | No | Parse-only: No |
Pricing
Pay-per-event billing. Charged on PARSED and PARTIAL only.
| Event | Price |
|---|---|
| `parsed-datasheet-pdf` | $2.00 / 1,000 |
| `apify-actor-start` | $0.00005 |
Recommended chain cost: ~$4/1,000 fully parsed TED datasheets (download + parse).
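The parse-step cost can be estimated from the event table. This is a rough sketch: `estimate_parse_cost` is a hypothetical helper, it assumes one `apify-actor-start` event per run, and it ignores any platform charges not listed in the table.

```python
from decimal import Decimal

# Prices taken from the event table above.
PRICE_PER_PARSED_PDF = Decimal("2.00") / Decimal(1000)
PRICE_PER_ACTOR_START = Decimal("0.00005")
# Only these statuses are charged, per the billing note above.
BILLABLE_STATUSES = {"PARSED", "PARTIAL"}


def estimate_parse_cost(status_counts: dict[str, int], actor_starts: int = 1) -> Decimal:
    """Estimate the parse-step cost in USD from per-status row counts."""
    billable = sum(
        count for status, count in status_counts.items() if status in BILLABLE_STATUSES
    )
    # Assumption: one apify-actor-start event is charged per run.
    return billable * PRICE_PER_PARSED_PDF + actor_starts * PRICE_PER_ACTOR_START
```

For example, 900 `PARSED`, 50 `PARTIAL`, and 50 `FAILED` rows in a single run give 950 billable PDFs, so the estimate is 950 × $0.002 + $0.00005 = $1.90005. The download step is billed separately by the TED Datasheet Downloader.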