Dutch Energy Contract PDF Extractor
Pricing
from $20.00 / 1,000 results
Extracts structured data from Dutch energy contract PDFs (direct PDF or Dropbox file/folder links): customer & address details, contract term, electricity/gas tariffs, totals, cashback. Supports Energiedirect.nl, Budget Energie, NLE, Essent, Huismerk Energie. Text PDFs only, no OCR.
Turn batches of Dutch energy contract PDFs into clean, structured data. Paste a direct PDF link or a Dropbox link — a single PDF or an entire folder — and this Actor downloads every PDF, parses each contract, and returns one tidy record per file. No opening PDFs by hand, no copy-pasting customer details, tariffs, or IBANs.
Tuned for the contract layouts of Energiedirect.nl, Budget Energie, NLE, Essent, and Huismerk Energie.
Built and maintained by Alkausari M.
✦ Highlights
- 📥 Direct PDF & Dropbox links — direct `.pdf` URLs, single Dropbox files, or whole Dropbox folders (downloaded as a zip and unpacked automatically)
- 🇳🇱 Dutch supplier-aware — detects the supplier and parses personal data, address, contract terms, electricity & gas tariffs
- 📊 Any format — JSON, CSV, Excel, XML, or live via the Apify API
- 🗂 One row per PDF — 28 normalized fields, ready for a CRM, spreadsheet, or database
- ⚡ Batch in one run — drop a folder link and process dozens or hundreds of contracts at once
- ⏰ Automate it — schedule recurring runs or trigger via API when new contracts land in your shared folder
⚙ How it works
- Share your PDFs — host them at a publicly accessible URL, or put them in a Dropbox location shared as "Anyone with the link."
- Paste the link(s) — any mix of direct PDF URLs and Dropbox file/folder links.
- Click Start — downloading, unzipping, and parsing are handled for you.
- Download or pipe — grab results as JSON, CSV, Excel, or pull from the API.
Example input, direct PDF links:

```json
{
  "source_urls": [
    "https://example.com/contracts/contract1.pdf",
    "https://example.com/contracts/contract2.pdf"
  ]
}
```

Example input, a Dropbox folder:

```json
{
  "source_urls": [
    "https://www.dropbox.com/scl/fo/abc123/def456?rlkey=xxxx&dl=1"
  ]
}
```
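The same input can also be sent programmatically. The following is a minimal sketch in Python, assuming the official `apify-client` package; `build_input` and `run_extractor` are hypothetical helper names, and the token and Actor ID are placeholders rather than values taken from this page.

```python
from typing import Any, Iterable


def build_input(urls: Iterable[str]) -> dict:
    """Build the Actor input: a de-duplicated list under `source_urls`."""
    seen = []
    for url in urls:
        url = url.strip()
        if url and url not in seen:
            seen.append(url)
    if not seen:
        raise ValueError("source_urls is required and must not be empty")
    return {"source_urls": seen}


def run_extractor(client: Any, actor_id: str, urls: Iterable[str]) -> list:
    """Start a run, wait for it to finish, and return one record per PDF.

    `client` is expected to behave like apify_client.ApifyClient.
    """
    run = client.actor(actor_id).call(run_input=build_input(urls))
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (needs `pip install apify-client`; both values below are placeholders):
#   from apify_client import ApifyClient
#   records = run_extractor(ApifyClient("<APIFY_TOKEN>"), "<ACTOR_ID>",
#                           ["https://example.com/contracts/contract1.pdf"])
```

Passing the client in as an argument keeps the helper easy to exercise with a stub before spending compute on a real run.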
> [!TIP]
> You don't need to change the `dl=0`/`dl=1` flag yourself; the Actor forces a direct download automatically. Just paste the share link as Dropbox gives it to you.

> [!IMPORTANT]
> Links must be public ("Anyone with the link can view"). Files that require sign-in or a password, or that are shared only with specific accounts, cannot be downloaded and will be skipped. The Actor extracts embedded text only; it does not run OCR, so scanned or image-only PDFs won't produce data.
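
The Actor handles the `dl` flag for you, and its internal implementation is not documented here. If you ever need to fetch a shared Dropbox file yourself (for example, to check that a link really is public), the rewrite can be sketched as below. `force_direct_download` is a hypothetical helper, and the host check is an assumption about what counts as a Dropbox link.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def force_direct_download(url: str) -> str:
    """Return the URL with dl=1 if it is a Dropbox link; otherwise unchanged."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    if host != "dropbox.com" and not host.endswith(".dropbox.com"):
        return url
    # Keep every other query parameter (such as rlkey) and replace only dl.
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != "dl"
    ]
    query.append(("dl", "1"))
    return urlunsplit(parts._replace(query=urlencode(query)))
```

Links on other hosts pass through untouched, so the helper can be applied to a mixed list of direct PDF URLs and Dropbox links.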
📦 What you get back
Each processed PDF becomes one structured record:
```json
{
  "file_name": "contract_123.pdf",
  "Leverncier": "Essent",
  "Datum": "12-05-2026",
  "Contract": "Vast 1 jaar",
  "Geslacht": "Man",
  "Voorletters": "J.",
  "Tussenvoegsel": "de",
  "Achternaam": "Vries",
  "Geboortedatum": "01-01-1985",
  "Straat": "Hoofdstraat",
  "Huisnummer": "12",
  "Huisnummer toevoeging": "A",
  "Postcode": "1234 AB",
  "Plaats": "Amsterdam",
  "E-mailadres": "j.devries@example.com",
  "Telefoon": "0612345678",
  "IBAN rekeningnummer": "NL00BANK0123456789",
  "Gewenste startdatum / Ingangsdatum": "01-06-2026",
  "Contractduur": "12 maanden",
  "Variabele leveringskosten normaaltarief": "0,10",
  "Variabele leveringskosten daltarief": "0,09",
  "Stroom Tarief (incl.btw)": "€ 0,12",
  "Stroom Daltarief (incl.btw)": "€ 0,11",
  "Netbeheerkosten": "€ 0,00",
  "Verbruik gas": "1.200",
  "Gas Tarief": "€ 0,98",
  "Totaalkosten": "€ 1.234,56",
  "Welkomstcadeau(cashback)": "€ 50,00"
}
```
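All values arrive as strings in Dutch notation, so amounts and dates usually need converting before any arithmetic or sorting. The sketch below shows one way to do that. It assumes that `.` is always a thousands separator and `,` the decimal separator, and that dates are day-month-year, which matches the sample record but is not guaranteed for every contract. `parse_amount` and `parse_date` are hypothetical helper names.

```python
from datetime import date, datetime
from decimal import Decimal


def parse_amount(text: str) -> Decimal:
    """Convert a Dutch-formatted number such as '€ 1.234,56' to a Decimal."""
    cleaned = text.replace("€", "").replace("\u00a0", "").replace(" ", "")
    # Assumption: '.' groups thousands and ',' marks the decimals.
    cleaned = cleaned.replace(".", "").replace(",", ".")
    return Decimal(cleaned)


def parse_date(text: str) -> date:
    """Parse a date assumed to be day-month-year, for example '12-05-2026'."""
    return datetime.strptime(text.strip(), "%d-%m-%Y").date()
```

`Decimal` is used instead of `float` so that tariffs such as `0,10` keep their exact value when summed or compared.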
Fields captured
| Field | Description |
|---|---|
| `file_name` | Source PDF file name |
| `Leverncier` | Detected energy supplier (Energiedirect.nl, Budget Energie, NLE, Essent, Huismerk Energie) |
| `Datum` | Document date |
| `Contract` | Contract type / name |
| `Geslacht`, `Voorletters`, `Tussenvoegsel`, `Achternaam`, `Geboortedatum` | Customer personal details |
| `Straat`, `Huisnummer`, `Huisnummer toevoeging`, `Postcode`, `Plaats` | Customer address |
| `E-mailadres`, `Telefoon`, `IBAN rekeningnummer` | Contact & payment details |
| `Gewenste startdatum / Ingangsdatum`, `Contractduur` | Contract start date and duration |
| `Variabele leveringskosten normaaltarief`, `Variabele leveringskosten daltarief` | Variable electricity supply costs |
| `Stroom Tarief (incl.btw)`, `Stroom Daltarief (incl.btw)` | Electricity tariffs incl. VAT |
| `Netbeheerkosten` | Grid management costs |
| `Verbruik gas`, `Gas Tarief` | Gas usage and tariff |
| `Totaalkosten` | Total first-year cost |
| `Welkomstcadeau(cashback)` | Welcome gift / cashback |

> [!NOTE]
> Sellers don't always fill in every field, and some contracts use a different page layout. Any value that can't be found comes back as an empty string or `N/A` for that field; the rest of the record is still extracted.
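
Because a missing value can be either an empty string or `N/A`, it helps to collapse both into a single marker before loading records elsewhere. A minimal sketch, with `clean_record` and `missing_fields` as hypothetical helper names:

```python
MISSING_MARKERS = {"", "N/A"}


def clean_record(record: dict) -> dict:
    """Replace empty strings and 'N/A' with None; leave other values as-is."""
    return {
        key: None if isinstance(value, str) and value.strip() in MISSING_MARKERS else value
        for key, value in record.items()
    }


def missing_fields(record: dict) -> list:
    """List the field names that came back without a value."""
    return [key for key, value in clean_record(record).items() if value is None]
```

Running `missing_fields` over each record gives a quick list of contracts that need a manual look.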
📋 Input
| Field | Description | Required | Default |
|---|---|---|---|
| PDF / Dropbox links (`source_urls`) | Public links to PDFs: direct `.pdf` URLs and/or Dropbox file or folder links. All PDFs found are downloaded and processed. | Yes | — |

💡 Use cases
- Bulk onboarding — convert a folder of signed contracts into a customer spreadsheet in minutes.
- CRM data entry — feed structured records straight into your CRM or database instead of typing them.
- Auditing & QA — quickly compare tariffs, durations, and totals across many contracts.
- Automated intake — schedule the Actor to process new contracts as they're dropped into a shared Dropbox folder.
- Data pipelines — pipe results to Google Sheets, BigQuery, or webhooks via the Apify API.
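As an illustration of the database use case, the sketch below loads a few fields of each record into a local SQLite table using only the Python standard library. The table name, the column names, and the choice of `file_name` as the key are assumptions made for this example, not part of the Actor.

```python
import sqlite3

# Output field name -> column name. The column names are illustrative.
COLUMNS = {
    "file_name": "file_name",
    "Leverncier": "supplier",
    "Achternaam": "last_name",
    "Postcode": "postcode",
    "Totaalkosten": "total_cost",
}


def store_records(conn: sqlite3.Connection, records: list) -> int:
    """Insert one row per record, replacing rows that share a file_name."""
    cols = list(COLUMNS.values())
    conn.execute(
        f"CREATE TABLE IF NOT EXISTS contracts ({cols[0]} TEXT PRIMARY KEY, "
        + ", ".join(f"{col} TEXT" for col in cols[1:])
        + ")"
    )
    rows = [tuple(rec.get(field, "") for field in COLUMNS) for rec in records]
    placeholders = ", ".join("?" for _ in cols)
    conn.executemany(
        f"INSERT OR REPLACE INTO contracts ({', '.join(cols)}) VALUES ({placeholders})",
        rows,
    )
    conn.commit()
    return len(rows)
```

Using `INSERT OR REPLACE` means a scheduled run that sees the same PDF again updates the existing row instead of duplicating it.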
💰 Cost estimation
The Actor spends most of its time downloading PDFs and parsing text locally, which is lightweight — the main cost driver is the number and size of your PDFs. A batch of a few dozen contracts typically finishes in minutes and uses only a small number of compute units. Exact usage is shown on the run's detail page in the Apify Console.
❓ FAQ
Which suppliers are supported? The parser is tuned for Energiedirect.nl, Budget Energie, NLE, Essent, and Huismerk Energie. It expects the standard 3-section layout (personal details → electricity → gas). PDFs with significantly different layouts may yield partial results.
Can it read scanned contracts? No. It extracts the text embedded in the PDF and does not perform OCR. Image-only or scanned PDFs won't produce data.
Why is a file missing from the results? The most common reasons: the link isn't public, the file requires sign-in, or it's a scanned/image PDF. Check the run log — skipped files and download errors are reported there.
Are folder links supported? Yes — that's the recommended way. Dropbox folder links are downloaded as a single zip and unpacked automatically, so one link can deliver an entire batch of contracts.
Is this legal / GDPR-safe? Only use the Actor on documents you own or are authorized to process. Extracted records contain personal data (names, addresses, IBANs) — handle them in accordance with GDPR and your organization's data-protection policies.
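One small, practical precaution when working with these records is to mask the IBAN before printing or logging them. The sketch below is only an illustration of that idea, with hypothetical helper names; masking alone does not make a workflow GDPR-compliant.

```python
SENSITIVE_FIELDS = ("IBAN rekeningnummer",)


def mask_iban(iban: str, visible: int = 4) -> str:
    """Keep the first and last `visible` characters and mask the rest."""
    compact = iban.replace(" ", "")
    if len(compact) <= 2 * visible:
        return "*" * len(compact)
    return compact[:visible] + "*" * (len(compact) - 2 * visible) + compact[-visible:]


def redact_for_logging(record: dict) -> dict:
    """Return a copy of the record that is safer to print or log."""
    redacted = dict(record)
    for field in SENSITIVE_FIELDS:
        if redacted.get(field):
            redacted[field] = mask_iban(redacted[field])
    return redacted
```

The original record is left unchanged, so the full value is still available to the system that actually needs it.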
📮 Support
Bugs, feature requests, a new supplier layout, or custom work — open an issue on Apify or email alkausarimujahid@gmail.com.