
Z W
wallman_3rdData hoarder. Scraper builder. Turned govt PDFs into JSON, called it a career. Not a doctor, but I prescribe structured data. The W? Whatever.
Joined April 2026
ACTOR STATS
1 public Actor
1 total user
I build scrapers that survive the things web scraping should never have to deal with — 9MB government PDFs, undocumented APIs that 403 on Tuesdays, HTML tables nested inside Excel files nested inside ZIP downloads.
- Multi-source data pipelines with deduplication, change detection, and run health monitoring
- Government data scrapers — license databases, open data portals, and documents that were never meant to be machine-readable
- Platform + official source fusion — cross-reference what businesses say on Weedmaps against what the state says about their license
Cannabis Dispensary Data Scraper — 43 states, 6 platforms (Weedmaps, Leafly, iHeartJane, Dutchie + state license databases). Tracks dispensary openings, closures, and license changes across weekly runs. The only Actor that tells you what changed since your last run.
The goal is never the scraper — it's the delta. Who opened last week? Whose license expired? What changed? Static datasets go stale. Good pipelines don't.
Node.js · Crawlee · Cheerio · Playwright · Apify SDK
Open an issue on any actor if you need custom coverage, a different state, or a format that doesn't exist yet.