Chicago Open Data Portal Scraper
Pricing
from $15.00 / 1,000 result items
Chicago Open Data Portal Scraper
Export records from any City of Chicago Open Data Portal dataset. Browse catalog or pull rows by resource ID with filters and full-text search. Permits, crimes, businesses, transit, budget, salaries, 311, parks, and 600+ public datasets.
Pricing
from $15.00 / 1,000 result items
Rating
0.0
(0)
Developer
ParseForge
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share

๐๏ธ Chicago Open Data Portal Scraper
๐ Export any City of Chicago dataset in seconds. Browse the catalog of 600+ public datasets or pull rows from food inspections, crimes, 311 requests, permits, business licenses, salaries, and more. Filter with native SoQL.
๐ Last updated: 2026-05-22 ยท ๐ 13 fields per record ยท ๐๏ธ 600+ datasets ยท ๐๏ธ Official City of Chicago source ยท ๐ SoQL filtering
The Chicago Open Data Portal Scraper taps data.cityofchicago.org, the official Socrata-powered open-data hub of the City of Chicago. The Actor returns 13 structured fields per record, including the resource ID, dataset name, description, category, creation and update timestamps, row and column counts, the row payload itself, the dataset URL, and a collection timestamp.
The catalog covers more than 600 public datasets across food inspections, crimes 2001-present, 311 service requests, building permits, business licenses, employee salaries, transit ridership, traffic crashes, parks, COVID indicators, budget appropriations, and dozens of other domains. This Actor wraps Socrata's SoQL query language so you can filter, sort, search, and project columns without touching the API directly.
| ๐ฏ Target Audience | ๐ก Primary Use Cases |
|---|---|
| Urban planners, journalists, civic-tech developers, researchers, transparency advocates, local-government analysts, public-health teams | Crime trend analysis, restaurant inspection dashboards, 311 monitoring, permit-issuance studies, business-license intelligence, salary transparency, civic-tech apps |
๐ What the Chicago Open Data Portal Scraper does
Two modes in a single Actor:
- ๐ Catalog mode. Browse the full City of Chicago dataset catalog with names, descriptions, categories, and metadata.
- ๐ Dataset mode. Pull rows from any dataset by its 4x4 resource ID (e.g.
22u3-xenrBuilding Violations,ijzp-q8t2Crimes 2001-Present). - ๐งฎ Column filters. Pass
{"violation_status": "OPEN"}for simple equality filters. - ๐ SoQL WHERE clause. Native SoQL expression for advanced filtering (
violation_date > '2024-01-01' AND ward = '42'). - ๐ค Full-text search. Apply a
$qquery across all columns. - ๐ฏ Field selection. Return only the columns you need with
selectFields. - ๐ Ordering. Sort by any column (
violation_date DESC).
Each record bundles either the catalog metadata or one row of the chosen dataset, alongside the resource ID, dataset URL, and a collection timestamp.
๐ก Why it matters: Chicago has been one of the most ambitious municipal open-data programs in the U.S. since 2010. Hand-coding a Socrata client means dealing with pagination, polling, SoQL quirks, and per-dataset schemas. This Actor delivers consistent records you can pipe straight into BI tools, notebooks, or civic-tech apps.
๐ฌ Full Demo
๐ง Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.
โ๏ธ Input
| Input | Type | Default | Behavior |
|---|---|---|---|
maxItems | integer | 10 | Records to return. Free plan caps at 10, paid plan at 1,000,000. |
mode | enum | "dataset" | catalog lists all datasets. dataset pulls rows by resource ID. |
resourceId | string | "22u3-xenr" | 4x4 resource ID for the dataset. Required for dataset mode. |
filters | object | {} | Optional column-equality filters as key/value pairs. |
where | string | "" | Optional SoQL $where expression. |
searchQuery | string | "" | Optional $q full-text search query. |
selectFields | string | "" | Optional comma-separated column list. |
orderBy | string | "" | Optional sort expression. |
Example: pull recent open building violations in Ward 42.
{"maxItems": 100,"mode": "dataset","resourceId": "22u3-xenr","filters": { "violation_status": "OPEN", "ward": "42" },"orderBy": "violation_date DESC"}
Example: browse the full catalog of Chicago datasets.
{"maxItems": 200,"mode": "catalog"}
โ ๏ธ Good to Know: Chicago's open-data portal applies polite rate limits to anonymous callers. The Actor paces requests automatically so larger pulls run without 429 errors.
๐ Output
Each record contains 13 fields. Download the dataset as CSV, Excel, JSON, or XML.
๐งพ Schema
| Field | Type | Example |
|---|---|---|
๐ท๏ธ recordType | string | "row" |
๐ resourceId | string | "22u3-xenr" |
๐ name | string | null | "Building Violations" |
๐ description | string | null | "Building code violations cited by City of Chicago inspectors..." |
๐๏ธ category | string | null | "Buildings" |
๐ updatedAt | ISO 8601 | null | "2026-05-20T09:00:00.000Z" |
๐ createdAt | ISO 8601 | null | "2011-09-30T00:00:00.000Z" |
๐ rowsCount | integer | null | 3850000 |
๐ columnsCount | integer | null | 22 |
๐ฆ data | object | null | { "violation_status": "OPEN", "ward": "42", ... } |
๐ url | string | "https://data.cityofchicago.org/resource/22u3-xenr" |
๐ scrapedAt | ISO 8601 | "2026-05-22T00:00:00.000Z" |
โ ๏ธ error | string | null | null |
๐ฆ Sample records
โจ Why choose this Actor
| Capability | |
|---|---|
| ๐๏ธ | Official source. Direct from the City of Chicago open-data hub. |
| ๐ | Catalog + dataset modes. Discover datasets, then pull rows in the same workflow. |
| ๐ | Full SoQL. Filters, WHERE, full-text search, select, and order pass through to the source. |
| ๐ | 600+ datasets. Crime, food, 311, permits, licenses, salaries, transit, parks, COVID, budget. |
| โก | Fast. Catalog browse in seconds, dataset rows in under a minute for typical pulls. |
| ๐ | Always fresh. Every run hits the live Chicago feed. |
| ๐ซ | No API key. Public Chicago Socrata endpoints need no token. |
๐ Chicago's open-data program is one of the most-used municipal data resources in North America, powering academic research, journalism, and civic-tech apps.
๐ How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| โญ Chicago Open Data Portal Scraper (this Actor) | $5 free credit, then pay-per-use | 600+ datasets | Live per run | SoQL filter, where, q, select, order | โก 2 min |
| Manual CSV downloads from portal | Free | Per-dataset | Manual | None | ๐ข Hours per dataset |
| Hand-coded Socrata client | Free | Full | Per-build | Custom | โณ Days |
| Commercial municipal-data aggregators | $$$/year | Curated subsets | Vendor schedule | Vendor-defined | ๐ Sales cycle |
Pick this Actor when you want the authoritative source, SoQL filtering, and zero pipeline maintenance.
๐ How to use
- ๐ Sign up. Create a free account with $5 credit (takes 2 minutes).
- ๐ Open the Actor. Go to the Chicago Open Data Portal Scraper page on the Apify Store.
- ๐ฏ Set input. Run catalog mode to discover datasets, then dataset mode with a
resourceId. - ๐ Run it. Click Start and let the Actor collect your data.
- ๐ฅ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.
โฑ๏ธ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.
๐ผ Business use cases
๐ Automating Chicago Open Data Portal Scraper
Control the scraper programmatically for scheduled runs and pipeline integrations:
- ๐ข Node.js. Install the
apify-clientNPM package. - ๐ Python. Use the
apify-clientPyPI package. - ๐ See the Apify API documentation for full details.
The Apify Schedules feature lets you trigger this Actor on any cron interval. Hourly or daily refreshes keep downstream dashboards in sync automatically.
๐ Beyond business use cases
Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.
๐ค Ask an AI assistant about this scraper
Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:
- ๐ฌ ChatGPT
- ๐ง Claude
- ๐ Perplexity
- ๐ Copilot
โ Frequently Asked Questions
๐งฉ How does it work?
Pick catalog or dataset mode, set a resourceId and optional SoQL filters, click Start, and the Actor pulls live records from the Chicago open-data hub. No browser automation, no captchas, no setup.
๐ How accurate is the data?
Records mirror the live Chicago feed at run time. The City of Chicago publishes data directly from the source systems of each department, with documented refresh cadence per dataset.
๐ How often is the dataset refreshed?
Different datasets have different refresh cadences. Crime is daily, food inspections weekly, 311 near-real-time, permits weekly. The updatedAt field on each catalog entry tells you when the underlying dataset last changed.
๐ Where do I find the resource ID?
Every Chicago dataset has a 4x4 ID (e.g. 22u3-xenr) visible in its URL on data.cityofchicago.org. You can also run catalog mode to list every dataset alongside its ID.
โฐ Can I schedule regular runs?
Yes. Use Apify Schedules to run this Actor on any cron interval (hourly, daily, weekly) and keep a downstream dashboard in sync.
โ๏ธ Is this data legal to use?
The City of Chicago publishes its open data under permissive terms. Review the specific dataset disclaimer for any usage limits, but most reuse is permitted.
๐ผ Can I use this data commercially?
Yes. Chicago open data may be used for commercial purposes with attribution. You are responsible for complying with the portal terms and with any downstream regulatory requirements.
๐ณ Do I need a paid Apify plan to use this Actor?
No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you access to scheduling, higher concurrency, and larger datasets.
๐ What happens if a run fails or gets interrupted?
Apify automatically retries transient errors. If a run still fails, you can inspect the log in the Runs tab, fix the input, and re-run. Partial datasets from failed runs are preserved so you never lose progress.
๐ What if I need help?
Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.
๐ Integrate with any app
Chicago Open Data Portal Scraper connects to any cloud service via Apify integrations:
- Make - Automate multi-step workflows
- Zapier - Connect with 5,000+ apps
- Slack - Get run notifications in your channels
- Airbyte - Pipe Chicago data into your warehouse
- GitHub - Trigger runs from commits and releases
- Google Drive - Export datasets straight to Sheets
You can also use webhooks to trigger downstream actions when a run finishes. Push fresh Chicago data into your product backend, or alert your team in Slack.
๐ Recommended Actors
- ๐ณ๐ฑ Dutch CBS Statistics Scraper - Official Netherlands statistics
- ๐ Indexmundi Scraper - Global demographic and economic indicators
- ๐บ๏ธ Nominatim OSM Scraper - Geocode Chicago addresses via OpenStreetMap
- ๐ World Bank Open Data Scraper - Cross-country indicators
- ๐ฆ๏ธ NOAA Weather Scraper - U.S. weather observations and forecasts
๐ก Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.
๐ Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.
โ ๏ธ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the City of Chicago, Socrata, Tyler Technologies, or any government body. All trademarks mentioned are the property of their respective owners. Only publicly available open data is collected.