Chicago Open Data Portal Scraper avatar

Chicago Open Data Portal Scraper

Pricing

from $15.00 / 1,000 result items

Go to Apify Store
Chicago Open Data Portal Scraper

Chicago Open Data Portal Scraper

Export records from any City of Chicago Open Data Portal dataset. Browse catalog or pull rows by resource ID with filters and full-text search. Permits, crimes, businesses, transit, budget, salaries, 311, parks, and 600+ public datasets.

Pricing

from $15.00 / 1,000 result items

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

ParseForge Banner

๐Ÿ™๏ธ Chicago Open Data Portal Scraper

๐Ÿš€ Export any City of Chicago dataset in seconds. Browse the catalog of 600+ public datasets or pull rows from food inspections, crimes, 311 requests, permits, business licenses, salaries, and more. Filter with native SoQL.

๐Ÿ•’ Last updated: 2026-05-22 ยท ๐Ÿ“Š 13 fields per record ยท ๐Ÿ™๏ธ 600+ datasets ยท ๐Ÿ›๏ธ Official City of Chicago source ยท ๐Ÿ”Ž SoQL filtering

The Chicago Open Data Portal Scraper taps data.cityofchicago.org, the official Socrata-powered open-data hub of the City of Chicago. The Actor returns 13 structured fields per record, including the resource ID, dataset name, description, category, creation and update timestamps, row and column counts, the row payload itself, the dataset URL, and a collection timestamp.

The catalog covers more than 600 public datasets across food inspections, crimes 2001-present, 311 service requests, building permits, business licenses, employee salaries, transit ridership, traffic crashes, parks, COVID indicators, budget appropriations, and dozens of other domains. This Actor wraps Socrata's SoQL query language so you can filter, sort, search, and project columns without touching the API directly.

๐ŸŽฏ Target Audience๐Ÿ’ก Primary Use Cases
Urban planners, journalists, civic-tech developers, researchers, transparency advocates, local-government analysts, public-health teamsCrime trend analysis, restaurant inspection dashboards, 311 monitoring, permit-issuance studies, business-license intelligence, salary transparency, civic-tech apps

๐Ÿ“‹ What the Chicago Open Data Portal Scraper does

Two modes in a single Actor:

  • ๐Ÿ“š Catalog mode. Browse the full City of Chicago dataset catalog with names, descriptions, categories, and metadata.
  • ๐Ÿ“Š Dataset mode. Pull rows from any dataset by its 4x4 resource ID (e.g. 22u3-xenr Building Violations, ijzp-q8t2 Crimes 2001-Present).
  • ๐Ÿงฎ Column filters. Pass {"violation_status": "OPEN"} for simple equality filters.
  • ๐Ÿ”Ž SoQL WHERE clause. Native SoQL expression for advanced filtering (violation_date > '2024-01-01' AND ward = '42').
  • ๐Ÿ”ค Full-text search. Apply a $q query across all columns.
  • ๐ŸŽฏ Field selection. Return only the columns you need with selectFields.
  • ๐Ÿ“ Ordering. Sort by any column (violation_date DESC).

Each record bundles either the catalog metadata or one row of the chosen dataset, alongside the resource ID, dataset URL, and a collection timestamp.

๐Ÿ’ก Why it matters: Chicago has been one of the most ambitious municipal open-data programs in the U.S. since 2010. Hand-coding a Socrata client means dealing with pagination, polling, SoQL quirks, and per-dataset schemas. This Actor delivers consistent records you can pipe straight into BI tools, notebooks, or civic-tech apps.


๐ŸŽฌ Full Demo

๐Ÿšง Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.


โš™๏ธ Input

InputTypeDefaultBehavior
maxItemsinteger10Records to return. Free plan caps at 10, paid plan at 1,000,000.
modeenum"dataset"catalog lists all datasets. dataset pulls rows by resource ID.
resourceIdstring"22u3-xenr"4x4 resource ID for the dataset. Required for dataset mode.
filtersobject{}Optional column-equality filters as key/value pairs.
wherestring""Optional SoQL $where expression.
searchQuerystring""Optional $q full-text search query.
selectFieldsstring""Optional comma-separated column list.
orderBystring""Optional sort expression.

Example: pull recent open building violations in Ward 42.

{
"maxItems": 100,
"mode": "dataset",
"resourceId": "22u3-xenr",
"filters": { "violation_status": "OPEN", "ward": "42" },
"orderBy": "violation_date DESC"
}

Example: browse the full catalog of Chicago datasets.

{
"maxItems": 200,
"mode": "catalog"
}

โš ๏ธ Good to Know: Chicago's open-data portal applies polite rate limits to anonymous callers. The Actor paces requests automatically so larger pulls run without 429 errors.


๐Ÿ“Š Output

Each record contains 13 fields. Download the dataset as CSV, Excel, JSON, or XML.

๐Ÿงพ Schema

FieldTypeExample
๐Ÿท๏ธ recordTypestring"row"
๐Ÿ†” resourceIdstring"22u3-xenr"
๐Ÿ“› namestring | null"Building Violations"
๐Ÿ“ descriptionstring | null"Building code violations cited by City of Chicago inspectors..."
๐Ÿ—‚๏ธ categorystring | null"Buildings"
๐Ÿ•’ updatedAtISO 8601 | null"2026-05-20T09:00:00.000Z"
๐Ÿ•’ createdAtISO 8601 | null"2011-09-30T00:00:00.000Z"
๐Ÿ“Š rowsCountinteger | null3850000
๐Ÿ“ columnsCountinteger | null22
๐Ÿ“ฆ dataobject | null{ "violation_status": "OPEN", "ward": "42", ... }
๐Ÿ”— urlstring"https://data.cityofchicago.org/resource/22u3-xenr"
๐Ÿ•“ scrapedAtISO 8601"2026-05-22T00:00:00.000Z"
โš ๏ธ errorstring | nullnull

๐Ÿ“ฆ Sample records


โœจ Why choose this Actor

Capability
๐Ÿ™๏ธOfficial source. Direct from the City of Chicago open-data hub.
๐Ÿ“šCatalog + dataset modes. Discover datasets, then pull rows in the same workflow.
๐Ÿ”ŽFull SoQL. Filters, WHERE, full-text search, select, and order pass through to the source.
๐Ÿ“Š600+ datasets. Crime, food, 311, permits, licenses, salaries, transit, parks, COVID, budget.
โšกFast. Catalog browse in seconds, dataset rows in under a minute for typical pulls.
๐Ÿ”Always fresh. Every run hits the live Chicago feed.
๐ŸšซNo API key. Public Chicago Socrata endpoints need no token.

๐Ÿ“Š Chicago's open-data program is one of the most-used municipal data resources in North America, powering academic research, journalism, and civic-tech apps.


๐Ÿ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
โญ Chicago Open Data Portal Scraper (this Actor)$5 free credit, then pay-per-use600+ datasetsLive per runSoQL filter, where, q, select, orderโšก 2 min
Manual CSV downloads from portalFreePer-datasetManualNone๐Ÿข Hours per dataset
Hand-coded Socrata clientFreeFullPer-buildCustomโณ Days
Commercial municipal-data aggregators$$$/yearCurated subsetsVendor scheduleVendor-defined๐Ÿ•’ Sales cycle

Pick this Actor when you want the authoritative source, SoQL filtering, and zero pipeline maintenance.


๐Ÿš€ How to use

  1. ๐Ÿ“ Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. ๐ŸŒ Open the Actor. Go to the Chicago Open Data Portal Scraper page on the Apify Store.
  3. ๐ŸŽฏ Set input. Run catalog mode to discover datasets, then dataset mode with a resourceId.
  4. ๐Ÿš€ Run it. Click Start and let the Actor collect your data.
  5. ๐Ÿ“ฅ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

โฑ๏ธ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


๐Ÿ’ผ Business use cases

๐Ÿ›๏ธ Civic tech and government

  • 311 service-request dashboards and SLAs
  • Permit-issuance trend analysis for housing
  • Open-budget tracking and accountability tools
  • Transparency portals on public salaries

๐Ÿ“ฐ Journalism and research

  • Crime-trend analyses for newsroom investigations
  • Restaurant-inspection feeds for local-news apps
  • Election-cycle reporting on ward-level data
  • Long-form research on neighbourhood inequality

๐Ÿ™๏ธ Urban planning and real estate

  • Building-violation hot-spot mapping
  • Business-license density for site selection
  • Traffic and crash analysis for road design
  • Park and amenity feeds for neighbourhood scores

๐Ÿฅ Public health and safety

  • Food-inspection compliance dashboards
  • COVID-indicator monitoring and modelling
  • 911/911-alt response-time analysis
  • Environmental and pollution tracking

๐Ÿ”Œ Automating Chicago Open Data Portal Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

  • ๐ŸŸข Node.js. Install the apify-client NPM package.
  • ๐Ÿ Python. Use the apify-client PyPI package.
  • ๐Ÿ“š See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Hourly or daily refreshes keep downstream dashboards in sync automatically.


๐ŸŒŸ Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

๐ŸŽ“ Research and academia

  • Theses on urban inequality, mobility, or housing
  • Reproducible studies with cited dataset versions
  • Teaching datasets for data-journalism courses
  • Cross-city comparisons mixing Chicago with peer feeds

๐ŸŽจ Personal and creative

  • Side projects exploring Chicago neighbourhoods
  • Visualization projects on crime, transit, or weather
  • Data art pieces using public records
  • Local-history projects with archival data

๐Ÿค Non-profit and civic

  • Community-organization advocacy tools
  • Tenant-rights apps using building violations
  • Local-government accountability platforms
  • Citizen-science projects on environmental data

๐Ÿงช Experimentation

  • Train forecasting models on Chicago crime or 311
  • Test civic-tech product hypotheses with real records
  • Prototype agent pipelines that answer city questions
  • Build LLM-grounded urban assistants with cited rows

๐Ÿค– Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:


โ“ Frequently Asked Questions

๐Ÿงฉ How does it work?

Pick catalog or dataset mode, set a resourceId and optional SoQL filters, click Start, and the Actor pulls live records from the Chicago open-data hub. No browser automation, no captchas, no setup.

๐Ÿ“ How accurate is the data?

Records mirror the live Chicago feed at run time. The City of Chicago publishes data directly from the source systems of each department, with documented refresh cadence per dataset.

๐Ÿ” How often is the dataset refreshed?

Different datasets have different refresh cadences. Crime is daily, food inspections weekly, 311 near-real-time, permits weekly. The updatedAt field on each catalog entry tells you when the underlying dataset last changed.

๐Ÿ†” Where do I find the resource ID?

Every Chicago dataset has a 4x4 ID (e.g. 22u3-xenr) visible in its URL on data.cityofchicago.org. You can also run catalog mode to list every dataset alongside its ID.

โฐ Can I schedule regular runs?

Yes. Use Apify Schedules to run this Actor on any cron interval (hourly, daily, weekly) and keep a downstream dashboard in sync.

The City of Chicago publishes its open data under permissive terms. Review the specific dataset disclaimer for any usage limits, but most reuse is permitted.

๐Ÿ’ผ Can I use this data commercially?

Yes. Chicago open data may be used for commercial purposes with attribution. You are responsible for complying with the portal terms and with any downstream regulatory requirements.

๐Ÿ’ณ Do I need a paid Apify plan to use this Actor?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you access to scheduling, higher concurrency, and larger datasets.

๐Ÿ” What happens if a run fails or gets interrupted?

Apify automatically retries transient errors. If a run still fails, you can inspect the log in the Runs tab, fix the input, and re-run. Partial datasets from failed runs are preserved so you never lose progress.

๐Ÿ†˜ What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.


๐Ÿ”Œ Integrate with any app

Chicago Open Data Portal Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe Chicago data into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh Chicago data into your product backend, or alert your team in Slack.


๐Ÿ’ก Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


๐Ÿ†˜ Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.


โš ๏ธ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the City of Chicago, Socrata, Tyler Technologies, or any government body. All trademarks mentioned are the property of their respective owners. Only publicly available open data is collected.