# US Building-Permits Bulk Extractor (Socrata) (`xtracto/building-permits`) Actor

Extract millions of US building-permit records from city open-data portals (Socrata) — construction-activity intelligence. One permit per row. Public data, no login.

- **URL**: https://apify.com/xtracto/building-permits.md
- **Developed by:** [xtractoo](https://apify.com/xtracto) (community)
- **Categories:** Lead generation, Real estate
- **Stats:** 2 total users, 1 monthly users, 0.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $1.00 / 1,000 results

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## US Building Permits Scraper

Extract **building-permit records** from US & Canadian city open-data portals — permit number, type, work description, status, issue date, address, reported cost, and coordinates. One **permit per row**, ready for a construction-activity model, spreadsheet, or database.

Built for real-estate funds and REITs, building-material suppliers, contractors, proptech, and analysts who treat permit volume as a leading construction-activity signal.

---

### Why use this actor

- **No account, no login, no API key required.**
- **Huge volume in one run** — pull a whole city, or a date slice across many cities (millions of permits available across the supported portals).
- **Filter before you pull** — restrict to a date range so you only get (and only pay for) what you need.
- **Rich, structured detail** per permit — type, work description, status, address, reported cost, and coordinates where available.
- **Stable JSON output** ready for pipelines, CRMs, spreadsheets, or databases (download as JSON, CSV, or Excel).

### How it works

1. You pick one or more **cities** from the list (or point the actor at any other city's open-data table you know).
2. *(Optional)* Add a **date filter** to narrow the pull.
3. The actor collects every matching permit — number, type, work description, status, issue date, address, reported cost, and coordinates — and de-duplicates as it goes.
4. Results stream into your dataset, ready to download as JSON, CSV, or Excel.

You don't manage any scrapers, accounts, or infrastructure — just pick cities and press **Run**.

### Use cases

**Track construction activity in a market**
Pull a city's permits filtered to the last 12 months and chart new-construction volume month over month — an early read on local building activity before it shows up in official statistics.

**Find leads for building-material & trade suppliers**
Filter permits by work description (e.g. roofing, electrical, new construction) to build a list of active projects and the addresses behind them.

**Feed a real-estate or lending model**
Join permit counts and reported construction cost by ZIP to a property model to flag neighborhoods with rising development.

### Input

| Field | Type | Description |
| --- | --- | --- |
| `cities` | multi-select | Pick built-in cities (Chicago, NYC, LA, San Francisco, Austin, Dallas, Seattle, and more). |
| `customDatasets` | list | Optional — point at any other city's open-data table by domain + dataset id. |
| `where` | string | Optional date/field filter, e.g. `issue_date > '2026-01-01T00:00:00'`. |
| `appToken` | string | Optional free token from the data portal — speeds up very large pulls. |
| `pageSize` | integer | Records fetched per request (default 1000). |
| `maxItemsPerDataset` | integer | 0 = all. |

### Output — one record per permit

```json
{
  "_input": "chicago",
  "_source": "S1-socrata",
  "_scrapedAt": "2026-06-03T10:00:00Z",
  "recordType": "PERMIT",
  "city_key": "chicago",
  "permit_": "100123456",
  "permit_type": "PERMIT - NEW CONSTRUCTION",
  "issue_date": "2026-02-14T00:00:00.000",
  "work_description": "...",
  "street_number": "1234", "street_name": "N State St",
  "reported_cost": "250000"
}
````

| Field | Type | Description |
| --- | --- | --- |
| `city_key` | string | Which city the permit is from. |
| `permit_type` | string | Permit category (field names vary slightly by city). |
| `issue_date` | string | When the permit was issued. |
| `work_description` | string | What the work is. |
| `reported_cost` | string | Declared construction cost where available. |

(Each city's portal uses its own column names; all fields are passed through as-is.)

### Known limits

- Public government open-data — no account needed. Very large pulls run faster with a free portal token (optional input).
- Each supported city's table uses its own column names, passed through verbatim.
- Verified across Chicago (837k+), New York (3.9M+), Austin (2.3M+), Los Angeles (392k+) and more.

# Actor input Schema

## `cities` (type: `array`):

Pick one or more built-in city permit datasets. (Need another city? Use Custom datasets below.)

## `customDatasets` (type: `array`):

Any Socrata dataset: \[{"domain":"data.seattle.gov","datasetId":"76t5-zqzr","cityKey":"seattle"}]

## `where` (type: `string`):

Optional. e.g. issue\_date > '2026-01-01T00:00:00' — narrows the pull (and your result count).

## `appToken` (type: `string`):

Optional free token (data.gov / Socrata). Lifts the anonymous rate limit — recommended for large pulls.

## `pageSize` (type: `integer`):

How many permits to fetch per request (default 1000).

## `maxItemsPerDataset` (type: `integer`):

Cap the number of permits pulled per city. 0 means all.

## `maxConcurrency` (type: `integer`):

How many cities to process in parallel.

## `proxyConfiguration` (type: `object`):

Not required (public open-data API). Leave off unless you hit rate limits.

## Actor input object example

```json
{
  "cities": [
    "chicago"
  ],
  "customDatasets": [],
  "where": "",
  "pageSize": 1000,
  "maxItemsPerDataset": 0,
  "maxConcurrency": 3,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "cities": [
        "chicago"
    ],
    "customDatasets": [],
    "where": "",
    "appToken": ""
};

// Run the Actor and wait for it to finish
const run = await client.actor("xtracto/building-permits").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "cities": ["chicago"],
    "customDatasets": [],
    "where": "",
    "appToken": "",
}

# Run the Actor and wait for it to finish
run = client.actor("xtracto/building-permits").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "cities": [
    "chicago"
  ],
  "customDatasets": [],
  "where": "",
  "appToken": ""
}' |
apify call xtracto/building-permits --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=xtracto/building-permits",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "US Building-Permits Bulk Extractor (Socrata)",
        "description": "Extract millions of US building-permit records from city open-data portals (Socrata) — construction-activity intelligence. One permit per row. Public data, no login.",
        "version": "0.1",
        "x-build-id": "SDK7qxLIfp89j38P0"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/xtracto~building-permits/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-xtracto-building-permits",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/xtracto~building-permits/runs": {
            "post": {
                "operationId": "runs-sync-xtracto-building-permits",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/xtracto~building-permits/run-sync": {
            "post": {
                "operationId": "run-sync-xtracto-building-permits",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "cities": {
                        "title": "Cities",
                        "type": "array",
                        "description": "Pick one or more built-in city permit datasets. (Need another city? Use Custom datasets below.)",
                        "items": {
                            "type": "string",
                            "enum": [
                                "chicago",
                                "sf",
                                "austin",
                                "nyc",
                                "la",
                                "calgary",
                                "seattle",
                                "cincinnati",
                                "winnipeg",
                                "mesa",
                                "batonrouge",
                                "dallas",
                                "sandiego_co",
                                "gainesville",
                                "marin",
                                "roseville"
                            ],
                            "enumTitles": [
                                "Chicago, IL",
                                "San Francisco, CA",
                                "Austin, TX",
                                "New York City (DOB)",
                                "Los Angeles, CA",
                                "Calgary, AB",
                                "Seattle, WA",
                                "Cincinnati, OH",
                                "Winnipeg, MB",
                                "Mesa, AZ",
                                "Baton Rouge, LA",
                                "Dallas, TX",
                                "San Diego County, CA",
                                "Gainesville, FL",
                                "Marin County, CA",
                                "Roseville, CA"
                            ]
                        },
                        "default": [
                            "chicago"
                        ]
                    },
                    "customDatasets": {
                        "title": "Custom Socrata datasets",
                        "type": "array",
                        "description": "Any Socrata dataset: [{\"domain\":\"data.seattle.gov\",\"datasetId\":\"76t5-zqzr\",\"cityKey\":\"seattle\"}]"
                    },
                    "where": {
                        "title": "SoQL $where filter",
                        "type": "string",
                        "description": "Optional. e.g. issue_date > '2026-01-01T00:00:00' — narrows the pull (and your result count).",
                        "default": ""
                    },
                    "appToken": {
                        "title": "Socrata App Token",
                        "type": "string",
                        "description": "Optional free token (data.gov / Socrata). Lifts the anonymous rate limit — recommended for large pulls."
                    },
                    "pageSize": {
                        "title": "Page size",
                        "minimum": 100,
                        "maximum": 50000,
                        "type": "integer",
                        "description": "How many permits to fetch per request (default 1000).",
                        "default": 1000
                    },
                    "maxItemsPerDataset": {
                        "title": "Max items per dataset (0 = all)",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Cap the number of permits pulled per city. 0 means all.",
                        "default": 0
                    },
                    "maxConcurrency": {
                        "title": "Max concurrent datasets",
                        "minimum": 1,
                        "maximum": 8,
                        "type": "integer",
                        "description": "How many cities to process in parallel.",
                        "default": 3
                    },
                    "proxyConfiguration": {
                        "title": "Proxy",
                        "type": "object",
                        "description": "Not required (public open-data API). Leave off unless you hit rate limits.",
                        "default": {
                            "useApifyProxy": false
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
