# ProPublica Nonprofit Explorer Scraper (`crawlerbros/propublica-nonprofit-scraper`) Actor

Scrape ProPublica's Nonprofit Explorer, IRS Form 990 data for 1.8M+ US nonprofits. Search by keyword, state, NTEE category, tax status, or look up organizations by EIN. Get financials, filings, and PDF links.

- **URL**: https://apify.com/crawlerbros/propublica-nonprofit-scraper.md
- **Developed by:** [Crawler Bros](https://apify.com/crawlerbros) (community)
- **Categories:** Developer tools, Automation, Other
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 4 bookmarks
- **User rating**: 5.00 out of 5 stars

## Pricing

from $3.00 / 1,000 results

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## ProPublica Nonprofit Explorer Scraper

Access IRS Form 990 financial data for **1.8M+ US nonprofits** via ProPublica's free Nonprofit Explorer API. Search by keyword, filter by state or NTEE category, or look up any organization directly by EIN number.

Perfect for grant research, charity vetting, philanthropic analysis, and academic research on the nonprofit sector.

### Features

- **Search** nonprofits by keyword/name with optional state, NTEE category, and 501(c) tax status filters
- **Look up by EIN** — get full organization details, address, financial summary, and all available Form 990 filings
- **Top by revenue** — browse top nonprofits filtered by state and NTEE category
- Covers **1.8M+ organizations** including charities, foundations, trade associations, and social clubs
- Returns financial data: total revenue, expenses, assets, support, and officer compensation percentage
- Includes PDF links to actual Form 990 filings on ProPublica's servers

### Input Parameters

| Parameter | Type | Description |
|-----------|------|-------------|
| `mode` | Select | `search`, `byEIN`, or `topByRevenue` |
| `query` | Text | Keyword/name to search (mode=search) |
| `ein` | Text | 9-digit EIN number, e.g. `530196605` (mode=byEIN) |
| `state` | Select | 2-letter US state code to filter results |
| `nteeCode` | Select | NTEE category letter A–Z |
| `cCode` | Select | 501(c) tax status: 3, 4, 6, or 7 |
| `maxItems` | Integer | Maximum records to return (1–10,000, default 50) |

#### NTEE Category Codes

| Code | Category |
|------|----------|
| A | Arts, Culture & Humanities |
| B | Education |
| C | Environment |
| D | Animal-Related |
| E | Health Care |
| F | Mental Health |
| G | Diseases & Medical Disciplines |
| H | Medical Research |
| I | Crime & Legal |
| J | Employment |
| K | Food, Agriculture & Nutrition |
| L | Housing & Shelter |
| M | Public Safety |
| N | Recreation & Sports |
| O | Youth Development |
| P | Human Services |
| Q | International Affairs |
| R | Civil Rights |
| S | Community Improvement |
| T | Philanthropy & Grantmaking |
| U | Science & Technology |
| V | Social Science |
| W | Public & Societal Benefit |
| X | Religion |
| Y | Mutual & Membership Benefit |
| Z | Unknown |

### Output Fields

| Field | Description |
|-------|-------------|
| `ein` | 9-digit Employer Identification Number |
| `name` | Organization legal name |
| `city` | City of primary address |
| `state` | 2-letter US state code |
| `zipcode` | ZIP code (mode=byEIN) |
| `address` | Street address (mode=byEIN) |
| `nteeCode` | NTEE category code |
| `subsectionCode` | IRS 501(c) subsection number |
| `assetAmount` | Total assets in USD |
| `incomeAmount` | Total income in USD |
| `revenueAmount` | Total revenue in USD (mode=byEIN) |
| `form990Count` | Number of Form 990s filed |
| `filingYear` | Tax year of the filing |
| `formType` | Form type (990, 990-EZ, 990-PF) |
| `totalRevenue` | Total revenue from filing |
| `totalExpenses` | Total functional expenses |
| `totalAssets` | Total assets end of year |
| `totalSupport` | Total support received |
| `pctOfficerCompensation` | % of expenses for officer compensation |
| `pdfUrl` | Direct link to Form 990 PDF |
| `propublicaUrl` | ProPublica organization profile URL |
| `rulingDate` | IRS ruling date (YYYYMM format) |
| `scrapedAt` | ISO 8601 timestamp of scrape |

### Example Use Cases

**Find education nonprofits in California:**
```json
{
  "mode": "search",
  "query": "education",
  "state": "CA",
  "nteeCode": "B",
  "maxItems": 100
}
````

**Look up a specific organization by EIN:**

```json
{
  "mode": "byEIN",
  "ein": "530196605"
}
```

**Get top 50 health nonprofits in New York:**

```json
{
  "mode": "topByRevenue",
  "state": "NY",
  "nteeCode": "E",
  "maxItems": 50
}
```

### Data Source

All data is sourced from [ProPublica's Nonprofit Explorer](https://projects.propublica.org/nonprofits/), which aggregates IRS Form 990 filings. The API is completely free and requires no authentication.

### FAQs

**Q: What is a Form 990?**
A: Form 990 is the IRS annual information return required from most tax-exempt organizations. It includes financial statements, governance information, and compensation data.

**Q: What is an EIN?**
A: The Employer Identification Number (EIN) is a unique 9-digit number assigned by the IRS to identify organizations. Format: XX-XXXXXXX.

**Q: How current is the data?**
A: ProPublica updates its data regularly as new Form 990s are processed by the IRS. Filings typically appear 6–18 months after the tax year ends.

**Q: Can I get PDF copies of Form 990s?**
A: Yes — the `pdfUrl` field in mode=byEIN provides direct links to Form 990 PDF files hosted by ProPublica.

**Q: What is NTEE?**
A: The National Taxonomy of Exempt Entities (NTEE) is a classification system for nonprofit organizations developed by the IRS and the National Center for Charitable Statistics.

**Q: Are all 1.8M nonprofits searchable?**
A: Yes. ProPublica indexes all organizations that have filed Form 990 with the IRS, including inactive organizations.

**Q: Is there a cost to use this scraper?**
A: No. The underlying ProPublica API is free and public. This scraper uses only standard Apify datacenter proxies (included in the free tier).

# Actor input Schema

## `mode` (type: `string`):

What to fetch: search nonprofits by keyword, look up by EIN, or get top nonprofits by revenue.

## `query` (type: `string`):

Keyword or organization name to search (mode=search).

## `ein` (type: `string`):

Employer Identification Number, e.g. '530196605' or '13-1837418'.

## `state` (type: `string`):

Limit results to a US state (2-letter code).

## `nteeCode` (type: `string`):

National Taxonomy of Exempt Entities category (single letter A-Z).

## `cCode` (type: `string`):

IRS subsection code (tax-exempt status type).

## `maxItems` (type: `integer`):

Hard cap on emitted records.

## Actor input object example

```json
{
  "mode": "search",
  "query": "education",
  "state": "",
  "nteeCode": "",
  "cCode": "",
  "maxItems": 50
}
```

# Actor output Schema

## `nonprofits` (type: `string`):

Dataset containing all scraped nonprofit records with EIN, financials, and filing data.

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "mode": "search",
    "query": "education",
    "state": "",
    "nteeCode": "",
    "cCode": "",
    "maxItems": 50
};

// Run the Actor and wait for it to finish
const run = await client.actor("crawlerbros/propublica-nonprofit-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "mode": "search",
    "query": "education",
    "state": "",
    "nteeCode": "",
    "cCode": "",
    "maxItems": 50,
}

# Run the Actor and wait for it to finish
run = client.actor("crawlerbros/propublica-nonprofit-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "mode": "search",
  "query": "education",
  "state": "",
  "nteeCode": "",
  "cCode": "",
  "maxItems": 50
}' |
apify call crawlerbros/propublica-nonprofit-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=crawlerbros/propublica-nonprofit-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "ProPublica Nonprofit Explorer Scraper",
        "description": "Scrape ProPublica's Nonprofit Explorer, IRS Form 990 data for 1.8M+ US nonprofits. Search by keyword, state, NTEE category, tax status, or look up organizations by EIN. Get financials, filings, and PDF links.",
        "version": "1.0",
        "x-build-id": "0l3JxUaz2jbxOj0rL"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/crawlerbros~propublica-nonprofit-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-crawlerbros-propublica-nonprofit-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/crawlerbros~propublica-nonprofit-scraper/runs": {
            "post": {
                "operationId": "runs-sync-crawlerbros-propublica-nonprofit-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/crawlerbros~propublica-nonprofit-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-crawlerbros-propublica-nonprofit-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "mode"
                ],
                "properties": {
                    "mode": {
                        "title": "Mode",
                        "enum": [
                            "search",
                            "byEIN",
                            "topByRevenue"
                        ],
                        "type": "string",
                        "description": "What to fetch: search nonprofits by keyword, look up by EIN, or get top nonprofits by revenue.",
                        "default": "search"
                    },
                    "query": {
                        "title": "Search query",
                        "type": "string",
                        "description": "Keyword or organization name to search (mode=search).",
                        "default": "education"
                    },
                    "ein": {
                        "title": "EIN (mode=byEIN)",
                        "type": "string",
                        "description": "Employer Identification Number, e.g. '530196605' or '13-1837418'."
                    },
                    "state": {
                        "title": "State filter",
                        "enum": [
                            "",
                            "AL",
                            "AK",
                            "AZ",
                            "AR",
                            "CA",
                            "CO",
                            "CT",
                            "DE",
                            "DC",
                            "FL",
                            "GA",
                            "HI",
                            "ID",
                            "IL",
                            "IN",
                            "IA",
                            "KS",
                            "KY",
                            "LA",
                            "ME",
                            "MD",
                            "MA",
                            "MI",
                            "MN",
                            "MS",
                            "MO",
                            "MT",
                            "NE",
                            "NV",
                            "NH",
                            "NJ",
                            "NM",
                            "NY",
                            "NC",
                            "ND",
                            "OH",
                            "OK",
                            "OR",
                            "PA",
                            "RI",
                            "SC",
                            "SD",
                            "TN",
                            "TX",
                            "UT",
                            "VT",
                            "VA",
                            "WA",
                            "WV",
                            "WI",
                            "WY"
                        ],
                        "type": "string",
                        "description": "Limit results to a US state (2-letter code).",
                        "default": ""
                    },
                    "nteeCode": {
                        "title": "NTEE category",
                        "enum": [
                            "",
                            "A",
                            "B",
                            "C",
                            "D",
                            "E",
                            "F",
                            "G",
                            "H",
                            "I",
                            "J",
                            "K",
                            "L",
                            "M",
                            "N",
                            "O",
                            "P",
                            "Q",
                            "R",
                            "S",
                            "T",
                            "U",
                            "V",
                            "W",
                            "X",
                            "Y",
                            "Z"
                        ],
                        "type": "string",
                        "description": "National Taxonomy of Exempt Entities category (single letter A-Z).",
                        "default": ""
                    },
                    "cCode": {
                        "title": "501(c) tax code",
                        "enum": [
                            "",
                            "3",
                            "4",
                            "6",
                            "7"
                        ],
                        "type": "string",
                        "description": "IRS subsection code (tax-exempt status type).",
                        "default": ""
                    },
                    "maxItems": {
                        "title": "Max items",
                        "minimum": 1,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Hard cap on emitted records.",
                        "default": 50
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
