# GoodFirms Companies Directory Scraper (`parseforge/goodfirms-companies-scraper`) Actor

Pull ranked B2B service providers from GoodFirms by category slug, country, and minimum rating. Returns company name, services, location, team size, hourly rate, rating, review count, founded year, and profile URL. Built for vendor discovery, market mapping, and outbound sales prospecting.

- **URL**: https://apify.com/parseforge/goodfirms-companies-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Lead generation
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $7.50 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 🏅 GoodFirms Companies Directory Scraper

> 🚀 **Export GoodFirms B2B company directories in seconds. Names, services, hourly rates, headcounts, locations, founded year, clients, and ratings, straight from goodfirms.co.**

> 🕒 **Last updated:** 2026-05-29 · **📊 12 fields** per record · Public directory · No login required

The GoodFirms Companies Directory Scraper turns the public [GoodFirms](https://www.goodfirms.co) directory into a clean, structured dataset. Hand it a search query or a category, and it returns one row per profile with every public field flattened and normalized.

You get 12 fields per record - the same ones a human visitor sees on the public site, just structured so you can drop them straight into Excel, BigQuery, or your CRM.

| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| 🏢 B2B sales teams | Build prospect lists from public directories |
| 📊 Market researchers | Map the competitive landscape |
| 🤖 ML engineers | Build training sets of real-world profiles |
| 📰 Journalists | Source GoodFirms profiles for stories |
| 👩‍💻 Developers | Mirror GoodFirms listings into your own DB |
| 🧑‍💼 Recruiters | Find talent, agencies, or vendors at scale |

### 📋 What the GoodFirms Companies Directory Scraper does

- Calls the public GoodFirms pages and parses the listing HTML or JSON payload.
- Walks pagination and follows each profile link.
- Extracts 12 fields per record - identifiers, contact info, ratings, and metadata.
- Surfaces upstream errors as a clean `error` record instead of crashing.
- Exports as CSV, Excel, JSON, JSONL, XML, RSS, or HTML.

> 💡 **Why it matters:** GoodFirms is a goldmine of public directory data, but the site is built for browsing, not bulk export. This actor turns it into a structured dataset in seconds.

### 🎬 Full Demo

_🚧 Coming soon._

### ⚙️ Input

<table>
<tr><th>Field</th><th>Type</th><th>Required</th><th>Description</th></tr>
<tr><td><code>category</code></td><td>string</td><td>No</td><td>GoodFirms category slug (e.g. "mobile-app-development").</td></tr>
<tr><td><code>maxItems</code></td><td>integer</td><td>No</td><td>Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000</td></tr>
<tr><td><code>country</code></td><td>string</td><td>No</td><td>Optional country filter.</td></tr>
<tr><td><code>minRating</code></td><td>number</td><td>No</td><td>Optional minimum rating (0 to 5).</td></tr>
</table>

**Example 1 - default run:**
```json
{
  "category": "mobile-app-development",
  "maxItems": 10,
  "minRating": 10
}
````

**Example 2 - larger pull:**

```json
{
  "category": "mobile-app-development",
  "maxItems": 50
}
```

> ⚠️ **Good to Know:** Free users are auto-limited to 10 items per run. Paid users can pull up to 1,000,000 records. Heavy runs may take longer due to GoodFirms's rate limits.

### 📊 Output

Each record is a flat object. Image URL is first, `error` is always last.

| Field | Type | Description |
|---|---|---|
| 🖼️ `logoUrl` | string | Logo image URL. |
| 🏢 `name` | string | Display name. |
| 🛠️ `services` | array | Services offered. |
| 💵 `hourlyRate` | string | Hourly rate. |
| 👥 `employees` | string | Number of employees. |
| 📍 `location` | string | Free-text location. |
| 📆 `founded` | integer | Year founded. |
| 🤝 `clients` | array | Notable clients. |
| ⭐ `rating` | number | Average star rating. |
| 🔗 `websiteUrl` | string | External website URL. |
| 🔗 `profileUrl` | string | Source profile URL. |
| 🕒 `scrapedAt` | string | When this row was fetched (ISO 8601). |
| ❌ `error` | string | Set if the upstream response was an error. |

**Sample record:**

```json
{
  "logoUrl": "https://cdn.example.com/img.jpg",
  "name": "Sample value",
  "services": [
    "item-1",
    "item-2"
  ],
  "hourlyRate": "https://example.com",
  "employees": "Sample value",
  "location": "Sample value",
  "founded": 42,
  "clients": [
    "item-1",
    "item-2"
  ],
  "rating": 4.8,
  "websiteUrl": "https://example.com",
  "profileUrl": "https://example.com",
  "scrapedAt": "2026-05-29T12:00:00.000Z",
  "error": null
}
```

### ✨ Why choose this Actor

| 🆓 | Free users get 10 items per run to evaluate before upgrading. |
| 🧹 | Cleaned and normalized fields - no scraping artifacts. |
| 🔢 | Numeric fields cast to numbers, dates to ISO, arrays preserved. |
| 🛟 | Upstream errors surfaced as clean `error` records, never crashes. |
| 🔌 | One input form, one click, dataset ready in seconds. |
| 💾 | Push to dataset → instant CSV / Excel / JSON / XML / RSS / HTML export. |

### 📈 How it compares to alternatives

| Approach | Setup time | Clean fields? | Pagination? | Rate-limit handling? |
|---|---|---|---|---|
| Manual copy/paste | hours per page | partial | manual | none |
| Roll your own scraper | 4+ hours | ❌ | ❌ | ❌ |
| **This Actor** | 5 sec, no install | ✅ | ✅ | ✅ |

### 🚀 How to use

1. Click **Try for free**.
2. Fill in the input form (or use prefilled defaults).
3. Click **Start**. Within seconds your dataset is ready - download as CSV, Excel, JSON, or XML, or pipe to your warehouse.
4. (Optional) Schedule it to re-run daily, weekly, or on a custom cron.

### 💼 Business use cases

**📊 Lead generation.** Build a fresh, structured prospect list from GoodFirms every week. No more manual copy-paste from a browser.

**🔍 Competitive intelligence.** Track who's listed on GoodFirms, with what services, and at what price point.

**🤖 ML feature engineering.** Build clean training sets of real-world profiles for matching, ranking, or recommendation models.

**📰 Editorial research.** Reporters can pull a directory snapshot in 30 seconds, then verify quotes and facts against the structured data.

### 🔌 Automating GoodFirms Companies Directory Scraper

- **Make / Zapier**: trigger this actor on a schedule, push results to Airtable, Google Sheets, HubSpot, or Slack.
- **Cron schedule**: native Apify scheduler - run nightly, weekly, or on any cron expression.
- **Webhooks**: get a POST to your endpoint the moment a run finishes.
- **Pipe to BigQuery / Snowflake / Postgres**: native Apify integrations move datasets straight into your warehouse.

### 🌟 Beyond business use cases

**🎓 Education.** Teach a data class? Have students pull their own GoodFirms dataset in 5 seconds and analyze it in pandas.

**🧪 Personal research.** Track your favourite freelancers, agencies, or speakers over time.

**🤝 Non-profit & open data.** Build public dashboards of who's working where and on what.

**🧰 Tinkering & prototyping.** Spin up a real dataset in seconds to test a new visualization library or BI tool.

### 🤖 Ask an AI assistant about this scraper

Pop this README into ChatGPT, Claude, or any AI assistant and ask it to map your specific workflow to the actor's inputs. The schema, examples, and field list above contain everything an LLM needs to design a working pipeline.

### ❓ Frequently Asked Questions

**❓ Do I need an account on GoodFirms?** No. This actor only reads public pages.

**❓ Is this allowed?** This actor scrapes only publicly available data. Users are responsible for complying with GoodFirms's terms of service and applicable law.

**❓ How many records can I pull?** Free plan: 10 per run (preview). Paid plans: up to 1,000,000.

**❓ Is there a rate limit?** GoodFirms may throttle aggressive requests. The actor uses respectful pacing and surfaces upstream errors as clean records.

**❓ Are values cleaned?** Yes. Numeric fields are cast to numbers, dates to ISO strings, arrays preserved as arrays.

**❓ How are errors handled?** If a profile fails to parse, we push a single record with `error` populated. The run never crashes mid-batch.

**❓ Can I schedule runs?** Yes - Apify's native scheduler, or hook this up to Make / Zapier / cron.

**❓ Will the schema change?** Core identifiers and contact fields are stable. New optional fields may be added; existing fields will not be renamed.

**❓ What format can I download?** CSV, Excel, JSON, JSONL, XML, RSS, or HTML - straight from the Apify dataset UI.

**❓ Can I filter by location, category, or rating?** Yes - see the **Input** section for the full list of supported filters.

### 🔌 Integrate with any app

Apify ships native integrations with Make, Zapier, Slack, Discord, Google Drive, Google Sheets, Gmail, Airbyte, Keboola, Telegram, GitHub, and any REST API or webhook endpoint. Trigger runs from a calendar event, a form submission, a cron job, or pipe results straight into BigQuery, Snowflake, or a Postgres warehouse.

### 🔗 Recommended Actors

| Actor | What it does |
|---|---|
| [ParseForge Alpha Vantage Scraper](https://apify.com/parseforge/alpha-vantage-public-scraper) | Public stock, FX, and crypto market data. |
| [ParseForge OurAirports Scraper](https://apify.com/parseforge/ourairports-scraper) | Global airport database. |
| [ParseForge FINRA BrokerCheck Scraper](https://apify.com/parseforge) | US broker and adviser public records. |
| [ParseForge FAA Aircraft Registry Scraper](https://apify.com/parseforge) | US civil aircraft registry. |

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for 900+ production-grade scrapers across business intelligence, real estate, e-commerce, sports, finance, and public records.

***

**Disclaimer:** This actor scrapes only publicly available data. ParseForge is not affiliated with, endorsed by, or sponsored by GoodFirms or any of the third-party services referenced. Users are responsible for complying with the target site's terms of service and applicable law. [Create a free account w/ $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp).

# Actor input Schema

## `category` (type: `string`):

GoodFirms category slug (e.g. "mobile-app-development").

## `maxItems` (type: `integer`):

Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000

## `country` (type: `string`):

Optional country filter.

## `minRating` (type: `number`):

Optional minimum rating (0 to 5).

## Actor input object example

```json
{
  "category": "mobile-app-development",
  "maxItems": 10
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "category": "mobile-app-development",
    "maxItems": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/goodfirms-companies-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "category": "mobile-app-development",
    "maxItems": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("parseforge/goodfirms-companies-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "category": "mobile-app-development",
  "maxItems": 10
}' |
apify call parseforge/goodfirms-companies-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/goodfirms-companies-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "GoodFirms Companies Directory Scraper",
        "description": "Pull ranked B2B service providers from GoodFirms by category slug, country, and minimum rating. Returns company name, services, location, team size, hourly rate, rating, review count, founded year, and profile URL. Built for vendor discovery, market mapping, and outbound sales prospecting.",
        "version": "0.1",
        "x-build-id": "APRfxGcXfzj5i3cwi"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~goodfirms-companies-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-goodfirms-companies-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~goodfirms-companies-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-goodfirms-companies-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~goodfirms-companies-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-goodfirms-companies-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "category": {
                        "title": "Category slug",
                        "type": "string",
                        "description": "GoodFirms category slug (e.g. \"mobile-app-development\")."
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000"
                    },
                    "country": {
                        "title": "Country",
                        "type": "string",
                        "description": "Optional country filter."
                    },
                    "minRating": {
                        "title": "Min rating",
                        "minimum": 0,
                        "maximum": 5,
                        "type": "number",
                        "description": "Optional minimum rating (0 to 5)."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
