# WKO Firmen Austrian Business Directory Scraper (`scrapyspider/firmen-wko-at-scraper`) Actor

Scrape business listings from firmen.wko.at — Austria's official WKO business directory. Extract company names, addresses, phones, emails, websites, and full profile details from any industry and region. No account required.

- **URL**: https://apify.com/scrapyspider/firmen-wko-at-scraper.md
- **Developed by:** [ScrapySpider](https://apify.com/scrapyspider) (community)
- **Categories:** Lead generation, Jobs
- **Stats:** 3 total users, 0 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### WKO Firmen Austrian Business Directory Scraper

Scrapes business listings from firmen.wko.at — Austria's official WKO business directory — and returns structured JSON with company names, contact details, addresses, and full profile data including social links and opening hours.

- No account or login required
- Covers all 9 Austrian federal states (Bundesländer)
- Scrapes both listing pages and full company profile pages
- Extracts phones, emails, websites, addresses, social links, and opening hours
- Automatic "Load More" pagination — gets all results, not just the first page
- Built-in deduplication — no duplicate company records
- Export as JSON, CSV, or Excel

#### What data does it extract?

Each result contains:

**Company info:** company name, WKO profile link, contact name/title

**Contact details:** phone numbers (array), email address, company website URL

**Address:** full address string, street, place, ZIP code, city

**Profile details:** description/about text, social media links (Facebook, Instagram, LinkedIn, etc.), opening hours, WKO categories, company registration number (where available)

**Meta:** source listing URL, profile page URL, page number, scraped timestamp

---

#### Use cases

- **Lead generation:** Build targeted B2B contact lists from Austrian businesses by industry and region
- **Market research:** Analyze the Austrian business landscape — which industries are active in which regions
- **Sales prospecting:** Find roofers, plumbers, lawyers, or any trade in any Austrian state with full contact info
- **Data enrichment:** Append verified Austrian company contact data to existing CRM records
- **Competitive intelligence:** Monitor businesses in your industry across Austrian regions
- **Directory building:** Create niche business directories for Austrian market verticals

---

#### How to use

1. Click **Try for free** above
2. In the **Input** tab, paste one or more `firmen.wko.at` directory URLs — one per line
3. Set **Max Concurrency** (default 3 is fine for most runs)
4. Click **Start** and wait for the run to complete (typically 1–5 minutes per 100 companies)
5. Download results as JSON, CSV, or Excel from the **Output** tab

**URL format:** `https://firmen.wko.at/{industry}/{region}` — for example:
- `https://firmen.wko.at/dachdecker/wien` → roofers in Vienna
- `https://firmen.wko.at/rechtsanwaelte/salzburg` → lawyers in Salzburg
- `https://firmen.wko.at/elektriker/tirol` → electricians in Tyrol

---

#### Input parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| `startUrls` | Array of URLs | Yes | `https://firmen.wko.at/dachdecker/salzburg` | One or more firmen.wko.at directory or search URLs to scrape |
| `maxConcurrency` | Integer | No | `3` | Number of pages processed in parallel. Range: 1–20. |
| `proxy` | Object | No | Apify automatic proxy | Proxy configuration. Leave as default for best reliability. |

---

#### Output example

```json
{
  "companyName": "Muster Dachdecker GmbH",
  "companyLink": "https://firmen.wko.at/muster-dachdecker-gmbh/salzburg/?firmaid=abc123",
  "name": "Max Mustermann",
  "phones": ["+43 662 123456", "+43 664 9876543"],
  "email": "office@muster-dachdecker.at",
  "url": "https://www.muster-dachdecker.at",
  "fullAddress": "Mustergasse 12 5020 Salzburg",
  "street": "Mustergasse 12",
  "place": "5020 Salzburg",
  "zip": "5020",
  "city": "Salzburg",
  "pageUrl": "https://firmen.wko.at/dachdecker/salzburg",
  "pageNumber": 1,
  "description": "Ihr zuverlässiger Dachdecker in Salzburg seit 1985.",
  "socialLinks": ["https://www.facebook.com/musterdachdecker"],
  "registrationNumber": "FN 123456a",
  "openingHours": ["Mo-Fr 08:00-17:00"],
  "categories": ["Dachdecker", "Spengler"],
  "profilePageUrl": "https://firmen.wko.at/muster-dachdecker-gmbh/salzburg/?firmaid=abc123",
  "scrapedAt": "2025-10-12T10:41:58.055Z"
}
````

***

#### Pricing

This Actor is **free to use** — you only pay for Apify platform compute time and proxy usage.

A typical run scraping 100 company profiles costs approximately $0.05–$0.15 in Apify platform credits.

New Apify accounts receive $5 in free credits — enough for several hundred company records.

***

#### Technical notes

- **No account needed:** This Actor does not require a WKO login or cookies. A consent cookie is set automatically to bypass the cookie banner.
- **Proxies:** The Actor uses Apify's automatic proxy by default. For large-volume runs, residential proxies are recommended.
- **Pagination:** The scraper automatically clicks the "Mehr laden" (Load More) button until all results are loaded.
- **Profile pages:** After scraping all listing results, the Actor follows each company's WKO profile link for additional data. This doubles the requests but significantly enriches the output.
- **Rate limits:** firmen.wko.at is a public directory — rate limiting is rarely an issue for normal runs. Keep `maxConcurrency` at 3–5 for safe operation.
- **Data availability:** Not all companies have emails, websites, or social links. Missing fields are omitted or returned as null.

***

#### Support

Have questions or found a bug? Reach out:

- **Email:** ScrapySpider@protonmail.com
- **Website:** ScrapySpider.com
- **Apify:** Open a support issue on this Actor page
- **Response time:** Within 24–48 hours on weekdays

# Actor input Schema

## `startUrls` (type: `array`):

One or more firmen.wko.at directory URLs to scrape. Use the pattern: https://firmen.wko.at/{industry}/{region} (e.g. https://firmen.wko.at/dachdecker/salzburg). You can also use search URLs like https://firmen.wko.at/-/?branche=25241

## `maxConcurrency` (type: `integer`):

Maximum number of pages processed in parallel. Lower values are more reliable; higher values are faster. Recommended: 3–5.

## `proxy` (type: `object`):

Proxy settings for the scraper. Defaults to Apify automatic proxy. Recommended to leave as default for best reliability.

## Actor input object example

```json
{
  "startUrls": [
    {
      "url": "https://firmen.wko.at/dachdecker/salzburg"
    }
  ],
  "maxConcurrency": 3,
  "proxy": {
    "useApifyProxy": true
  }
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        {
            "url": "https://firmen.wko.at/dachdecker/salzburg"
        }
    ],
    "proxy": {
        "useApifyProxy": true
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("scrapyspider/firmen-wko-at-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrls": [{ "url": "https://firmen.wko.at/dachdecker/salzburg" }],
    "proxy": { "useApifyProxy": True },
}

# Run the Actor and wait for it to finish
run = client.actor("scrapyspider/firmen-wko-at-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "startUrls": [
    {
      "url": "https://firmen.wko.at/dachdecker/salzburg"
    }
  ],
  "proxy": {
    "useApifyProxy": true
  }
}' |
apify call scrapyspider/firmen-wko-at-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=scrapyspider/firmen-wko-at-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "WKO Firmen Austrian Business Directory Scraper",
        "description": "Scrape business listings from firmen.wko.at — Austria's official WKO business directory. Extract company names, addresses, phones, emails, websites, and full profile details from any industry and region. No account required.",
        "version": "0.1",
        "x-build-id": "QhWje7ecEnDm97yEa"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/scrapyspider~firmen-wko-at-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-scrapyspider-firmen-wko-at-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/scrapyspider~firmen-wko-at-scraper/runs": {
            "post": {
                "operationId": "runs-sync-scrapyspider-firmen-wko-at-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/scrapyspider~firmen-wko-at-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-scrapyspider-firmen-wko-at-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "startUrls"
                ],
                "properties": {
                    "startUrls": {
                        "title": "Start URLs",
                        "type": "array",
                        "description": "One or more firmen.wko.at directory URLs to scrape. Use the pattern: https://firmen.wko.at/{industry}/{region} (e.g. https://firmen.wko.at/dachdecker/salzburg). You can also use search URLs like https://firmen.wko.at/-/?branche=25241",
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    },
                    "maxConcurrency": {
                        "title": "Max Concurrency",
                        "minimum": 1,
                        "maximum": 20,
                        "type": "integer",
                        "description": "Maximum number of pages processed in parallel. Lower values are more reliable; higher values are faster. Recommended: 3–5.",
                        "default": 3
                    },
                    "proxy": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Proxy settings for the scraper. Defaults to Apify automatic proxy. Recommended to leave as default for best reliability."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
