# The Knot Marketplace Scraper (`kawsar/the-knot-marketplace-scraper`) Actor

The Knot Marketplace Scraper pulls vendor names, ratings, prices, and contact details from TheKnot.com listing pages, so you get a clean dataset for outreach or market research instead of clicking through pages.

- **URL**: https://apify.com/kawsar/the-knot-marketplace-scraper.md
- **Developed by:** [Kawsar](https://apify.com/kawsar) (community)
- **Categories:** Automation, Lead generation, Developer tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $3.99 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## The Knot Marketplace Scraper

The Knot Marketplace Scraper pulls wedding vendor data from TheKnot.com listing pages. Give it a category slug like `wedding-photographers` and a location like `new-york-ny`, and it returns each vendor's name, service area, star rating, review count, price tier, portfolio photos, and business bio across all result pages. Works with direct search URLs too, one or many.

Useful for building sales prospect lists, populating a CRM with local vendor contacts, or tracking which businesses show up in a given market.

---

### What it collects

Each vendor record includes:

- Vendor name
- Profile URL on TheKnot.com
- Service location (city and region as listed)
- Star rating and review count
- Price range (e.g. `$$ - Affordable`, `$$$$ - Luxury`)
- Business bio excerpt visible on the listing card
- Portfolio image URLs (up to 5)
- Awards (e.g. `Award winner (2x)`, Best of Weddings)
- Vendor badges (e.g. `Responds quickly`)
- Category name inferred from the search URL
- Source listing page URL and page number
- Timestamp of when the record was collected

---

### Input

You can either provide listing URLs directly, or let the actor build the URL from a category and location.

#### Option A: Start URLs

Paste one or more TheKnot.com marketplace listing URLs:

```json
{
  "startUrls": [
    "https://www.theknot.com/marketplace/wedding-photographers-new-york-ny",
    "https://www.theknot.com/marketplace/florists-los-angeles-ca"
  ]
}
````

#### Option B: Category and location slugs

```json
{
  "category": "wedding-djs",
  "location": "chicago-il"
}
```

The actor builds `https://www.theknot.com/marketplace/wedding-djs-chicago-il` from these two fields.

#### All input fields

| Field | Type | Default | Description |
|---|---|---|---|
| `startUrls` | string list | — | One or more TheKnot.com marketplace listing URLs |
| `category` | string | — | Category slug, e.g. `wedding-photographers` |
| `location` | string | — | City-state slug, e.g. `new-york-ny` |
| `maxPages` | integer | 3 | Pages of results to collect per URL (each page is ~30 vendors) |
| `maxItems` | integer | 100 | Total vendor records cap across all URLs |
| `requestTimeoutSecs` | integer | 60 | Per-page timeout in seconds |

***

### Category slugs

Some common categories on TheKnot.com:

| Category | Slug |
|---|---|
| Wedding Photographers | `wedding-photographers` |
| Videographers | `wedding-videographers` |
| Florists | `florists` |
| Wedding DJs | `wedding-djs` |
| Wedding Bands | `live-wedding-bands` |
| Photo Booths | `wedding-photo-booth-rentals` |
| Wedding Planners | `wedding-planners` |
| Catering | `catering` |
| Wedding Cakes | `wedding-cake-bakeries` |
| Reception Venues | `wedding-reception-venues` |
| Transportation | `transportation-services` |

Location slugs follow the pattern `{city}-{state-code}`, for example `san-diego-ca`, `austin-tx`, `miami-fl`.

***

### Example output

```json
{
  "vendorId": "373c48de-ff47-49a0-9ba4-3109eabcd795",
  "vendorName": "The Capture Corner",
  "vendorUrl": "https://www.theknot.com/marketplace/the-capture-corner-san-diego-ca-2088163",
  "location": "San Diego and all of SoCal",
  "rating": 5.0,
  "reviewCount": 79,
  "priceRange": "$$ - Affordable",
  "shortDescription": "Since 2018, we've helped couples across Southern California celebrate in style, capturing joyful moments and creating keepsakes your guests will love.",
  "imageUrls": [
    "https://media-api.xogrp.com/images/3c591157-272e-4c92-94b1-1b0f73804c01~sc_310.210?quality=70",
    "https://media-api.xogrp.com/images/44be28a5-c691-40bd-a379-5ce74b3d2b4b~sc_310.210?quality=70"
  ],
  "awards": ["Award winner (2x)"],
  "isBestOfWeddings": true,
  "badgeTexts": ["Responds quickly"],
  "categoryName": "Wedding Photo Booth Rentals",
  "pageUrl": "https://www.theknot.com/marketplace/wedding-photo-booth-rentals-san-diego-ca",
  "pageNumber": 1,
  "scrapedAt": "2026-05-12T10:30:00.000Z"
}
```

***

### Usage tips

**Scraping a full market:** Set `maxPages` to 20 and `maxItems` to 1000 to collect every vendor in a category and location. Most markets have 1-5 pages of results (30 vendors per page).

**Multi-city runs:** Add multiple URLs to `startUrls`, one per city. The actor processes them in order and stops when it hits `maxItems`.

**CRM import:** The output fields map cleanly to contact records. `vendorName` and `vendorUrl` give you the lead identity, `location` gives geography, `rating` and `reviewCount` help prioritize outreach.

**Filtering by awards:** The `isBestOfWeddings` boolean flags vendors with award recognition. Use it to filter the dataset to top-rated vendors only.

***

### Limitations

- TheKnot.com listing pages show the business bio as a short excerpt. Full descriptions are on the individual vendor profile pages.
- Phone numbers, email addresses, and website links are on individual vendor profile pages, not the listing cards.
- Results reflect what is publicly listed on TheKnot.com at the time of the run. Vendor data changes as businesses update their profiles.

***

### Legal notice

This actor collects publicly visible data from TheKnot.com listing pages. Use it in line with TheKnot.com's terms of service and applicable data regulations. Do not use collected data for spam or unsolicited bulk outreach.

# Actor input Schema

## `startUrls` (type: `array`):

One or more TheKnot.com marketplace listing URLs to scrape. Example: https://www.theknot.com/marketplace/wedding-photographers-new-york-ny

## `category` (type: `string`):

Wedding vendor category slug. Used to build the search URL when no startUrls are provided. Example: wedding-photographers, florists, wedding-djs, wedding-photo-booth-rentals

## `location` (type: `string`):

City and state slug. Combined with the category slug to build the search URL. Example: new-york-ny, los-angeles-ca, chicago-il

## `maxPages` (type: `integer`):

How many pages of results to collect per listing URL. Each page returns up to 30 vendors.

## `maxItems` (type: `integer`):

Maximum total number of vendor records to collect across all URLs.

## `requestTimeoutSecs` (type: `integer`):

Per-page timeout in seconds. Pages with JavaScript rendering take longer, so 60 seconds is recommended.

## Actor input object example

```json
{
  "startUrls": [
    "https://www.theknot.com/marketplace/wedding-photographers-new-york-ny"
  ],
  "category": "wedding-photographers",
  "location": "new-york-ny",
  "maxPages": 1,
  "maxItems": 25,
  "requestTimeoutSecs": 60
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        "https://www.theknot.com/marketplace/wedding-photographers-new-york-ny"
    ],
    "category": "wedding-photographers",
    "location": "new-york-ny"
};

// Run the Actor and wait for it to finish
const run = await client.actor("kawsar/the-knot-marketplace-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrls": ["https://www.theknot.com/marketplace/wedding-photographers-new-york-ny"],
    "category": "wedding-photographers",
    "location": "new-york-ny",
}

# Run the Actor and wait for it to finish
run = client.actor("kawsar/the-knot-marketplace-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "startUrls": [
    "https://www.theknot.com/marketplace/wedding-photographers-new-york-ny"
  ],
  "category": "wedding-photographers",
  "location": "new-york-ny"
}' |
apify call kawsar/the-knot-marketplace-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=kawsar/the-knot-marketplace-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "The Knot Marketplace Scraper",
        "description": "The Knot Marketplace Scraper pulls vendor names, ratings, prices, and contact details from TheKnot.com listing pages, so you get a clean dataset for outreach or market research instead of clicking through pages.",
        "version": "0.0",
        "x-build-id": "95NEBkvW2FgDCEEHa"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/kawsar~the-knot-marketplace-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-kawsar-the-knot-marketplace-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/kawsar~the-knot-marketplace-scraper/runs": {
            "post": {
                "operationId": "runs-sync-kawsar-the-knot-marketplace-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/kawsar~the-knot-marketplace-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-kawsar-the-knot-marketplace-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "startUrls": {
                        "title": "Start URLs",
                        "type": "array",
                        "description": "One or more TheKnot.com marketplace listing URLs to scrape. Example: https://www.theknot.com/marketplace/wedding-photographers-new-york-ny",
                        "items": {
                            "type": "string"
                        }
                    },
                    "category": {
                        "title": "Category slug",
                        "type": "string",
                        "description": "Wedding vendor category slug. Used to build the search URL when no startUrls are provided. Example: wedding-photographers, florists, wedding-djs, wedding-photo-booth-rentals"
                    },
                    "location": {
                        "title": "Location slug",
                        "type": "string",
                        "description": "City and state slug. Combined with the category slug to build the search URL. Example: new-york-ny, los-angeles-ca, chicago-il"
                    },
                    "maxPages": {
                        "title": "Max pages per URL",
                        "minimum": 1,
                        "maximum": 20,
                        "type": "integer",
                        "description": "How many pages of results to collect per listing URL. Each page returns up to 30 vendors.",
                        "default": 1
                    },
                    "maxItems": {
                        "title": "Max items",
                        "minimum": 1,
                        "maximum": 1000,
                        "type": "integer",
                        "description": "Maximum total number of vendor records to collect across all URLs.",
                        "default": 25
                    },
                    "requestTimeoutSecs": {
                        "title": "Request timeout (seconds)",
                        "minimum": 15,
                        "maximum": 120,
                        "type": "integer",
                        "description": "Per-page timeout in seconds. Pages with JavaScript rendering take longer, so 60 seconds is recommended.",
                        "default": 60
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
