# Podcast Email Scraper - Host & Contact Finder (`ritaya/podcast-email-scraper`) Actor

Find podcasts by topic and extract the host/business email, author, website and show metadata from the podcast RSS feed. No login.

- **URL**: https://apify.com/ritaya/podcast-email-scraper.md
- **Developed by:** [Zane](https://apify.com/ritaya) (community)
- **Categories:** Lead generation, Social media
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Podcast Email Scraper - Host & Contact Finder

**Podcast email scraper** and **host contact finder**: discover podcasts by topic, then
extract the **host / business email**, author, website, genre and show metadata straight
from each show's RSS feed. Built for **podcast guesting, sponsorship outreach, PR and
B2B-to-creator lead generation**.

Keywords: podcast email scraper, podcast host email, podcast contact finder, podcast
outreach leads, sponsorship prospecting, podcast guesting, apple podcasts scraper.

### Why the email hit-rate is near 100%

Discovery uses the keyless **iTunes Search API** (returns the RSS `feedUrl` + metadata).
Apple **mandates** an owner email in every podcast RSS feed:

```xml
<itunes:owner><itunes:email>host@show.com</itunes:email><itunes:name>Host</itunes:name></itunes:owner>
````

so the host/business email is public and present for ~all shows. No login, no anti-bot,
**no residential proxy** — the iTunes API and RSS feeds are open. Validated live: 6/6 shows
returned a real host email on the first sample.

### Input

| field | meaning |
|---|---|
| `searchTerms` | topics/keywords to discover podcasts by |
| `itunesIds` / `feedUrls` | enrich specific shows directly |
| `country` | Apple storefront (US, GB, DE, …) |
| `genre` | keep only shows whose primary genre matches |
| `maxPerTerm` | shows per keyword (1–200) |
| `minEpisodes` | drop inactive shows below this episode count |
| `requireEmail` | only return (and charge for) shows with a host email |
| `concurrency` | feeds fetched in parallel (default 8) |

### Output (one item per podcast)

```json
{
  "name": "The Personal Finance Podcast", "author": "Andrew Giancola",
  "ownerEmail": "giancola.andrew@gmail.com", "ownerName": "Andrew Giancola",
  "website": "https://mastermoney.co", "language": "en-us",
  "genre": "Investing", "genres": ["Business"], "episodeCount": 546,
  "itunesId": 1496681088, "itunesUrl": "https://podcasts.apple.com/...",
  "artworkUrl": "https://...", "lastReleaseDate": "2026-06-25T...", "sourceTerm": "personal finance"
}
```

### Pricing (PAY\_PER\_EVENT)

| event | suggested price | when |
|---|---|---|
| `podcast-scraped` | $0.03 | every podcast returned (host email almost always included) |

Single event keeps it simple — owner email is present for ~all shows. Undercuts
ryanclinton ($0.05/podcast) while platform cost is tiny (no residential proxy), so margin
is near-pure. Use `requireEmail: true` to only pay for shows that expose an email.

### Run locally

```bash
mkdir -p storage/key_value_stores/default
cp .actor/input.example.json storage/key_value_stores/default/INPUT.json
APIFY_LOCAL_STORAGE_DIR="$PWD/storage" \
  uv run --with apify --with "httpx[http2]" python -m src.main
```

### Deploy

```bash
apify push
```

# Actor input Schema

## `searchTerms` (type: `array`):

Discover podcasts by topic, e.g. "personal finance", "true crime". One search per item.

## `itunesIds` (type: `array`):

Enrich specific shows by their Apple Podcasts collection IDs.

## `feedUrls` (type: `array`):

Enrich specific shows directly by RSS feed URL.

## `country` (type: `string`):

2-letter country code for the iTunes storefront (US, GB, DE, ...).

## `genre` (type: `string`):

Keep only shows whose primary genre contains this text, e.g. "Business".

## `maxPerTerm` (type: `integer`):

How many shows to take from each search keyword (1-200).

## `minEpisodes` (type: `integer`):

Drop shows below this episode count (0 = no filter). Filters out inactive shows.

## `requireEmail` (type: `boolean`):

Skip (and do not charge for) shows whose RSS feed exposes no owner email.

## `concurrency` (type: `integer`):

Number of feeds fetched in parallel (1-30).

## `proxyConfiguration` (type: `object`):

Not required — iTunes API and RSS feeds are open. Add a proxy only for very high volume.

## Actor input object example

```json
{
  "searchTerms": [
    "personal finance"
  ],
  "country": "US",
  "maxPerTerm": 25,
  "minEpisodes": 0,
  "requireEmail": false,
  "concurrency": 8
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "searchTerms": [
        "personal finance"
    ]
};

// Run the Actor and wait for it to finish
const run = await client.actor("ritaya/podcast-email-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "searchTerms": ["personal finance"] }

# Run the Actor and wait for it to finish
run = client.actor("ritaya/podcast-email-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "searchTerms": [
    "personal finance"
  ]
}' |
apify call ritaya/podcast-email-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=ritaya/podcast-email-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Podcast Email Scraper - Host & Contact Finder",
        "description": "Find podcasts by topic and extract the host/business email, author, website and show metadata from the podcast RSS feed. No login.",
        "version": "0.1",
        "x-build-id": "Aq3MKBo5cvTZ0RJil"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/ritaya~podcast-email-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-ritaya-podcast-email-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/ritaya~podcast-email-scraper/runs": {
            "post": {
                "operationId": "runs-sync-ritaya-podcast-email-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/ritaya~podcast-email-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-ritaya-podcast-email-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "searchTerms": {
                        "title": "Search topics / keywords",
                        "type": "array",
                        "description": "Discover podcasts by topic, e.g. \"personal finance\", \"true crime\". One search per item.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "itunesIds": {
                        "title": "Apple Podcasts IDs (optional)",
                        "type": "array",
                        "description": "Enrich specific shows by their Apple Podcasts collection IDs.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "feedUrls": {
                        "title": "RSS feed URLs (optional)",
                        "type": "array",
                        "description": "Enrich specific shows directly by RSS feed URL.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "country": {
                        "title": "Apple storefront country",
                        "type": "string",
                        "description": "2-letter country code for the iTunes storefront (US, GB, DE, ...).",
                        "default": "US"
                    },
                    "genre": {
                        "title": "Genre filter (optional)",
                        "type": "string",
                        "description": "Keep only shows whose primary genre contains this text, e.g. \"Business\"."
                    },
                    "maxPerTerm": {
                        "title": "Max podcasts per keyword",
                        "minimum": 1,
                        "maximum": 200,
                        "type": "integer",
                        "description": "How many shows to take from each search keyword (1-200).",
                        "default": 25
                    },
                    "minEpisodes": {
                        "title": "Minimum episodes",
                        "type": "integer",
                        "description": "Drop shows below this episode count (0 = no filter). Filters out inactive shows.",
                        "default": 0
                    },
                    "requireEmail": {
                        "title": "Only return shows with a host email",
                        "type": "boolean",
                        "description": "Skip (and do not charge for) shows whose RSS feed exposes no owner email.",
                        "default": false
                    },
                    "concurrency": {
                        "title": "Concurrency",
                        "minimum": 1,
                        "maximum": 30,
                        "type": "integer",
                        "description": "Number of feeds fetched in parallel (1-30).",
                        "default": 8
                    },
                    "proxyConfiguration": {
                        "title": "Proxy (optional)",
                        "type": "object",
                        "description": "Not required — iTunes API and RSS feeds are open. Add a proxy only for very high volume."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
