Speaker Bureau Directory Scraper - Keynote Speakers & Fees avatar

Speaker Bureau Directory Scraper - Keynote Speakers & Fees

Pricing

Pay per event

Go to Apify Store
Speaker Bureau Directory Scraper - Keynote Speakers & Fees

Speaker Bureau Directory Scraper - Keynote Speakers & Fees

Scrape keynote speaker profiles from major US speakers bureaus. Extract speaker name, tagline, live and virtual fee ranges, travel region, topics, categories, bio, books, profile photo, and bureau booking URL. Built for event planners, PR firms, and competing bureaus.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Speaker Bureau Directory Scraper

Scrape keynote speaker profiles from the All American Speakers Bureau directory. Returns name, tagline, live and virtual fee ranges, travel region, topics, categories, full biography, books authored, and the bureau booking URL for ~16,500 speakers.


Speaker Bureau Directory Scraper Features

  • Extracts 16+ structured fields per speaker profile, including separate live and virtual fee ranges
  • Pulls a normalized topics array and a separate categories taxonomy — most directories give you one or the other
  • Returns the full biography as plain text. No HTML to clean up
  • Lists every book on the speaker's profile page, plus YouTube and Vimeo video URLs
  • Sources from a public sitemap — no proxies, no browser, no CAPTCHA dance
  • Configurable scope: scrape the whole roster (16k speakers) or pin specific profile URLs

Who Uses Speaker Bureau Data?

  • Corporate event planners — Build keynote shortlists with budget bands already attached. The fee fields alone save you ten contact-form submissions
  • PR and media bookers — Source guests for podcasts, panels, and trade press. Eighty percent of speaker outreach goes unanswered, so pre-qualified contact data has value
  • Competing speakers bureaus — Track talent rosters and benchmark fee positioning. Most A-list speakers are multi-bureau represented anyway
  • Sales teams targeting authors and executives — Speakers are a clean buyer-persona slice. Their booking pages double as a lead-source

How the Speaker Bureau Directory Scraper Works

  1. Fetch sitemap — Reads the bureau's public sitemap.xml to discover every speaker profile URL
  2. Filter and seed — Filters to the /speakers/{id}/... URL pattern, slices to maxItems, queues each profile
  3. Extract per profile — Loads each profile page and pulls speaker name, fees, topics, categories, biography, books, and videos into a flat record
  4. Save — Emits one JSON record per speaker to the dataset, tagged with bureau: "aae" so future versions can layer additional bureaus into the same output

Pass a list of direct profile URLs to skip sitemap discovery and crawl only what you specify.


Input

{
"bureau": "aae",
"maxItems": 10
}
FieldTypeDefaultDescription
bureaustring"aae"Source bureau. v1 supports aae (All American Speakers Bureau, ~16k speakers). Additional bureaus arrive in subsequent releases.
maxItemsinteger10Maximum number of speaker records to return. Default is intentionally low so a single run finishes within Apify's 5-minute tester window. Increase for larger crawls.
profileUrlsarray[]Optional list of direct speaker profile URLs. When provided, the scraper skips sitemap discovery and crawls only these URLs.

Targeted scrape — specific speakers only

{
"bureau": "aae",
"maxItems": 3,
"profileUrls": [
{ "url": "https://www.allamericanspeakers.com/speakers/389198/Science-Bob-Pflugfelder" },
{ "url": "https://www.allamericanspeakers.com/speakers/385009/3OH%213" }
]
}

Speaker Bureau Directory Scraper Output Fields

{
"speaker_name": "\"Science Bob\" Pflugfelder",
"tagline": "Known as \"Science Bob\"; Science Teacher & TV Personality; Co-Author of the \"Nick & Tesla\" Series for Young Readers",
"bureau": "aae",
"bureau_label": "All American Speakers Bureau",
"bureau_speaker_id": "389198",
"profile_url": "https://www.allamericanspeakers.com/speakers/389198/%22Science-Bob%22-Pflugfelder",
"profile_image_url": "https://thumbnails.aaehq.com/t_face_aas_md/.../2018_bobpflugfelder_headshot.png",
"fee_range_live": "$10,000 - $20,000",
"fee_range_virtual": "$5,000 - $10,000",
"travel_region": "San Francisco, CA, USA",
"topics": ["STEM (STEAM) Education", "Science Demonstrations And Innovations"],
"categories": ["Education", "Science", "STEM", "STEM Education", "Technology"],
"bio": "Bob Pflugfelder, known as \"Science Bob,\" is a science teacher, author...",
"books_authored": [
"Nick and Tesla's High-Voltage Danger Lab",
"Nick and Tesla's Robot Army Rampage"
],
"featured_videos": ["https://www.youtube.com/watch?v=sAGm50Cvw9g"],
"bureau_booking_url": "https://www.allamericanspeakers.com/contact-us",
"scraped_at": "2026-04-30T10:33:46.945Z"
}
FieldTypeDescription
speaker_namestringSpeaker's full name as shown on the bureau profile
taglinestringShort positioning line (e.g. "CEO of...", "Author of...", "The King of Negotiators")
bureaustringSource bureau slug (aae for All American Speakers Bureau)
bureau_labelstringHuman-readable bureau name
bureau_speaker_idstringNumeric speaker ID assigned by the bureau (stable, from URL)
profile_urlstringCanonical URL of the speaker's profile page
profile_image_urlstringSpeaker headshot URL
fee_range_livestringSpeaking fee range for in-person events (e.g. $10,000 - $20,000, Please Contact)
fee_range_virtualstringSpeaking fee range for virtual / online events
travel_regionstringWhere the speaker travels from, as published
topicsarray of stringsSpeaking topics offered by the speaker
categoriesarray of stringsBureau taxonomy tags the speaker is filed under
biostringFull biography, plain text (HTML stripped)
books_authoredarray of stringsBook titles authored or co-authored by the speaker
featured_videosarray of stringsYouTube / Vimeo URLs of featured speaker videos
bureau_booking_urlstringURL where event planners can request booking through the bureau
scraped_atstringISO 8601 timestamp when the record was extracted

FAQ

How do I scrape allamericanspeakers.com?

The Speaker Bureau Directory Scraper pulls profiles directly from the public AAE sitemap. Set bureau: "aae", pick a maxItems cap, and run. No login, no proxy, no CAPTCHA solver required.

How much does the Speaker Bureau Directory Scraper cost to run?

Pricing is pay-per-event: $0.10 per actor start plus $0.001 per speaker record. A 100-speaker run costs about $0.20. The full ~16,500-speaker AAE roster is around $16.60.

Can I scrape only specific speakers?

Yes. Pass a profileUrls array of direct speaker URLs and the scraper skips sitemap discovery, fetching only what you list. Useful for refreshing a known watchlist or backfilling a CRM.

Does the Speaker Bureau Directory Scraper need proxies?

No. The target site is public, server-rendered, and behind a Cloudflare CDN — but it doesn't gate the profile pages behind a managed challenge. Plain HTTP requests work, which is more than half of "scraper-ready" sites can claim.

Why doesn't the scraper return agent name, agent email, or speaker direct email?

AAE doesn't publish per-speaker agent contact cards or speaker direct emails on the public profile page — they're gated behind the bureau's contact form. The scraper returns the bureau-wide bureau_booking_url instead, which is the actual path to a booking on this site.

Will more bureaus be supported?

Yes. The dataset schema already carries a bureau field on every record so additional bureaus (BigSpeak, Premier, WSB, Harry Walker) can be layered in without breaking existing output. v1 ships AAE because it's the largest and the cleanest to extract.


Need More Features?

Need additional bureaus, custom fields, or a different filter? File an issue or get in touch.

Why Use the Speaker Bureau Directory Scraper?

  • Affordable — $0.001 per speaker record, $0.10 per run start
  • Multi-bureau ready — Output schema includes a bureau tag on every record, so adding sources doesn't break consumers. Most aggregator scrapers can't say that
  • Structured fees — Returns separate fee_range_live and fee_range_virtual fields instead of one free-text string. The virtual line item didn't exist before 2020; pretending it doesn't isn't an option anymore