Custom Conference Speaker Scraper (Saad Belcaid)
Pricing
Pay per usage
Custom Conference Speaker Scraper (Saad Belcaid)
Get speaker data from conference websites. Names, titles, companies, bios, and photos — ready for outreach, research, or enrichment.
Pricing
Pay per usage
Rating
0.0
(0)
Developer

Saad Belcaid
Actor stats
2
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Categories
Share
Conference Speaker Scraper
Extract structured speaker data from any conference website. Point it at a speakers page, get back names, titles, companies, bios, and photos — ready for outreach, enrichment, or market analysis.
What it does
Most conference websites are JavaScript-rendered SPAs (React, Next.js, Swapcard, Squarespace). This scraper uses Playwright to render the page like a real browser, then auto-detects speaker cards and extracts structured data.
No configuration needed for most sites. The scraper automatically:
- Detects speaker card layouts (tested on Web Summit, SaaStr, TechCrunch Disrupt, and more)
- Finds names, job titles, companies, photos, and bios
- Handles pagination (Load More buttons, numbered pages, infinite scroll)
- Dismisses cookie banners
- Deduplicates across pages
- Resolves relative URLs to absolute
For custom or unusual layouts, you can provide CSS selector overrides.
Output
Each speaker record contains:
| Field | Description | Example |
|---|---|---|
first_name | First name | Anton |
last_name | Last name | Osika |
full_name | Full name | Anton Osika |
job_title | Role / position | Co-founder & CEO |
company_name | Company or organization | Lovable |
event_name | Conference name (from input) | Web Summit 2026 |
photo_url | Speaker headshot URL | https://... |
bio | Speaker biography | |
session_topic | Talk or panel topic | |
speaker_url | Link to speaker detail page | https://websummit.com/attendees/... |
linkedin_url | LinkedIn profile (if on page) |
Example
Input:
{"speakersUrl": "https://websummit.com/speakers","eventName": "Web Summit 2026","maxSpeakers": 50}
Output (50 speakers in ~23 seconds):
Anton Osika — Co-founder & CEO @ LovableAlex Schultz — CMO @ MetaCristobal Valenzuela — Founder & CEO @ RunwayToto Wolff — Team Principal & CEO @ Mercedes-AMG PETRONAS F1 TeamMatthew Prince — Co-founder & CEO @ CloudflareBrad Smith — President & Vice Chair @ MicrosoftRobert Playter — CEO @ Boston DynamicsRose Wang — COO @ Bluesky...
Use cases
Sales & BD teams
Conference speakers are decision-makers publicly broadcasting their priorities. Scrape the speakers list before a conference, enrich with email addresses, and reach out with context: "Saw you're speaking at [conference] about [topic] — we work with teams at that exact stage."
Investor research
Track which founders and executives are on the conference circuit. Speaking at tier-1 conferences often signals fundraising activity, product launches, or market expansion.
Market intelligence
Map which companies are investing in conference presence across your industry. Spot emerging players, track competitor activity, and identify potential partners.
Recruiting
Conference speakers are proven domain experts who are actively visible. Build targeted talent pipelines from industry-specific events.
Connector OS integration
Feed the dataset directly into Connector OS Station to run market matching. The scraper output is designed to plug into the Connector OS I Layer — speakers become demand or supply signals that the matching engine evaluates against your market. Conference speakers are some of the highest-signal records you can feed into the system: they're decision-makers, they're active, and their session topics tell you exactly what they care about.
Flow: Scrape speakers → Get Dataset ID → Paste into Station → Run matching → Get scored introductions with reasoning.
Supported sites
Tested on:
- Web Summit (Swapcard platform, React SPA)
- SaaStr Annual (Squarespace)
- TechCrunch Disrupt (WordPress)
- Money20/20 (Next.js)
- Most conference sites with standard speaker card layouts
The auto-detection handles styled-components, BEM, and custom class naming conventions. If a site uses an unusual layout, use the speakerCardSelector override.
Input options
| Field | Required | Description |
|---|---|---|
speakersUrl | Yes | URL of the conference speakers page |
eventName | Yes | Name of the conference (included in output) |
maxSpeakers | No | Max speakers to scrape (default: 500) |
waitForSelector | No | CSS selector to wait for before scraping |
speakerCardSelector | No | Override: CSS selector for speaker cards |
paginationSelector | No | Override: CSS selector for Next/Load More button |
proxyConfig | No | Apify proxy configuration |
Cost
Typical runs use ~0.05 compute units (a few cents). Most conferences have 50-500 speakers, which takes 20-120 seconds to scrape.