TV Tropes Scraper — Tropes, Examples & Page-Change Monitor avatar

TV Tropes Scraper — Tropes, Examples & Page-Change Monitor

Pricing

from $4.00 / 1,000 per record returneds

Go to Apify Store
TV Tropes Scraper — Tropes, Examples & Page-Change Monitor

TV Tropes Scraper — Tropes, Examples & Page-Change Monitor

Scrape TV Tropes (which has no public API) — trope and work pages with title, description, examples & cross-linked tropes — by URL or by crawling a whole namespace. Monitor pages for edits on a schedule with Slack/email/webhook alerts. $4 per 1,000 pages.

Pricing

from $4.00 / 1,000 per record returneds

Rating

0.0

(0)

Developer

Scrapers Delight

Scrapers Delight

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

10 hours ago

Last modified

Share

📚 TV Tropes Scraper — Tropes, Examples & Change Monitor

Turn TV Tropes into structured data — it has no public API, but this actor reads its pages directly: trope and work pages with title, description, the full list of examples, and every cross-linked trope/work. Scrape by URL, or crawl a whole namespace. Run it on a schedule to monitor pages for edits and get Slack / email / webhook alerts.

No login, no JavaScript rendering — TV Tropes pages are server-rendered HTML.


What does it do?

For each trope or work page it returns clean, structured data you can export to JSON, CSV, Excel, or push to your app via API:

  • 🏷️ Title, namespace, page type (trope / Film / Series / VideoGame / …)
  • 📝 Description (the page's intro paragraphs)
  • 📋 Examples — the list entries (+ folder section labels)
  • 🔗 Cross-links — every linked trope/work with its {namespace, name, url}
  • 🧬 Content hash for change detection

What data does it extract?

For every page: url, namespace, name, title, page_type, description, links[] ({text, namespace, name, url}), link_count, examples[], example_count, folders[], content_hash, change (monitor: new / updated / unchanged), scraped_at.


Who is it for?

  • 🤖 AI / ML / dataset builders — trope graphs and example corpora are popular for NLP and story-modeling.
  • ✍️ Writers & researchers mapping tropes across works.
  • 📊 Wiki analysts tracking how pages and examples evolve over time (the wiki edits daily).

How to use it (step by step)

  1. Click Try for free.
  2. Paste page URLs (or Namespace/Name) — or set a namespace to crawl (e.g. Main, Film).
  3. (Optional) toggle examples / cross-links.
  4. Click Start, open the Dataset tab to view/export.
  5. (Optional) set monitorMode + a Schedule to get alerts when pages change.

Quick start

{ "pageUrls": ["Main/ChekhovsGun", "Series/BreakingBad"] }

Input

FieldWhat it does
pageUrlsTV Tropes page URLs / Namespace/Name
namespace, namespaceStartLettercrawl a whole namespace via the index
includeExamples, includeLinkstoggle example list / cross-links
maxPageshard cap per run (0 = all)
monitorMode, alertOnChangerecurring change watcher + alerts
webhookUrl, slackWebhookUrl, emailRecipientsalert channels
proxyConfiguration, requestConcurrencyproxy + parallelism

Output

Each page is one dataset record (fields above). Export to JSON, CSV, Excel, HTML, or RSS, or fetch via the Apify API.


How much does it cost?

Pay-per-event — you pay for what you pull:

EventWhat it coversSuggested price
lot-scrapedeach page returned~$0.003 / page
monitor-run-completedeach scheduled watch run~$0.05 / run
new-lot-detectedeach new/changed page~$0.02 / page
alert-deliveredeach Slack/email/webhook push~$0.005 / alert

(Final per-event prices are set on the actor's pricing page.)


TV Tropes content is publicly viewable and contributed under the site's terms (Creative Commons BY-NC-SA). This actor reads public pages — no personal data. Scraping public pages is generally legal, but you are responsible for your use — review TV Tropes' Terms of Service, attribute, and respect the non-commercial share-alike license when redistributing.


FAQ

TV Tropes has no API — how does this work? It reads the server-rendered HTML pages directly and parses them, so you get structured data without an API.

Can I crawl a whole category? Yes — set namespace (e.g. Main for all tropes) and a maxPages cap; the actor enumerates the index.

Can I detect when a page changes? Yes — monitorMode hashes each page and emits only new/changed pages, with alerts.

How do I export? JSON, CSV, Excel, HTML, or RSS from the Dataset tab, or via the Apify API.


Feedback

Want laconic/quote extraction, image lists, or per-folder example grouping? Open an issue on the actor.