GPT Scraper avatar

GPT Scraper

Try for free

Pay $9.00 for 1,000 pages

View all Actors
GPT Scraper

GPT Scraper

drobnikj/gpt-scraper
Try for free

Pay $9.00 for 1,000 pages

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Do you want to learn more about this Actor?

Get a demo
JB

Not able to scrape filmot.com

Closed

jen_brown_done opened this issue
3 months ago

When I try to scrape based on a specific url, i get 3 total results but the page is a full grid of text.

svpetrenko avatar

Hi! This is because this scraper uses GPT-4o mini. It has limited context length. The max context allowed is around 8k tokens, while the length required to scrape your page is 70+k tokens. I'd recommend using Extended GPT Scraper where you'd select GPT-4o (without mini) model. Here is an example run with more results https://console.apify.com/view/runs/nOYNHy1su3IGo72Bo (it also has json schema for output, you may find it helpful).

As an alternative, you may reduce content being sent to GPT by adding additional elements to "Remove HTML elements (CSS selector)" input, until log messages no longer say that the content is trimmed.

Let me know if this helps

JB

jen_brown_done

3 months ago

Hello! This worked! thank you!

lukas.prusa avatar

That’s great! We are happy to hear that!

Developer
Maintained by Apify

Actor Metrics

  • 155 monthly users

  • 64 stars

  • >99% runs succeeded

  • 2 days response time

  • Created in Mar 2023

  • Modified 4 days ago