GPT Scraper

Pricing

$9.00 / 1,000 pages

drobnikj/gpt-scraper

Developed by

Jakub Drobník

Maintained by Apify

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

4.4 (7)

86

Monthly users

144

Runs succeeded

>99%

Response time

3.2 days

Last modified

3 months ago

Incomplete results

Closed
enchanting_wilderness opened this issue
a year ago

This Actor is exactly what I need, but the results I get seem to be incomplete. I have a free account, but I doubt this influences the result. My use case is very simple: I want a list of all casinos mentioned on a website, but it lists only a small number, a maximum of 3 in one case and a maximum of 10 in another.

paja

Hi,

thanks for reaching out, we'll look into it and let you know what can be done.

enchanting_wilderness

a year ago

I hope to hear about it soon @pavlina.

lukas.prusa

Hi Jasper,

the website you are trying to scrape is very large and can't fully fit into the GPT context window, as indicated by the warn log messages "Content was truncated for...".

There are two options to resolve this:

  1. The easier option is to use the Extended GPT Scraper with your own API key. You can select a model with a larger context window (e.g. gpt-4-turbo-preview) so that the whole web page fits into it.
  2. Use our page processing filters to reduce the amount of page content sent to GPT. This is the more advanced route and requires knowledge of HTML and CSS selectors.
    • Selecting the newly added removeLinkUrls option and adding img to removeElementsCssSelector should help you get more results, though probably still not all of them...
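
To illustrate why the second option helps, here is a minimal Python sketch of the idea behind removeLinkUrls and removeElementsCssSelector: dropping image elements and link URLs shrinks the markup, so more of the actual list fits before truncation. This is not the Actor's implementation. The `PageShrinker` class and `shrink` function are hypothetical names, the sample HTML is made up, and the sketch matches plain tag names only rather than full CSS selectors.

```python
from html import escape
from html.parser import HTMLParser

# Elements that never have a closing tag.
VOID_TAGS = {"img", "br", "hr", "input", "meta", "link"}


class PageShrinker(HTMLParser):
    """Rebuilds HTML while dropping chosen elements and link URLs (hypothetical helper)."""

    def __init__(self, remove_tags=("img",), remove_link_urls=True):
        super().__init__(convert_charrefs=True)
        self.remove_tags = set(remove_tags)
        self.remove_link_urls = remove_link_urls
        self.parts = []
        self.skip_depth = 0  # > 0 while inside a removed non-void element

    def handle_starttag(self, tag, attrs):
        if self.skip_depth or tag in self.remove_tags:
            if tag not in VOID_TAGS:
                self.skip_depth += 1
            return
        if tag == "a" and self.remove_link_urls:
            # Keep the link text but drop the (often long) URL.
            attrs = [(k, v) for k, v in attrs if k != "href"]
        rendered = "".join(
            f" {k}" if v is None else f' {k}="{escape(v, quote=True)}"'
            for k, v in attrs
        )
        self.parts.append(f"<{tag}{rendered}>")

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth -= 1
            return
        self.parts.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.parts.append(data)


def shrink(markup, **options):
    """Return the markup with removed elements and link URLs stripped."""
    parser = PageShrinker(**options)
    parser.feed(markup)
    parser.close()
    return "".join(parser.parts)


# Made-up sample page fragment.
sample = (
    "<ul>"
    '<li><a href="https://example.com/reviews/alpha?ref=12345">Alpha</a>'
    '<img src="https://example.com/img/alpha.png" alt="Alpha logo"></li>'
    '<li><a href="https://example.com/reviews/beta?ref=67890">Beta</a>'
    '<img src="https://example.com/img/beta.png" alt="Beta logo"></li>'
    "</ul>"
)
smaller = shrink(sample)
print(len(sample), "->", len(smaller))
print(smaller)
```

The names the model needs ("Alpha", "Beta") survive, while the URLs and images that consume context window space are gone. On a very large page this may still not be enough, which matches the caveat above.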

I hope this helps, happy scraping!

enchanting_wilderness

a year ago

Thanks Lukáš, this helps a lot!

Pricing

Pricing model

Pay per result

This Actor is paid per result. You are not charged for Apify platform usage; you pay only a fixed price per 1,000 items in the Actor's output dataset.

Price per 1,000 items

$9.00
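
As a rough worked example of the pricing above, the following Python sketch estimates the cost of a run from the number of output items. It assumes the price is pro-rated linearly for runs that produce fewer or more than 1,000 items, which the pricing text does not state explicitly. The `estimated_cost` function name is hypothetical.

```python
from decimal import Decimal

# Price taken from the pricing section above.
PRICE_PER_1000_ITEMS = Decimal("9.00")


def estimated_cost(items: int) -> Decimal:
    """Estimate the cost in USD, assuming linear pro-rating (an assumption)."""
    if items < 0:
        raise ValueError("items must be non-negative")
    return (PRICE_PER_1000_ITEMS * items / 1000).quantize(Decimal("0.01"))


for count in (250, 1000, 4000):
    print(count, "items: $", estimated_cost(count))
```

Under that assumption, a run that outputs 250 items would cost about a quarter of the listed price.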