GPT Scraper

Pricing

$9.00 / 1,000 pages

Developed by

Jakub Drobník

Maintained by Apify

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

4.4 (7)

107

Total users: 6K

Monthly users: 104

Runs succeeded: 99%

Issues response: 2.3 days

Last modified: 6 months ago

taking way too long!!!

Open

imkundeng opened this issue
2 days ago

2025-07-24T04:54:05.876Z ACTOR: Pulling Docker image of build DultqBbOXMMEYnkJc from registry.
2025-07-24T04:54:48.760Z ACTOR: Creating Docker container.
2025-07-24T04:54:48.938Z ACTOR: Starting Docker container.
2025-07-24T04:54:49.539Z Starting X virtual framebuffer using: Xvfb :99 -ac -screen 0 1920x1080x24+32 -nolisten tcp
2025-07-24T04:54:49.542Z Executing main command
2025-07-24T04:54:52.358Z INFO System info {"apifyVersion":"3.1.16","apifyClientVersion":"2.9.3","crawleeVersion":"3.8.1","osType":"Linux","nodeVersion":"v18.20.5"}
2025-07-24T04:54:53.088Z INFO Max pages per crawl: 10
2025-07-24T04:54:53.986Z INFO Configuration completed. Starting the crawl.
2025-07-24T04:54:54.111Z INFO PlaywrightCrawler: Starting the crawler.
2025-07-24T04:55:05.202Z INFO Opening https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/introduction/introduction...
2025-07-24T04:55:07.805Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/introduction/introduction enqueued 12 new URLs. {"foundLinksCount":12,"enqueuedLinksCount":12,"alreadyPresentLinksCount":12}
2025-07-24T04:55:20.637Z INFO Processing page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/introduction/introduction with GPT instruction... {"contentLength":5427}
2025-07-24T04:55:21.902Z INFO Calling GPT for page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/introduction/introduction.
2025-07-24T04:55:54.173Z INFO Statistics: PlaywrightCrawler request statistics: {"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":26657,"requestsFinishedPerMinute":1,"requestsFailedPerMinute":0,"requestTotalDurationMillis":26657,"requestsTotal":1,"crawlerRuntimeMillis":60127,"retryHistogram":[1]}
2025-07-24T04:55:54.229Z INFO PlaywrightCrawler:AutoscaledPool: state {"currentConcurrency":2,"desiredConcurrency":3,"systemStatus":{"isSystemIdle":true,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":0},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.6,"actualRatio":0.019},"cpuInfo":{"isOverloaded":false,"limitRatio":0.4,"actualRatio":0.174},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":0}}}
2025-07-24T04:55:55.360Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/introduction/introduction processed. {"apiCallsCount":1,"usage":{"promptTokens":1646,"completionTokens":1089,"totalTokens":2735},"usdUsage":0.0009002999999999999}
2025-07-24T04:56:00.518Z INFO Opening https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/get-started-searching/start-searching-using-spl2...
2025-07-24T04:56:20.102Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/get-started-searching/start-searching-using-spl2 enqueued 7 new URLs. {"foundLinksCount":7,"enqueuedLinksCount":7,"alreadyPresentLinksCount":35}
2025-07-24T04:56:27.300Z INFO Processing page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/get-started-searching/start-searching-using-spl2 with GPT instruction... {"contentLength":6861}
2025-07-24T04:56:35.998Z INFO Opening https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/export-import-and-namespaces/exporting-module-items-using-spl2...
2025-07-24T04:56:36.103Z INFO Opening https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/modules-statements-and-views/modules-and-spl2-statements...
2025-07-24T04:56:37.115Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/export-import-and-namespaces/exporting-module-items-using-spl2 enqueued 5 new URLs. {"foundLinksCount":5,"enqueuedLinksCount":5,"alreadyPresentLinksCount":62}
2025-07-24T04:56:37.542Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/modules-statements-and-views/modules-and-spl2-statements enqueued 2 new URLs. {"foundLinksCount":2,"enqueuedLinksCount":2,"alreadyPresentLinksCount":54}
2025-07-24T04:56:38.713Z INFO Processing page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/export-import-and-namespaces/exporting-module-items-using-spl2 with truncated text using GPT instruction... {"originContentLength":27687,"contentLength":26400,"contentMaxTokens":8990}
2025-07-24T04:56:38.713Z WARN Content was truncated for https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/export-import-and-namespaces/exporting-module-items-using-spl2 to match GPT maxTokens limit. {"url":"https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/export-import-and-namespaces/exporting-module-items-using-spl2","maxTokensLimit":10000}
2025-07-24T04:56:39.297Z INFO Processing page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/modules-statements-and-views/modules-and-spl2-statements with GPT instruction... {"contentLength":20728}
2025-07-24T04:56:40.001Z INFO Calling GPT for page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/modules-statements-and-views/modules-and-spl2-statements.
2025-07-24T04:56:54.111Z INFO Statistics: PlaywrightCrawler request statistics: {"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":38141,"requestsFinishedPerMinute":2,"requestsFailedPerMinute":0,"requestTotalDurationMillis":190706,"requestsTotal":5,"crawlerRuntimeMillis":120127,"retryHistogram":[5]}
2025-07-24T04:56:54.299Z INFO PlaywrightCrawler:AutoscaledPool: state {"currentConcurrency":1,"desiredConcurrency":1,"systemStatus":{"isSystemIdle":false,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":0},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.6,"actualRatio":0.1},"cpuInfo":{"isOverloaded":true,"limitRatio":0.4,"actualRatio":0.474},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":0}}}
2025-07-24T04:57:04.819Z INFO Calling GPT for page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/export-import-and-namespaces/exporting-module-items-using-spl2.
2025-07-24T04:57:14.817Z INFO Calling GPT for page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/get-started-searching/start-searching-using-spl2.
2025-07-24T04:57:29.998Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/get-started-searching/start-searching-using-spl2 processed. {"apiCallsCount":2,"usage":{"promptTokens":4444,"completionTokens":2001,"totalTokens":6445},"usdUsage":0.0018671999999999998}
2025-07-24T04:57:31.298Z INFO Opening https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/datasets-and-dataset-literals/datasets...
2025-07-24T04:57:37.096Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/datasets-and-dataset-literals/datasets enqueued 2 new URLs. {"foundLinksCount":2,"enqueuedLinksCount":2,"alreadyPresentLinksCount":43}
2025-07-24T04:57:38.418Z INFO Processing page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/datasets-and-dataset-literals/datasets with truncated text using GPT instruction... {"originContentLength":23190,"contentLength":19900,"contentMaxTokens":8990}
2025-07-24T04:57:38.418Z WARN Content was truncated for https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/datasets-and-dataset-literals/datasets to match GPT maxTokens limit. {"url":"https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/datasets-and-dataset-literals/datasets","maxTokensLimit":10000}
2025-07-24T04:57:41.116Z INFO Calling GPT for page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/datasets-and-dataset-literals/datasets.
2025-07-24T04:57:45.496Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/modules-statements-and-views/modules-and-spl2-statements processed. {"apiCallsCount":3,"usage":{"promptTokens":10720,"completionTokens":5049,"totalTokens":15769},"usdUsage":0.0046374}
2025-07-24T04:57:53.732Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/export-import-and-namespaces/exporting-module-items-using-spl2 processed. {"apiCallsCount":4,"usage":{"promptTokens":18564,"completionTokens":7852,"totalTokens":26416},"usdUsage":0.0074957999999999995}
2025-07-24T04:57:54.112Z INFO Statistics: PlaywrightCrawler request statistics: {"requestAvgFailedDurationMillis":null,"requestAvgFinishedDurationMillis":37285,"requestsFinishedPerMinute":3,"requestsFailedPerMinute":0,"requestTotalDurationMillis":335561,"requestsTotal":9,"crawlerRuntimeMillis":180128,"retryHistogram":[9]}
2025-07-24T04:57:54.309Z INFO PlaywrightCrawler:AutoscaledPool: state {"currentConcurrency":3,"desiredConcurrency":3,"systemStatus":{"isSystemIdle":false,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":0},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.6,"actualRatio":0.089},"cpuInfo":{"isOverloaded":true,"limitRatio":0.4,"actualRatio":0.746},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":0}}}
2025-07-24T04:58:04.304Z INFO Opening https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/functions/built-in-and-custom-functions...
2025-07-24T04:58:05.300Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/datasets-and-dataset-literals/datasets processed. {"apiCallsCount":5,"usage":{"promptTokens":26774,"completionTokens":9096,"totalTokens":35870},"usdUsage":0.0094737}
2025-07-24T04:58:06.799Z INFO Page https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/functions/built-in-and-custom-functions enqueued 4 new URLs. {"foundLinksCount":4,"enqueuedLinksCount":4,"alreadyPresentLinksCount":51}
2025-07-24T04:58:07.117Z INFO Opening https://help.splunk.com/en/splunk-cloud-platform/search/spl2-search-manual/expressions-and-predicates/types-of-expressions...
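
To see where the time goes in a log like the one above, one option is to measure, per page, the gap between the "Calling GPT for page" entry and the matching "Page ... processed." entry. The Python sketch below does this. The function name `gpt_latencies` and the regular expressions are illustrative assumptions based only on the message formats visible in this log; they are not part of the actor or of Crawlee. It accepts the log either with one entry per line or as a single run of text.

```python
import re
from datetime import datetime

# One log entry: an ISO timestamp ending in "Z", then everything up to the
# next timestamp (or the end of the text). Assumed from the log format above.
ENTRY = re.compile(
    r"(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d{3})Z\s+(.*?)"
    r"(?=\s*\d{4}-\d{2}-\d{2}T\d{2}:|\Z)",
    re.DOTALL,
)
# Message shapes as they appear in this log (hypothetical parsers, not an API).
CALL = re.compile(r"Calling GPT for page (\S+)\.\s*$")
DONE = re.compile(r"INFO Page (\S+) processed\.")


def gpt_latencies(log_text):
    """Return {url: seconds from 'Calling GPT' to 'processed'} per page."""
    started = {}
    latencies = {}
    for match in ENTRY.finditer(log_text):
        ts = datetime.strptime(match.group(1), "%Y-%m-%dT%H:%M:%S.%f")
        body = match.group(2).strip()
        if (call := CALL.search(body)):
            # Remember when the GPT call for this URL was issued.
            started[call.group(1)] = ts
        elif (done := DONE.search(body)) and done.group(1) in started:
            url = done.group(1)
            latencies[url] = (ts - started[url]).total_seconds()
    return latencies


if __name__ == "__main__":
    # Two entries copied from the log above.
    url = ("https://help.splunk.com/en/splunk-cloud-platform/search/"
           "spl2-search-manual/introduction/introduction")
    sample = (
        f"2025-07-24T04:55:21.902Z INFO Calling GPT for page {url}.\n"
        f"2025-07-24T04:55:55.360Z INFO Page {url} processed. "
        '{"apiCallsCount":1}\n'
    )
    for page, seconds in gpt_latencies(sample).items():
        print(f"{seconds:7.1f}s  {page}")
```

Applied to the five pages that finish in the log above, this gap ranges from roughly 15 to 65 seconds per page. By this measure, much of the run time appears to be spent between issuing the GPT call and the page being marked processed rather than in loading the pages, although the log alone does not show how that interval splits between the OpenAI API and the actor itself.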