Pricing

Pay per usage

Go to Store

Extended GPT Scraper

Try for free

Developed by

Jakub Drobník

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

4.6 (4)

Pricing

Pay per usage

Total users

1.4k

Monthly users

Runs succeeded

98%

Issue response

1.4 days

Last modified

4 months ago

Lead generation

Open source

Back to issues Create new issue

4o mini model (for cost concern)

Closed

tak_lai opened this issue

Hi, since website scraping involved large input token and out token, any plans to add model 4o mini soon? Many thanks

Lukáš Průša (lukas.prusa)

Hi Tak, thanks opening this issue and your suggestion!

Yes, we will 100% add the 4o-mini model shortly :) We will also most likely add it to our Pay Per Result version of this Actor.

I will keep you updated here, thanks!

Lukáš Průša (lukas.prusa)

Hi again, thanks for your patience!

We've had some other issues with the scraper, so we kept in the beta for a bit too long now. The 4o-mini model is accessible on the latest (default) version :) We've also set it as the default one for the Pay Per Result Actor version.

Try it out and let me know how it works, thanks!

tak_lai

Thanks , I will try it. Really appreciate your effort

tak_lai

Hello , I just spotted another issue, which may relate to playwright. I tried to use playwright and selenium to scrape https://www.newbalance.com/pd/made-in-usa-990v6/U990V6-45189.html I am not sure why it is failed when using playwright. Since Extended GPT Scraper use playwright so it shared same result(retrieve blank page). It works using selenium in my local but failed to use playwright in my local. I just want to share this limitation as it is not the problem on this actor. I am truly thankful of your effort on this actor

Lukáš Průša (lukas.prusa)

Hi, thanks for sharing this inside. This looks to be some issue in playwright, or at least it's getting detected and blocked by the website.

I've tried a few settings like residential proxies and waiting longer for the dynamic content to load, but I guess it's still getting blocked... This looks a bit out of our scope as this is most likely an issue somewhere else, so unfortunately we won't be fixing this. But good luck with the website and I hope you will be able to scrape it well with selenium :)

Add comment

GPT Scraper

drobnikj/gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Jakub Drobník

5.8k

4.4

GPT Browser

anchor/gpt-browser

A GPT browser to use OpenAI prompt on any website. Put a list of URLs and a prompt, then the GPT agent will give you the answer you need. Fast, easy, and not limited with OpenAI ChatGPT restrictions. The best way to search and use GPT on large number of websites. Upload Excel or CSV. Screenshots 📸

Anchor

🔍 GPT Search [Private API]

openapi/gpt-search-private-api

Use OpenAI's GPT4o Search mode via API! No cookie or proxy is required. Fast, cheap and reliable.

Open API

5.0

Auto GPT

lukaskrivka/auto-gpt

Run Auto GPT sessions directly on Apify. No OpenAI account or API token is required! Store parsed thoughts into datasets for later analysis.

Lukáš Křivka

194

ChatGPT

pertosh/chatgpt

You can use this Actor to transform scraped results, such as reviews from restaurants, by rephrasing the sentences. Additionally, translation is also supported. You can also use it to generate new website descriptions, keywords, and other similar metadata.

Alper

133

Pinecone GPT Chatbot

tri_angle/pinecone-gpt-chatbot

Pinecone GPT Chatbot combines OpenAI's GPT models with Pinecone's database to generate insightful responses. Its interactive chatbot interface presents precise and comprehensive answers to user queries. Benefit from semantic understanding, efficient workflows, and enriched knowledge integration!

Tri⟁angle

4.5

Universal AI GPT Scraper

louisdeconinck/ai-gpt-scraper

Transform any website into structured data with AI-powered extraction. This versatile tool combines advanced web scraping with intelligent content analysis to deliver clean, customized JSON output - perfect for automating data collection from any web source.

Louis Deconinck

5.0

OpenAI Vector Store Integration

jiri.spilka/openai-vector-store-integration

The Apify OpenAI Vector Store integration uploads data from Apify Actors to the OpenAI Vector Store linked to OpenAI Assistant.

Jiří Spilka

151

4.8

RAG Web Browser

apify/rag-web-browser

Web browser for OpenAI Assistants, RAG pipelines, or AI agents, similar to a web browser in ChatGPT. It queries Google Search, scrapes the top N pages, and returns their content as Markdown for further processing by an LLM. It can also scrape individual URLs. Supports Model Context Protocol (MCP).

Apify

2.3k

4.4

OpenRouter - Unified LLM Interface for ChatGPT, Claude, Gemini

xyzzy/open-router

Use the OpenRouter platform to choose the best and most cost effective model for your prompts utilizing a standardized interface (including ChatGPT, Claude, Gemini, Llama, Mistral, and more). See instructions for creating an OpenRouter account and API key.