Pay $9.00 for 1,000 pages

GPT Scraper

drobnikj/gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.
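As a rough budgeting aid, the listed price of $9.00 per 1,000 pages can be turned into a per-run estimate. The sketch below assumes cost scales linearly with the page count and is rounded to the cent; the actual billing granularity is not stated here, and the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

# Listed price: $9.00 for 1,000 pages.
PRICE_PER_1000_PAGES = Decimal("9.00")


def estimate_cost(pages: int) -> Decimal:
    """Estimate the cost in USD for a number of pages.

    Assumption: pricing is linear per page (pro rata), rounded to cents.
    """
    if pages < 0:
        raise ValueError("pages must be non-negative")
    cost = PRICE_PER_1000_PAGES * Decimal(pages) / Decimal(1000)
    return cost.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for n in (250, 1000, 12500):
        print(f"{n} pages: ${estimate_cost(n)}")
```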

API clients

The code example below shows how to run the Actor and get its results. To run the code, you need an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token, which you can find under Settings > Integrations in Apify Console.

Python

from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrls": [{ "url": "https://news.ycombinator.com/" }],
    "instructions": """Gets the post with the most points from the page and returns it as JSON in this format:
postTitle
postUrl
pointsCount""",
    "includeUrlGlobs": [],
    "excludeUrlGlobs": [],
    "linkSelector": "a[href]",
    "initialCookies": [],
    "proxyConfiguration": { "useApifyProxy": True },
    "targetSelector": "",
    "removeElementsCssSelector": "script, style, noscript, path, svg, xlink",
    "schema": {
        "type": "object",
        "properties": {
            "title": {
                "type": "string",
                "description": "Page title",
            },
            "description": {
                "type": "string",
                "description": "Page description",
            },
        },
        "required": [
            "title",
            "description",
        ],
    },
    "schemaDescription": "",
}

# Run the Actor and wait for it to finish
run = client.actor("drobnikj/gpt-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start
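Rather than pasting the token into the source in place of <YOUR_API_TOKEN>, it can be read from the environment. This is a minimal sketch, not part of the official example: the helper name `load_api_token` and the default variable name `APIFY_TOKEN` are assumptions made for illustration.

```python
import os

# The placeholder used in the example; it must never be sent as a real token.
PLACEHOLDER = "<YOUR_API_TOKEN>"


def load_api_token(env=None, var_name="APIFY_TOKEN"):
    """Return the API token from an environment mapping.

    `env` defaults to os.environ; `var_name` is an assumed variable name.
    Raises RuntimeError if the token is missing or still the placeholder.
    """
    env = os.environ if env is None else env
    token = env.get(var_name, "").strip()
    if not token or token == PLACEHOLDER:
        raise RuntimeError(
            f"Set {var_name} to your Apify API token "
            "(Settings > Integrations in Apify Console)."
        )
    return token
```

With this helper, the client could be created as `client = ApifyClient(load_api_token())`, which keeps the token out of version control.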

Developer

Jakub Drobník

Actor metrics

221 monthly users
45 stars
99.0% runs succeeded
1.8 days response time
Created in Mar 2023
Modified 27 days ago

Categories

Lead generation

Business

Extended GPT Scraper

drobnikj/extended-gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Jakub Drobník

GPTs Scraper

observant_bagpipes/GPTs-scraper

Use this scraper to collect data about GPTs: URL, title, description, and more.

quill zhou

Free GPTs Scraper

seadapp/free-gpts-scraper

Gets you GPT data from OpenAI. Download your data as JSON, HTML table, CSV, Excel, or RSS feed.

Seadapp

Website Content Crawler

apify/website-content-crawler

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗LangChain, LlamaIndex, and the wider LLM ecosystem.

Apify

20.2k

433

Linkedin x GPT prompt

anchor/linkedin-gpt-prompt

Extracts LinkedIn profiles and automatically runs your ChatGPT prompt on each profile. Your prompt, the answer you need, the way you want.

guillim

WCC Pinecone Integration

tri_angle/wcc-pinecone-integration

Crawl any website and store its content in your Pinecone vector database. Enhance the accuracy and reliability of your own AI Assistant with facts fetched from external sources or connect this integration to our Pinecone GPT Chatbot assistant available in Apify Store.

Tri⟁angle

Google Maps Extractor

compass/google-maps-extractor

Extract data from hundreds of places fast. Scrape Google Maps by keyword, category, location, URLs & other filters. Get addresses, contact info, opening hours, popular times, prices, menus & more. Export scraped data, run the scraper via API, schedule and monitor runs, or integrate with other tools.

Compass

7.8k

128

Facebook Posts Scraper

apify/facebook-posts-scraper

Extract data from hundreds of Facebook posts from one or multiple Facebook pages and profiles. Get post URL, post text, page or profile URL, timestamp, number of likes, shares, comments, and more. Download the data in JSON, CSV, and Excel and use it in apps, spreadsheets, and reports.

Apify

11.5k

115

Instagram Scraper

apify/instagram-scraper

Scrape and download Instagram posts, profiles, places, hashtags, photos, and comments. Get data from Instagram using one or more Instagram URLs or search queries. Export scraped data, run the scraper via API, schedule and monitor runs or integrate with other tools.

Apify

49.9k

281