Pricing

from $50.00 / 1,000 results

Chatgpt Conversation Extractor

This scraper extracts the conversation history from publicly shared ChatGPT conversations.

Rating

0.0

(0)

Developer

KLINZINGER

Last modified

3 months ago

Overview

This Actor extracts conversation data from ChatGPT's publicly shared conversations by accessing the data embedded in the page through React Router's data loader. The data is not fetched via a separate API endpoint but is embedded server-side and accessible through the browser's JavaScript environment.

How It Works

The Actor navigates to the provided ChatGPT share URLs using Puppeteer
Waits for the page to fully load and React Router to initialize
Extracts conversation data from window.__reactRouterDataRouter.state.loaderData
Parses the conversation tree structure into a linear array of messages
Outputs structured data including:
- Conversation metadata (title, timestamps, share ID)
- Parsed messages in chronological order
- Optionally, the complete raw conversation data

Input

The Actor accepts the following input parameters:

startUrls (required): Array of ChatGPT share URLs to extract
- Example: https://chatgpt.com/share/693011c8-0a3c-8006-b6cf-77d844d1bb51
includeRawData (optional, default: true): Whether to include the complete raw conversation data in the output

Example Input

{
  "startUrls": [
    {
      "url": "https://chatgpt.com/share/693011c8-0a3c-8006-b6cf-77d844d1bb51"
    }
  ],
  "includeRawData": true
}
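
A minimal sketch of how input of this shape might be validated before crawling. The helper name normalizeInput and the host/path check are assumptions made for illustration, not the Actor's actual code; the default for includeRawData follows the parameter description above.

```javascript
// Hypothetical helper: normalizes Actor input of the shape shown above.
// The hostname/path check is an assumption based on the example URL format.
function normalizeInput(input) {
  const { startUrls, includeRawData = true } = input ?? {};
  if (!Array.isArray(startUrls) || startUrls.length === 0) {
    throw new Error('startUrls is required and must be a non-empty array');
  }
  const urls = startUrls.map((entry) => {
    const parsed = new URL(entry.url); // throws on malformed URLs
    if (parsed.hostname !== 'chatgpt.com' || !parsed.pathname.startsWith('/share/')) {
      throw new Error(`Not a ChatGPT share URL: ${entry.url}`);
    }
    return parsed.href;
  });
  return { urls, includeRawData: Boolean(includeRawData) };
}
```

Rejecting unexpected URLs up front keeps failures cheap: a malformed entry is reported before any browser is launched.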

Output

The Actor outputs structured data to the dataset with the following fields:

url: The ChatGPT share URL
shareId: Extracted share ID from the URL
title: Conversation title
createTime: Unix timestamp (in seconds, with a fractional part) of when the conversation was created
updateTime: Unix timestamp (in seconds, with a fractional part) of when the conversation was last updated
messageCount: Number of messages in the conversation
messages: Array of parsed messages, each containing:
- role: Message role ("user" or "assistant")
- content: Message content text
- timestamp: Unix timestamp (in seconds, with a fractional part) of when the message was created
- messageId: Unique message identifier
- status: Message status
rawData (if includeRawData is true): Complete raw conversation data with full tree structure
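
To illustrate the shareId and timestamp fields, the following sketch derives a share ID from a URL and converts a Unix timestamp to an ISO 8601 string. Both helper names are hypothetical, and the sketch assumes URLs of the form https://chatgpt.com/share/<shareId> and timestamps expressed in seconds.

```javascript
// Hypothetical helper: returns the path segment after /share/, or null.
function extractShareId(url) {
  const match = new URL(url).pathname.match(/^\/share\/([^/]+)/);
  return match ? match[1] : null;
}

// Hypothetical helper: converts Unix seconds (possibly fractional) to ISO 8601.
// JavaScript dates are millisecond-based, hence the multiplication.
function toIsoString(unixSeconds) {
  return new Date(unixSeconds * 1000).toISOString();
}
```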

Example Output

{
  "url": "https://chatgpt.com/share/693011c8-0a3c-8006-b6cf-77d844d1bb51",
  "shareId": "693011c8-0a3c-8006-b6cf-77d844d1bb51",
  "title": "Example Conversation",
  "createTime": 1764757960.044993,
  "updateTime": 1764757965.106983,
  "messageCount": 54,
  "messages": [
    {
      "role": "user",
      "content": "Hello, how are you?",
      "timestamp": 1764256500.3946629,
      "messageId": "message_id_1",
      "status": "finished_successfully"
    },
    {
      "role": "assistant",
      "content": "I'm doing well, thank you!",
      "timestamp": 1764256501.1234567,
      "messageId": "message_id_2",
      "status": "finished_successfully"
    }
  ],
  "rawData": { /* complete raw conversation data */ }
}

Data Structure

ChatGPT conversations are stored in a tree structure where:

Each message holds a reference to its parent message
Each message has a children array with child message IDs
Messages are organized in threads/branches
The Actor traverses this tree to extract messages in chronological order
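
The traversal described above might look like the following sketch. The node shape (a mapping object keyed by ID, with parent, children, and message fields) is an assumption inferred from the description, and visiting children in array order is a simplification that may not match how the Actor handles branches.

```javascript
// Assumed node shape (hypothetical, inferred from the description above):
//   { id, parent: string | null, children: string[], message: object | null }
// `mapping` is an object keyed by node ID.
function linearizeTree(mapping) {
  const nodes = Object.values(mapping);
  // The root is the node that has no parent.
  const root = nodes.find((node) => !node.parent);
  if (!root) return [];

  const ordered = [];
  const visit = (node) => {
    if (node.message) ordered.push(node.message); // skip nodes without a message
    for (const childId of node.children ?? []) {
      const child = mapping[childId];
      if (child) visit(child); // ignore dangling child IDs
    }
  };
  visit(root);
  return ordered;
}
```

For a conversation without branches, each node has at most one child, so this depth-first walk simply follows the chain from the root to the last message.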

Limitations

Only works for publicly shared conversations
Requires JavaScript execution (uses Puppeteer browser automation)
Cannot access private conversations without authentication
The data structure may change as OpenAI updates the ChatGPT platform
Rate limiting may apply if extracting many conversations

Use Cases

Archiving publicly shared conversations
Analyzing conversation patterns and structures
Converting conversations to other formats (Markdown, CSV, etc.)
Building conversation datasets for training or analysis
Creating backups of shared conversations
Research and analysis of AI conversation patterns
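
As an example of format conversion, a dataset item could be turned into Markdown along these lines. This is a sketch only: the function name toMarkdown and the heading and speaker-label layout are illustrative choices, and it relies solely on the title, role, and content fields described in the Output section.

```javascript
// Converts one dataset item (shape from the Output section) to Markdown.
// The layout (H1 title, bold speaker labels) is an arbitrary illustrative choice.
function toMarkdown(item) {
  const lines = [`# ${item.title}`, ''];
  for (const message of item.messages) {
    const speaker = message.role === 'user' ? 'User' : 'Assistant';
    lines.push(`**${speaker}:**`, '', message.content, '');
  }
  // Collapse trailing blank lines into a single final newline.
  return lines.join('\n').trimEnd() + '\n';
}
```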

Getting Started

Local Development

Install dependencies:

$ npm install

Run the Actor locally:

$ apify run

The Actor will read input from storage/key_value_stores/default/INPUT.json. Create this file with your ChatGPT share URLs:

{
  "startUrls": [
    {
      "url": "https://chatgpt.com/share/YOUR_SHARE_ID"
    }
  ]
}

Deploy to Apify

Log in to Apify:

$ apify login

Deploy your Actor:

$ apify push

Technical Details

Extraction Method

The Actor uses the following approach to extract conversation data:

Page Navigation: Uses Puppeteer to navigate to the ChatGPT share URL
Wait for React Router: Waits for window.__reactRouterDataRouter to be available

Data Extraction: Accesses the conversation data from:

window.__reactRouterDataRouter.state.loaderData['routes/share.$shareId.($action)'].serverResponse.data

Tree Traversal: Parses the conversation tree structure by:
- Finding the root message (message without a parent)
- Traversing the tree recursively through children
- Extracting messages in chronological order
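
The data-extraction step can be sketched as a small accessor that follows the path given above and returns null when any part of it is missing. The function name readConversationData is hypothetical, and the Puppeteer calls in the trailing comment show one plausible way to run the same lookup in the page context, not necessarily how the Actor does it.

```javascript
// Route key copied from the path given above.
const SHARE_ROUTE_KEY = 'routes/share.$shareId.($action)';

// Reads the embedded conversation data from a window-like object.
// Returns null instead of throwing when any part of the path is missing.
function readConversationData(win) {
  const loaderData = win?.__reactRouterDataRouter?.state?.loaderData;
  return loaderData?.[SHARE_ROUTE_KEY]?.serverResponse?.data ?? null;
}

// With Puppeteer, the same lookup would run inside the page, for example:
//   await page.waitForFunction(() => Boolean(window.__reactRouterDataRouter));
//   const data = await page.evaluate((key) => {
//     const loaderData = window.__reactRouterDataRouter?.state?.loaderData;
//     return loaderData?.[key]?.serverResponse?.data ?? null;
//   }, SHARE_ROUTE_KEY);
```

Returning null rather than throwing lets the caller distinguish "page loaded but data missing" (for instance after a platform change) from navigation failures.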

Error Handling

If extraction fails, the Actor will:

Log detailed error information
Push error data to the dataset for debugging
Continue processing other URLs if multiple are provided
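
The behavior above could be sketched as a loop that isolates failures per URL. The names processUrls, extract, and pushData are hypothetical stand-ins for the Actor's extraction routine and dataset writer, and the shape of the error record is an assumption.

```javascript
// `extract` and `pushData` are hypothetical injected async functions standing in
// for the Actor's extraction routine and its dataset writer.
async function processUrls(urls, extract, pushData) {
  for (const url of urls) {
    try {
      await pushData(await extract(url));
    } catch (error) {
      // Log details, then record the failure so it can be inspected later.
      console.error(`Extraction failed for ${url}:`, error);
      await pushData({ url, error: String(error?.message ?? error) });
    }
    // The loop continues with the next URL either way.
  }
}
```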

License

ISC
