Researchgpt Deep Research Agent avatar
Researchgpt Deep Research Agent

Pricing

from $70.00 / 1,000 results

Go to Apify Store
Researchgpt Deep Research Agent

Researchgpt Deep Research Agent

๐Ÿ”ฌ Transform any topic into a comprehensive research report in minutes! Scrapes Wikipedia, arXiv, Semantic Scholar, news & web sources. Outputs professional JSON, HTML & PDF reports. Perfect for students, researchers, content creators & businesses. No API keys needed.

Pricing

from $70.00 / 1,000 results

Rating

0.0

(0)

Developer

Varun Chopra

Varun Chopra

Maintained by Community

Actor stats

0

Bookmarked

5

Total users

1

Monthly active users

9 days ago

Last modified

Share

๐Ÿ”ฌ ResearchGPT - Deep Research Agent

Transform any topic into a comprehensive research report in minutes, not hours.

Apify Actor Python 3.11+ License: MIT


๐ŸŽฏ What is ResearchGPT?

ResearchGPT is your AI-powered research assistant that does in 3 minutes what would take you 3+ hours manually.

Simply enter any topic, and ResearchGPT will:

โœ… Search across multiple engines (DuckDuckGo, Brave, Mojeek)
โœ… Scrape Wikipedia, arXiv, Semantic Scholar, OpenAlex, CrossRef
โœ… Extract the latest news articles and web content
โœ… Process everything with intelligent NLP analysis
โœ… Generate beautiful reports in JSON, HTML & PDF formats

No API keys required. No complex setup. Just results.


๐Ÿš€ Perfect For

Use CaseHow ResearchGPT Helps
๐Ÿ“š Students & AcademicsLiterature reviews, thesis research, citation gathering
โœ๏ธ Content CreatorsBlog research, fact-checking, source compilation
๐Ÿ’ผ Business AnalystsMarket research, competitive analysis, trend reports
๐Ÿ”ฌ ResearchersCross-referencing sources, academic paper aggregation
๐Ÿ“ฐ JournalistsBackground research, source verification, story development
๐Ÿค– AI/ML ProjectsTraining data collection, knowledge base building

โšก Quick Start

1. Run on Apify (Easiest)

  1. Go to the ResearchGPT Actor page
  2. Enter your research topic
  3. Click Start
  4. Download your reports! ๐Ÿ“„

2. Via API

curl -X POST "https://api.apify.com/v2/acts/YOUR_USERNAME~researchgpt-deep-research-agent/runs?token=YOUR_TOKEN" \
-H "Content-Type: application/json" \
-d '{"topic": "quantum computing breakthroughs 2025"}'

3. Via Apify SDK (Python)

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("YOUR_USERNAME/researchgpt-deep-research-agent").call(
run_input={"topic": "artificial intelligence in healthcare"}
)
# Get results
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)

๐Ÿ“Š What You Get

Three Professional Output Formats

FormatBest ForContents
๐Ÿ“„ JSONDevelopers, APIs, databasesFull structured data with metadata
๐ŸŒ HTMLWeb publishing, sharingBeautifully styled report with CSS
๐Ÿ“‘ PDFPrinting, presentationsClean, professional document

Rich Research Data

{
"topic": "artificial intelligence ethics",
"sources": {
"wikipedia": 5,
"academic": 10,
"news": 5,
"general": 10
},
"processed_content": {
"summary": "Comprehensive executive summary...",
"key_findings": ["Finding 1", "Finding 2", "..."],
"themes": ["Theme 1", "Theme 2", "..."],
"entities": ["Entity 1", "Entity 2", "..."]
}
}

๐Ÿ”ง Configuration Options

{
"topic": "your research topic here",
"outputFormats": ["json", "html", "pdf"],
"maxSourcesPerType": 10,
"includeWikipedia": true,
"includeAcademic": true,
"includeNews": true,
"includeGeneral": true,
"searchProviders": ["duckduckgo"],
"requestTimeout": 30,
"maxRetries": 3,
"debug": false
}

Parameter Reference

ParameterTypeDefaultDescription
topicstringrequired๐ŸŽฏ Your research topic or question
outputFormatsarray["json", "html", "pdf"]๐Ÿ“„ Output formats to generate
maxSourcesPerTypeinteger10๐Ÿ“Š Sources per category (1-20)
includeWikipediabooleantrue๐Ÿ“– Include Wikipedia articles
includeAcademicbooleantrue๐ŸŽ“ Include academic papers
includeNewsbooleantrue๐Ÿ“ฐ Include news articles
includeGeneralbooleantrue๐ŸŒ Include general web content
searchProvidersarray["duckduckgo"]๐Ÿ” Search engines to use
requestTimeoutinteger30โฑ๏ธ Request timeout (seconds)
maxRetriesinteger3๐Ÿ”„ Retry attempts on failure
proxyConfigurationobjectnull๐Ÿ›ก๏ธ Apify proxy settings
debugbooleanfalse๐Ÿ› Enable verbose logging

๐ŸŒ Data Sources

ResearchGPT taps into 6+ authoritative sources:

SourceTypeWhat You Get
๐ŸŒ WikipediaKnowledge BaseFoundational articles via MediaWiki API
๐Ÿ“š arXivAcademicPre-print papers in physics, CS, math, and more
๐Ÿ”ฌ Semantic ScholarAcademic200M+ papers with citation analysis
๐Ÿ“– OpenAlexAcademicOpen catalog of scholarly works
๐Ÿ“‘ CrossRefAcademicDOI metadata and citations
๐Ÿ“ฐ News SourcesCurrent EventsLatest articles via smart extraction
๐ŸŒ General WebInsightsCurated web content with readability algorithms

๐Ÿ›ก๏ธ Production-Grade Features

Built for reliability and scale:

  • โšก Smart Caching - 5-minute TTL prevents redundant requests
  • ๐Ÿ”„ Retry Logic - Exponential backoff with jitter
  • ๐Ÿšฆ Rate Limiting - Respects API limits automatically
  • ๐Ÿ” Deduplication - MD5-based content fingerprinting
  • ๐ŸŒ Connection Pooling - Efficient HTTP management
  • ๐Ÿ›ก๏ธ Error Handling - Graceful fallbacks, never crashes

๐Ÿ’ป Local Development

# Clone the repository
git clone https://github.com/your-repo/researchgpt-deep-research-agent
cd researchgpt-deep-research-agent
# Create virtual environment
python -m venv .venv
.venv\Scripts\activate # Windows
source .venv/bin/activate # macOS/Linux
# Install dependencies
pip install -r requirements.txt
# Run locally
python run_local.py

๐Ÿ“ Project Structure

researchgpt-deep-research-agent/
โ”œโ”€โ”€ ๐Ÿ“‚ .actor/
โ”‚ โ””โ”€โ”€ actor.json # Apify configuration
โ”œโ”€โ”€ ๐Ÿ“‚ src/
โ”‚ โ”œโ”€โ”€ __init__.py
โ”‚ โ””โ”€โ”€ main.py # ๐Ÿš€ Main Apify entry point
โ”œโ”€โ”€ ๐Ÿ“‚ scrapers/
โ”‚ โ”œโ”€โ”€ base_scraper.py # Base class with retry/caching
โ”‚ โ”œโ”€โ”€ academic_scraper.py # arXiv, Semantic Scholar, etc.
โ”‚ โ”œโ”€โ”€ wikipedia_scraper.py # MediaWiki API
โ”‚ โ”œโ”€โ”€ news_scraper.py # News extraction
โ”‚ โ”œโ”€โ”€ general_scraper.py # Web scraping
โ”‚ โ””โ”€โ”€ search_engine.py # Multi-provider search
โ”œโ”€โ”€ ๐Ÿ“‚ processors/
โ”‚ โ””โ”€โ”€ content_processor.py # NLP processing
โ”œโ”€โ”€ ๐Ÿ“‚ output/
โ”‚ โ””โ”€โ”€ output_generator.py # Report generation
โ”œโ”€โ”€ ๐Ÿ“„ Dockerfile # Container definition
โ”œโ”€โ”€ ๐Ÿ“„ requirements.txt # Dependencies
โ””โ”€โ”€ ๐Ÿ“„ README.md # You are here!

๐Ÿš€ Deploy to Apify

npm install -g apify-cli
apify login
apify push

Option 2: GitHub Integration

  1. Push to GitHub
  2. Apify Console โ†’ Create Actor โ†’ Link to GitHub
  3. Auto-builds on every push! ๐Ÿ”„

๐Ÿ“ˆ Performance Tips

TipImpact
Lower maxSourcesPerTypeโšก Faster results
Disable unused sources๐Ÿš€ Skip what you don't need
Use single search provider๐Ÿ“‰ Reduce API calls
Enable debug mode๐Ÿ” Troubleshoot issues

๐Ÿค” FAQ


๐Ÿ†š Why ResearchGPT?

FeatureResearchGPTManual ResearchOther Tools
โฑ๏ธ Time3 minutes3+ hours30+ minutes
๐Ÿ“š Sources6+ databasesLimitedUsually 1-2
๐Ÿ“„ OutputJSON + HTML + PDFManual formattingSingle format
๐Ÿ’ฐ CostPay per runYour time = $$$$Subscription
๐Ÿ”ง SetupZeroN/AAPI keys needed

๐Ÿ“ Example Topics

Get inspired! Here are some topics that work great:

  • "artificial intelligence ethics and regulation 2025"
  • "quantum computing practical applications"
  • "climate change solutions renewable energy"
  • "cryptocurrency DeFi market analysis"
  • "remote work productivity research"
  • "mental health digital therapeutics"
  • "gene editing CRISPR medical applications"
  • "electric vehicles battery technology"

๐Ÿค Support & Community


๐Ÿ“„ License

MIT License - Use it freely, commercially or personally.