Researchgpt Deep Research Agent avatar
Researchgpt Deep Research Agent

Pricing

from $150.00 / 1,000 results

Go to Apify Store
Researchgpt Deep Research Agent

Researchgpt Deep Research Agent

πŸ”¬ Transform any topic into a comprehensive research report in minutes! Scrapes Wikipedia, arXiv, Semantic Scholar, news & web sources. Outputs professional JSON, HTML & PDF reports. Perfect for students, researchers, content creators & businesses. No API keys needed.

Pricing

from $150.00 / 1,000 results

Rating

0.0

(0)

Developer

Varun Chopra

Varun Chopra

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

πŸ”¬ ResearchGPT - Deep Research Agent

Transform any topic into a comprehensive research report in minutes, not hours.

Apify Actor Python 3.11+ License: MIT


🎯 What is ResearchGPT?

ResearchGPT is your AI-powered research assistant that does in 3 minutes what would take you 3+ hours manually.

Simply enter any topic, and ResearchGPT will:

βœ… Search across multiple engines (DuckDuckGo, Brave, Mojeek)
βœ… Scrape Wikipedia, arXiv, Semantic Scholar, OpenAlex, CrossRef
βœ… Extract the latest news articles and web content
βœ… Process everything with intelligent NLP analysis
βœ… Generate beautiful reports in JSON, HTML & PDF formats

No API keys required. No complex setup. Just results.


πŸš€ Perfect For

Use CaseHow ResearchGPT Helps
πŸ“š Students & AcademicsLiterature reviews, thesis research, citation gathering
✍️ Content CreatorsBlog research, fact-checking, source compilation
πŸ’Ό Business AnalystsMarket research, competitive analysis, trend reports
πŸ”¬ ResearchersCross-referencing sources, academic paper aggregation
πŸ“° JournalistsBackground research, source verification, story development
πŸ€– AI/ML ProjectsTraining data collection, knowledge base building

⚑ Quick Start

1. Run on Apify (Easiest)

  1. Go to the ResearchGPT Actor page
  2. Enter your research topic
  3. Click Start
  4. Download your reports! πŸ“„

2. Via API

curl -X POST "https://api.apify.com/v2/acts/YOUR_USERNAME~researchgpt-deep-research-agent/runs?token=YOUR_TOKEN" \
-H "Content-Type: application/json" \
-d '{"topic": "quantum computing breakthroughs 2025"}'

3. Via Apify SDK (Python)

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("YOUR_USERNAME/researchgpt-deep-research-agent").call(
run_input={"topic": "artificial intelligence in healthcare"}
)
# Get results
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)

πŸ“Š What You Get

Three Professional Output Formats

FormatBest ForContents
πŸ“„ JSONDevelopers, APIs, databasesFull structured data with metadata
🌐 HTMLWeb publishing, sharingBeautifully styled report with CSS
πŸ“‘ PDFPrinting, presentationsClean, professional document

Rich Research Data

{
"topic": "artificial intelligence ethics",
"sources": {
"wikipedia": 5,
"academic": 10,
"news": 5,
"general": 10
},
"processed_content": {
"summary": "Comprehensive executive summary...",
"key_findings": ["Finding 1", "Finding 2", "..."],
"themes": ["Theme 1", "Theme 2", "..."],
"entities": ["Entity 1", "Entity 2", "..."]
}
}

πŸ”§ Configuration Options

{
"topic": "your research topic here",
"outputFormats": ["json", "html", "pdf"],
"maxSourcesPerType": 10,
"includeWikipedia": true,
"includeAcademic": true,
"includeNews": true,
"includeGeneral": true,
"searchProviders": ["duckduckgo"],
"requestTimeout": 30,
"maxRetries": 3,
"debug": false
}

Parameter Reference

ParameterTypeDefaultDescription
topicstringrequired🎯 Your research topic or question
outputFormatsarray["json", "html", "pdf"]πŸ“„ Output formats to generate
maxSourcesPerTypeinteger10πŸ“Š Sources per category (1-20)
includeWikipediabooleantrueπŸ“– Include Wikipedia articles
includeAcademicbooleantrueπŸŽ“ Include academic papers
includeNewsbooleantrueπŸ“° Include news articles
includeGeneralbooleantrue🌐 Include general web content
searchProvidersarray["duckduckgo"]πŸ” Search engines to use
requestTimeoutinteger30⏱️ Request timeout (seconds)
maxRetriesinteger3πŸ”„ Retry attempts on failure
proxyConfigurationobjectnullπŸ›‘οΈ Apify proxy settings
debugbooleanfalseπŸ› Enable verbose logging

🌐 Data Sources

ResearchGPT taps into 6+ authoritative sources:

SourceTypeWhat You Get
🌍 WikipediaKnowledge BaseFoundational articles via MediaWiki API
πŸ“š arXivAcademicPre-print papers in physics, CS, math, and more
πŸ”¬ Semantic ScholarAcademic200M+ papers with citation analysis
πŸ“– OpenAlexAcademicOpen catalog of scholarly works
πŸ“‘ CrossRefAcademicDOI metadata and citations
πŸ“° News SourcesCurrent EventsLatest articles via smart extraction
🌐 General WebInsightsCurated web content with readability algorithms

πŸ›‘οΈ Production-Grade Features

Built for reliability and scale:

  • ⚑ Smart Caching - 5-minute TTL prevents redundant requests
  • πŸ”„ Retry Logic - Exponential backoff with jitter
  • 🚦 Rate Limiting - Respects API limits automatically
  • πŸ” Deduplication - MD5-based content fingerprinting
  • 🌐 Connection Pooling - Efficient HTTP management
  • πŸ›‘οΈ Error Handling - Graceful fallbacks, never crashes

πŸ’» Local Development

# Clone the repository
git clone https://github.com/your-repo/researchgpt-deep-research-agent
cd researchgpt-deep-research-agent
# Create virtual environment
python -m venv .venv
.venv\Scripts\activate # Windows
source .venv/bin/activate # macOS/Linux
# Install dependencies
pip install -r requirements.txt
# Run locally
python run_local.py

πŸ“ Project Structure

researchgpt-deep-research-agent/
β”œβ”€β”€ πŸ“‚ .actor/
β”‚ └── actor.json # Apify configuration
β”œβ”€β”€ πŸ“‚ src/
β”‚ β”œβ”€β”€ __init__.py
β”‚ └── main.py # πŸš€ Main Apify entry point
β”œβ”€β”€ πŸ“‚ scrapers/
β”‚ β”œβ”€β”€ base_scraper.py # Base class with retry/caching
β”‚ β”œβ”€β”€ academic_scraper.py # arXiv, Semantic Scholar, etc.
β”‚ β”œβ”€β”€ wikipedia_scraper.py # MediaWiki API
β”‚ β”œβ”€β”€ news_scraper.py # News extraction
β”‚ β”œβ”€β”€ general_scraper.py # Web scraping
β”‚ └── search_engine.py # Multi-provider search
β”œβ”€β”€ πŸ“‚ processors/
β”‚ └── content_processor.py # NLP processing
β”œβ”€β”€ πŸ“‚ output/
β”‚ └── output_generator.py # Report generation
β”œβ”€β”€ πŸ“„ Dockerfile # Container definition
β”œβ”€β”€ πŸ“„ requirements.txt # Dependencies
└── πŸ“„ README.md # You are here!

πŸš€ Deploy to Apify

npm install -g apify-cli
apify login
apify push

Option 2: GitHub Integration

  1. Push to GitHub
  2. Apify Console β†’ Create Actor β†’ Link to GitHub
  3. Auto-builds on every push! πŸ”„

πŸ“ˆ Performance Tips

TipImpact
Lower maxSourcesPerType⚑ Faster results
Disable unused sourcesπŸš€ Skip what you don't need
Use single search providerπŸ“‰ Reduce API calls
Enable debug modeπŸ” Troubleshoot issues

πŸ€” FAQ


πŸ†š Why ResearchGPT?

FeatureResearchGPTManual ResearchOther Tools
⏱️ Time3 minutes3+ hours30+ minutes
πŸ“š Sources6+ databasesLimitedUsually 1-2
πŸ“„ OutputJSON + HTML + PDFManual formattingSingle format
πŸ’° CostPay per runYour time = $$$$Subscription
πŸ”§ SetupZeroN/AAPI keys needed

πŸ“ Example Topics

Get inspired! Here are some topics that work great:

  • "artificial intelligence ethics and regulation 2025"
  • "quantum computing practical applications"
  • "climate change solutions renewable energy"
  • "cryptocurrency DeFi market analysis"
  • "remote work productivity research"
  • "mental health digital therapeutics"
  • "gene editing CRISPR medical applications"
  • "electric vehicles battery technology"

🀝 Support & Community


πŸ“„ License

MIT License - Use it freely, commercially or personally.