Go to Apify Store Actor stats
Monthly active users 2
Monthly active users
Last modified 20 days ago
Last modified
[0.2.0]
Added
Rate limiting utility with token bucket algorithm for external API calls
HTTP client with configurable timeouts and automatic retry with exponential backoff
Custom error types for validation, API, crawl, configuration, and enrichment errors
Key-value store schema collections for input, state, stats, audit logs, caches, and errors
Browser pooling system with configurable pool size and automatic cleanup of idle browsers
robots.txt compliance checking with 1-hour TTL caching
Configurable proxy country code with ISO 3166-1 alpha-2 validation
SQL injection pattern detection for PDL queries
Complete signal detection implementation with parallel processing
CrawlOrchestrator cleanup call in finally block for proper resource management
Comprehensive dry run output with configuration summary, cost estimation, and helpful tips
estimateRunTime() and generateDryRunTips() helper methods in WorkflowController
Table of Contents in README for easier navigation
Cost Estimation section in README with Apify + API cost breakdown
Error Handling documentation in README
Rate Limits documentation for external APIs
Changed
Playwright browser options moved from launchOptions to contextOptions
Standardized naming conventions to camelCase throughout codebase
Improved error messages with more context
Enhanced PDL query builder with input validation
Extracted magic numbers to named constants
Tightened ESLint configuration with stricter rules
Improved input schema defaults
Dockerfile updated to run as non-root user with improved health check
TypeScript fields aligned with input schema defaults for optional parameters
README expanded from 368 to 535 lines with improved documentation
Jest configuration migrated from deprecated globals.ts-jest to new transform syntax
Dry run now pushes detailed JSON report to dataset for visibility in Apify UI
Support section in README rewritten for clarity
Fixed
Phone field mapping in contact discovery
Unnecessary await on synchronous log calls (67 ESLint await-thenable errors across 12 files)
Deprecated substr() method replaced with substring()
Redundant validation checks removed
Email validation API key handling for ZeroBounce and Hunter
CrawlOrchestrator variable scope for proper cleanup
Version synchronization between actor.json and package.json
Missing crawlMode validation
Incomplete signal detection loop
Hardcoded proxy country code now configurable
Playwright crawler per-request performance issue with browser pooling
JSON parse error handling for malformed AI responses
WorkflowController concurrency now respects user crawlConcurrency setting instead of hardcoded limits
Redundant Playwright retry logic removed from companyEnrichment fallback strategy
CrawlOrchestrator refactored to use persistent BrowserPool for improved performance
Input validation updated to enforce strict default values for stability
TypeScript errors in test/inputValidation.test.ts using proper type assertion
Removed fictional generatedBy metadata from actor.json
Removed "Coming soon" placeholder text from README
Security
Prefer HTTP headers for API keys; when a provider requires query-string keys, URLs are never logged
Enhanced SQL escaping with null byte removal, backslash escaping, and control character removal
Input validation for PDL query parameters to prevent injection attacks
Non-root Docker user for container security
Webhook URL validation to prevent SSRF attacks
Email addresses properly URL-encoded in validation API calls
[0.1.0]
Added
Initial pilot release of DRG Phantom Core
Stealth-grade processing engine with anti-detection measures
Multi-mode operation: Discovery & Enrichment, Enrichment Only, AI Scoring Only
TypeScript implementation with full type safety
SERP-based discovery module using stealth scraping techniques
SerpAPI integration for enhanced search capabilities
Company enrichment module with Clearbit API integration
Contact discovery system using People Data Labs (PDL)
LinkedIn enrichment via Proxycurl integration
Tech stack detection using Wappalyzer and BuiltWith
Multi-level email validation (regex, MX, SMTP)
Job posting signals for hiring activity detection
Funding round tracking for investment signals
Traffic analysis module for growth indicators
AI-powered scoring system (OpenAI GPT-4, Anthropic Claude)
CRM integrations (HubSpot, Pipedrive, GoHighLevel, Close.io)
Email platform support (Instantly, Smartlead, Lemlist)
Data warehouse export (Google BigQuery, Amazon Redshift)
Webhook support for custom integrations
Slack and Microsoft Teams notifications
Proxy management system with Apify proxy support
Audit logging for compliance tracking
Tenant isolation for multi-user support