
Clutch Scraper Pro
Pricing
from $0.90 / 1,000 results
Go to Apify Store

Clutch Scraper Pro
The most comprehensive and reliable Clutch scraper on Apify marketplace just by Link. Extract complete company profiles with 50+ data fields, including reviews, intelligent deduplication, KV caching, and blazing-fast performance.
0.0 (0)
Pricing
from $0.90 / 1,000 results
0
2
2
Last modified
a day ago
Changelog
All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
[2.1.0] - 2025-01-14
🆕 Business Intelligence & Pricing Data
New Company Fields
- Min project size: Minimum project budget requirement ($USD)
- Hourly rate range: From/to hourly rates ($USD/hr)
- Employees count: Team size range (e.g., "50-249", "2-9")
- Year founded: Company establishment year
- Client testimonials: "What Clients Have Said" summary with pricing insights
Enhanced Review Analysis
- Project Summary: Detailed project description extracted from reviews
- Feedback Summary: Results and outcomes summary
- Detailed Ratings: Quality, Schedule, Cost ratings with individual comments
- Full Review: Complete review text with improved text cleaning
Technical Improvements
- Advanced text cleaning: Removes newlines, tabs, excessive whitespace from all fields
- Position-based parsing: More reliable extraction using HTML structure analysis
- Robust fallback logic: Multiple extraction strategies for maximum data coverage
- Universal parsing: Works across different company types and profiles
🔧 Output Schema Updates
- Added new fields to CSV/JSON export
- Enhanced Apify Dataset Schema with new pricing and review fields
- Updated overview and reviews views in Apify Console UI
- Improved field labels and formatting
[2.0.0] - 2025-01-10
🚀 Major Enhancements
New Data Fields
- Domain extraction: Real company domains with intelligent parsing
- Numeric fields:
minProjectSize
,hourlyRateFrom/To
,employeesFrom/To
as numbers - Industries breakdown: Complete industry focus with percentages
- Client focus: Client size/type distribution with percentages
- Most common project size: Extracted as numeric value
- Timezone: Company timezone information
- Multiple addresses: Support for multiple office locations
- Enhanced verification: Detailed business entity and legal filing status
Performance & Reliability
- Intelligent deduplication: Global tracking prevents duplicate companies across pages
- Follow redirects: Option to follow redirects for accurate domain extraction
- Robust error handling: Graceful handling of frame detachment and timeout errors
- Increased timeouts: Protocol timeout increased to 3 minutes for stability
- Smart retry logic: Failed requests retry up to 3 times
Export Improvements
- CSV Excellence: Switched to
csv-writer
library for perfect CSV generation - Array formatting: Services, industries as separate columns with percentages
- HTML entity decoding: All special characters properly decoded
- Indexed columns: Reviews and portfolio items in numbered columns
- Clean data: Aggressive whitespace and quote cleaning
New Features
- LIST_WEBSITES mode: Optimized scraping mode for faster results
- Review sorting: Sort reviews by relevance, date, or rating
- Detailed statistics: Comprehensive runtime statistics with:
- Duplicate count tracking
- Error categorization
- Performance metrics
- Beautiful console output
📊 Statistics Dashboard
📊 SUMMARY STATISTICS:• Total unique companies collected• Duplicates removed• Runtime and speed metrics• Error breakdown by type• Performance analysis
🔧 Technical Improvements
- Enhanced TypeScript types
- Better memory management
- Optimized concurrency settings
- Improved logging with progress tracking
- Clean code architecture
🐛 Bug Fixes
- Fixed double-saving of companies in LIST_DETAIL mode
- Resolved CSV export issues with nested objects
- Fixed review parsing selectors
- Corrected domain extraction logic
- Fixed race conditions in parallel processing
[1.0.0] - 2024-01-09
Added
- Initial release of Clutch.co Scraper
- Support for scraping company listings with filters
- Detailed company profile extraction including:
- Basic information (name, website, tagline, rating)
- Verification status and details
- Business entity information
- Portfolio items
- Client reviews with ratings
- Service focus areas with percentages
- Founded date and full description
- Multiple scraping modes: LIST, LIST_DETAIL
- Pagination support with customizable limits
- Review scraping with configurable max reviews per company
- Portfolio scraping (optional)
- Custom data extension via
extendOutputFunction
- Dataset clearing option for development
- Support for both headless and headful browser modes
- Comprehensive error handling and retry logic
- TypeScript support with full type definitions
- Optimized Docker image with Puppeteer pre-installed
Features
- Flexible URL support: Direct profile URLs or filtered list pages
- Smart pagination: Handles Clutch's pagination with configurable limits
- Data enrichment: Merges data from list and detail pages
- Performance optimized: Configurable concurrency and request limits
- Developer friendly: TypeScript, detailed logging, clear error messages
Technical Details
- Built with Crawlee (formerly Apify SDK v3)
- Uses PuppeteerCrawler for JavaScript rendering
- Implements request routing for different page types
- Includes anti-blocking measures (proxy support, fingerprinting)
[0.1.0] - 2024-01-08
Added
- Initial development version
- Basic scraping functionality
- Testing and debugging features