
Clutch Scraper Pro
Pricing
from $0.90 / 1,000 results

Clutch Scraper Pro
The most comprehensive and reliable Clutch scraper on Apify marketplace just by Link. Extract complete company profiles with 50+ data fields, including reviews, intelligent deduplication, KV caching, and blazing-fast performance.
0.0 (0)
Pricing
from $0.90 / 1,000 results
0
5
5
Last modified
21 days ago
🏆 Clutch Scraper Pro - Most Complete Data Extraction
The most comprehensive and reliable Clutch scraper on Apify marketplace. Extract complete company profiles with 70+ data fields including pricing information, detailed business metrics, enhanced review analysis, LinkedIn company pages, business verification details, credit reports, and social media links. Features intelligent deduplication, KV caching, and blazing-fast performance.
🚀 Why Choose This Scraper?
✨ Why Choose This Scraper?
- 70+ Data Fields: The most complete profile coverage on Clutch including pricing & business details
- Comprehensive Business Data: Min project size, hourly rates, employee count, founding year 🆕
- Enhanced Review Analysis: Project summaries, feedback analysis, detailed ratings breakdown 🆕
- Client Testimonials: Complete "What Clients Have Said" sections with pricing insights 🆕
- LinkedIn Company Pages: Smart extraction of real LinkedIn company URLs (not policy links)
- Business Verification: Complete Business Entity details, Credit Risk Assessment, legal filings
- Social Media Links: LinkedIn, Facebook, X (Twitter), Instagram from official social sections
- Intelligent Deduplication: Removes duplicates across list pages and runs
- Real Domain Extraction: Optional redirect following for accurate domains
- KV Profile Cache: Reuse parsed detail pages across runs to save time and CUs
- Apify Proxy Support: Use residential/datacenter groups and optional country pinning
- Review Sorting: relevance, recent, ratingHighToLow, ratingLowToHigh
- Robust Error Handling: Retries, timeouts, and safe fallbacks
- Detailed Statistics: Duplicates, errors, speed, runtime summary
- Multiple Export Formats: CSV, Excel, JSON, XML with clean, flat columns
- Blazing Fast: 90+ companies/minute in LIST_DETAIL in real-world runs
📊 Data Fields Extracted
Company Information
- Company name, logo, tagline
- Clutch URL and website URL
- Real domain (with redirect following option)
- Rating and reviews count
- Verification status (VERIFIED, PREMIER VERIFIED)
- Founded year
- Timezone
- Full description
Business Details 🆕
- Min project size: Minimum project budget requirement ($USD)
- Hourly rate range: From/to hourly rates ($USD/hr)
- Employees count: Team size range (e.g., "50-249", "2-9")
- Year founded: Company establishment year
- Most common project size: Typical project budget based on reviews
- Client testimonials: "What Clients Have Said" summary with pricing insights
- Multiple locations: Complete address information with country/region
Service Breakdown
- Services with percentages
- Focus areas with percentages
- Industries served with percentages
- Client focus (size/type) with percentages
Social Media & Contact
- LinkedIn Company Pages: Smart URL extraction (e.g.,
/company/acme-corp/
) - Facebook Pages: Official business pages
- X (Twitter): Company Twitter handles
- Instagram: Business Instagram accounts
- Real website domains (with redirect resolution)
Verification & Trust
- Business Entity: Legal name, status, jurisdiction, formation date, entity ID
- Credit Report Results: International Credit Risk Assessment, source, last updated
- Verification Status: VERIFIED, PREMIER VERIFIED, GOLD VERIFIED badges
- Legal Filings: Bankruptcy, tax liens, judgments, collections count
- Last Updated: When verification data was last refreshed
Reviews (Optional)
- Review title and rating
- Review date and comments
- Reviewer details
- Project services, size, and length
- Quality, schedule, cost ratings
- Willingness to refer rating
Additional Data
- Multiple office addresses
- Portfolio items
- Custom fields via extension function
🎯 Scraping Modes
Mode | Description | Best For |
---|---|---|
LIST | Extract company data from listing pages only | Quick overview, basic info |
LIST_DETAIL | Extract from listings + full detail pages | Complete profiles with all data |
LIST_WEBSITES | Optimized mode - details only for missing websites | Fast extraction with websites |
🔧 Input Configuration
{"startUrls": [{ "url": "https://clutch.co/us/agencies/digital-strategy" }],"scrapeMode": "LIST_DETAIL","maxResults": 300,"includeReviews": true,"maxReviewsPerCompany": 5,"includePortfolio": true,"exportFormat": "CSV","followRedirects": true,"clearDataset": true}
Input Parameters
Parameter | Type | Default | Description |
---|---|---|---|
startUrls | Array | Required | Clutch.co URLs to scrape (list or profile pages) |
scrapeMode | String | LIST_DETAIL | Scraping mode: LIST , LIST_DETAIL , LIST_WEBSITES |
includeReviews | Boolean | false | Extract company reviews |
maxReviewsPerCompany | Number | 10 | Maximum reviews per company |
reviewSortBy | String | relevance | Sort reviews: relevance , recent , ratingHighToLow , ratingLowToHigh |
includePortfolio | Boolean | true | Extract portfolio items |
maxResults | Number | - | Maximum total companies to extract |
maxPagesPerSearch | Number | - | Maximum pages per search URL |
maxRequestsPerCrawl | Number | 0 | Maximum total requests (0 = unlimited) |
exportFormat | String | JSON | Export format: JSON , CSV , EXCEL , XML |
followRedirects | Boolean | false | Follow redirects to get real domains |
useApifyProxy | Boolean | false | Use Apify Proxy |
apifyProxyGroups | Array | ["RESIDENTIAL"] | Proxy groups to use |
apifyProxyCountry | String | - | Proxy country code (e.g., US ) |
useHeadless | Boolean | true | Run browser in headless mode |
clearDataset | Boolean | false | Clear dataset before run |
extendOutputFunction | String | - | Custom JS function to extend output |
useProfileCache | Boolean | false | Use KV cache for detail pages |
profileCacheTtlDays | Number | 0 | Cache TTL in days (0 = never expire) |
profileCacheVersion | String | "" | Bump to invalidate old cache |
🚀 Performance
- In a recent LIST_DETAIL benchmark on
https://clutch.co/us/agencies/digital-strategy
with reviews enabled (max 5 per company), the scraper achieved:- Average speed: ~90 companies/minute
- Total unique companies: 9,750
- Runtime: ~108 minutes
- Settings:
scrapeMode=LIST_DETAIL
,includeReviews=true
,reviewSortBy=recent
,maxReviewsPerCompany=5
,exportFormat=CSV
Actual speed depends on filters, network, anti-bot pressure, and selected options (reviews/portfolio/proxy).
📈 Performance & Statistics
After each run, you'll see detailed statistics:
================================================================================SCRAPING COMPLETED SUCCESSFULLY================================================================================📊 SUMMARY STATISTICS:• Total unique companies collected: 275• Total companies processed (with duplicates): 436• Duplicates removed: 161• Runtime: 3m 51s• Average speed: 71 companies/minute🔍 SCRAPING DETAILS:• Scrape mode: LIST_DETAIL• Reviews included: Yes (max 3 per company)• Export format: CSV• Exported file: clutch-export-2024-01-15.csv✅ NO ERRORS DETECTED🎯 PERFORMANCE METRICS:• Average time per company: 842ms• Total runtime: 231614ms================================================================================
💾 Output Formats
CSV Export
- Perfectly formatted with all fields as columns
- Special handling for arrays (services, industries, etc.)
- HTML entities decoded and cleaned
- Review and portfolio data in indexed columns
Output Schema (CSV/JSON)
Below is the complete list of fields the scraper can output and how they appear in CSV (flattened) vs JSON (nested):
-
Top-level fields (LIST and LIST_DETAIL)
companyName
(string)clutchUrl
(string)websiteUrl
(string)domain
(string, withoutwww.
)logoUrl
(string)tagline
(string)rating
(number)reviewsCount
(number)verificationStatus
(string:VERIFIED
|PREMIER VERIFIED
|GOLD VERIFIED
| empty)minProjectSize
(number, e.g., 25000) 🆕hourlyRateFrom
(number) 🆕hourlyRateTo
(number) 🆕employees
(string, e.g., "50-249", "2-9") 🆕yearFounded
(number, e.g., 2015) 🆕location
(string)
-
Social Media Links 🆕
- JSON:
socialLinks: { linkedIn, facebook, x, instagram }
- CSV:
socialLinks_linkedIn
,socialLinks_facebook
,socialLinks_x
,socialLinks_instagram
- JSON:
-
Services (list page chips)
- JSON:
services: [{ name: string, percent|percentage: number }]
- CSV:
services_{Service_Name}
= numeric percentage
- JSON:
-
Focus Areas (detail)
- JSON:
focusAreas: [{ name: string, percentage: number }]
- CSV:
focusAreas_{Area_Name}
= numeric percentage
- JSON:
-
Industries (detail)
- JSON:
industries: [{ name: string, percentage: number }]
- CSV:
industries_{Industry_Name}
= numeric percentage
- JSON:
-
Client Focus (detail)
- JSON:
clientFocus: [{ name: string, percentage: number }]
- CSV:
clientFocus_{Client_Type}
= numeric percentage
- JSON:
-
Most Common Project Size (detail)
- JSON:
mostCommonProjectSize
(number) - CSV:
mostCommonProjectSize
- JSON:
-
Client Testimonials (detail) 🆕
- JSON:
clientTestimonial
(string, "What Clients Have Said" section) - CSV:
clientTestimonial
- JSON:
-
Timezone (detail)
- JSON:
timezone
(string) - CSV:
timezone
- JSON:
-
Full Description (detail)
- JSON:
fullDescription
(string) - CSV:
fullDescription
- JSON:
-
Addresses (detail)
- JSON:
addresses: [{ title, street, locality, region, postalCode, country, phone }]
- CSV:
addresses_0_title
,addresses_0_street
,addresses_0_locality
,addresses_0_region
,addresses_0_postalCode
,addresses_0_country
,addresses_0_phone
, ... (indexed per address)
- JSON:
-
Portfolio (detail, optional)
- JSON:
portfolio: [{ image, description }]
- CSV:
portfolio_0_image
,portfolio_0_description
,portfolio_1_image
, ...
- JSON:
-
Reviews (detail, optional) 🆕 Enhanced
- JSON per review item:
name
(string)rating
(number)date
(string)comments
(string; main review quote)projectSummary
(string; detailed project description) 🆕feedbackSummary
(string; results and outcomes summary) 🆕detailedRatings
(array; Quality, Schedule, Cost ratings with comments) 🆕fullReview
(string; complete review text) 🆕reviewerName
(string)reviewerPosition
(string)projectServices
(string; semicolon-separated)projectSize
(string)projectLength
(string)
- CSV (indexed and flattened):
review_0_name
,review_0_rating
,review_0_date
,review_0_comments
,review_0_projectSummary
,review_0_feedbackSummary
,review_0_detailedRatings_0_category
,review_0_detailedRatings_0_rating
,review_0_detailedRatings_0_comment
,review_0_fullReview
,review_0_reviewerName
,review_0_reviewerPosition
,review_0_projectServices
,review_0_projectSize
,review_0_projectLength
, ...
- JSON per review item:
-
Verification Details (detail) 🆕
- JSON:
verificationDetails: { verificationStatus, businessEntity: { name, status, jurisdictionOfFormation, ID, source, lastUpdated, dateOfFormation }, paymentLegalFilings: { bankruptcy, taxLienFilings, judgementFilings, collectionsCount, source, lastUpdated, fullBusinessCreditReport } }
- CSV (flattened):
verificationDetails_verificationStatus
verificationDetails_businessEntity_name
,verificationDetails_businessEntity_status
,verificationDetails_businessEntity_jurisdictionOfFormation
,verificationDetails_businessEntity_ID
,verificationDetails_businessEntity_source
,verificationDetails_businessEntity_lastUpdated
,verificationDetails_businessEntity_dateOfFormation
verificationDetails_paymentLegalFilings_bankruptcy
,verificationDetails_paymentLegalFilings_taxLienFilings
,verificationDetails_paymentLegalFilings_judgementFilings
,verificationDetails_paymentLegalFilings_collectionsCount
,verificationDetails_paymentLegalFilings_source
,verificationDetails_paymentLegalFilings_lastUpdated
,verificationDetails_paymentLegalFilings_fullBusinessCreditReport
- JSON:
-
Credit Report Results (detail) 🆕
- JSON:
creditReportResults: { internationalCreditRiskAssessment, source, lastUpdated }
- CSV:
creditReportResults_internationalCreditRiskAssessment
,creditReportResults_source
,creditReportResults_lastUpdated
- JSON:
-
Verification Last Updated 🆕
- JSON/CSV:
verificationLastUpdated
(string, e.g., "August 8, 2025")
- JSON/CSV:
Notes:
- 🆕 Social media extraction from official "Connect with [Company] on Social" sections
- 🆕 Business verification data including legal entity details and credit assessments
- CSV экспорт автоматически декодирует HTML-сущности и чистит лишние пробелы
- Списки (reviews, portfolio, addresses) индексируются:
_0_
,_1_
, ... - Процентные поля сохраняются числом; фильтрация по ним в таблицах проста
JSON Output
Complete nested structure with all relationships preserved.
🛠️ Advanced Usage
Extract Companies with Specific Filters
{"startUrls": [{ "url": "https://clutch.co/agencies/digital-marketing?employees=10%20-%2049" }],"scrapeMode": "LIST_DETAIL","includeReviews": true,"followRedirects": true,"maxResults": 100}
Custom Data Extension
{"extendOutputFunction": "($, data) => ({customScore: data.rating * data.reviewsCount,scrapedAt: new Date().toISOString(),hasPortfolio: data.portfolio?.length > 0})"}
🐛 Error Handling
The scraper includes robust error handling:
- Continues on individual page failures
- Retries failed requests up to 3 times
- Handles frame detachment errors
- Manages navigation timeouts
- Provides detailed error statistics
🔄 Deduplication
Intelligent deduplication ensures you only get unique companies:
- Tracks companies globally across all pages
- Removes duplicates automatically
- Reports exact number of duplicates removed
- Maintains data quality and accuracy
🤝 Support
- 📧 Email: support@your-email.com
- 🐛 Issues: GitHub Issues
- 📖 Documentation: Full Docs
📄 License
ISC License - feel free to use in your projects!
Note: This scraper is not affiliated with Clutch.co. Please respect their Terms of Service and use responsibly.