Clutch Scraper Pro avatar
Clutch Scraper Pro

Pricing

from $0.90 / 1,000 results

Go to Apify Store
Clutch Scraper Pro

Clutch Scraper Pro

Developed by

Procoders

Procoders

Maintained by Community

The most comprehensive and reliable Clutch scraper on Apify marketplace just by Link. Extract complete company profiles with 50+ data fields, including reviews, intelligent deduplication, KV caching, and blazing-fast performance.

0.0 (0)

Pricing

from $0.90 / 1,000 results

0

5

5

Last modified

21 days ago

🏆 Clutch Scraper Pro - Most Complete Data Extraction

Apify Actor TypeScript License

The most comprehensive and reliable Clutch scraper on Apify marketplace. Extract complete company profiles with 70+ data fields including pricing information, detailed business metrics, enhanced review analysis, LinkedIn company pages, business verification details, credit reports, and social media links. Features intelligent deduplication, KV caching, and blazing-fast performance.

🚀 Why Choose This Scraper?

✨ Why Choose This Scraper?

  • 70+ Data Fields: The most complete profile coverage on Clutch including pricing & business details
  • Comprehensive Business Data: Min project size, hourly rates, employee count, founding year 🆕
  • Enhanced Review Analysis: Project summaries, feedback analysis, detailed ratings breakdown 🆕
  • Client Testimonials: Complete "What Clients Have Said" sections with pricing insights 🆕
  • LinkedIn Company Pages: Smart extraction of real LinkedIn company URLs (not policy links)
  • Business Verification: Complete Business Entity details, Credit Risk Assessment, legal filings
  • Social Media Links: LinkedIn, Facebook, X (Twitter), Instagram from official social sections
  • Intelligent Deduplication: Removes duplicates across list pages and runs
  • Real Domain Extraction: Optional redirect following for accurate domains
  • KV Profile Cache: Reuse parsed detail pages across runs to save time and CUs
  • Apify Proxy Support: Use residential/datacenter groups and optional country pinning
  • Review Sorting: relevance, recent, ratingHighToLow, ratingLowToHigh
  • Robust Error Handling: Retries, timeouts, and safe fallbacks
  • Detailed Statistics: Duplicates, errors, speed, runtime summary
  • Multiple Export Formats: CSV, Excel, JSON, XML with clean, flat columns
  • Blazing Fast: 90+ companies/minute in LIST_DETAIL in real-world runs

📊 Data Fields Extracted

Company Information

  • Company name, logo, tagline
  • Clutch URL and website URL
  • Real domain (with redirect following option)
  • Rating and reviews count
  • Verification status (VERIFIED, PREMIER VERIFIED)
  • Founded year
  • Timezone
  • Full description

Business Details 🆕

  • Min project size: Minimum project budget requirement ($USD)
  • Hourly rate range: From/to hourly rates ($USD/hr)
  • Employees count: Team size range (e.g., "50-249", "2-9")
  • Year founded: Company establishment year
  • Most common project size: Typical project budget based on reviews
  • Client testimonials: "What Clients Have Said" summary with pricing insights
  • Multiple locations: Complete address information with country/region

Service Breakdown

  • Services with percentages
  • Focus areas with percentages
  • Industries served with percentages
  • Client focus (size/type) with percentages

Social Media & Contact

  • LinkedIn Company Pages: Smart URL extraction (e.g., /company/acme-corp/)
  • Facebook Pages: Official business pages
  • X (Twitter): Company Twitter handles
  • Instagram: Business Instagram accounts
  • Real website domains (with redirect resolution)

Verification & Trust

  • Business Entity: Legal name, status, jurisdiction, formation date, entity ID
  • Credit Report Results: International Credit Risk Assessment, source, last updated
  • Verification Status: VERIFIED, PREMIER VERIFIED, GOLD VERIFIED badges
  • Legal Filings: Bankruptcy, tax liens, judgments, collections count
  • Last Updated: When verification data was last refreshed

Reviews (Optional)

  • Review title and rating
  • Review date and comments
  • Reviewer details
  • Project services, size, and length
  • Quality, schedule, cost ratings
  • Willingness to refer rating

Additional Data

  • Multiple office addresses
  • Portfolio items
  • Custom fields via extension function

🎯 Scraping Modes

ModeDescriptionBest For
LISTExtract company data from listing pages onlyQuick overview, basic info
LIST_DETAILExtract from listings + full detail pagesComplete profiles with all data
LIST_WEBSITESOptimized mode - details only for missing websitesFast extraction with websites

🔧 Input Configuration

{
"startUrls": [
{ "url": "https://clutch.co/us/agencies/digital-strategy" }
],
"scrapeMode": "LIST_DETAIL",
"maxResults": 300,
"includeReviews": true,
"maxReviewsPerCompany": 5,
"includePortfolio": true,
"exportFormat": "CSV",
"followRedirects": true,
"clearDataset": true
}

Input Parameters

ParameterTypeDefaultDescription
startUrlsArrayRequiredClutch.co URLs to scrape (list or profile pages)
scrapeModeStringLIST_DETAILScraping mode: LIST, LIST_DETAIL, LIST_WEBSITES
includeReviewsBooleanfalseExtract company reviews
maxReviewsPerCompanyNumber10Maximum reviews per company
reviewSortByStringrelevanceSort reviews: relevance, recent, ratingHighToLow, ratingLowToHigh
includePortfolioBooleantrueExtract portfolio items
maxResultsNumber-Maximum total companies to extract
maxPagesPerSearchNumber-Maximum pages per search URL
maxRequestsPerCrawlNumber0Maximum total requests (0 = unlimited)
exportFormatStringJSONExport format: JSON, CSV, EXCEL, XML
followRedirectsBooleanfalseFollow redirects to get real domains
useApifyProxyBooleanfalseUse Apify Proxy
apifyProxyGroupsArray["RESIDENTIAL"]Proxy groups to use
apifyProxyCountryString-Proxy country code (e.g., US)
useHeadlessBooleantrueRun browser in headless mode
clearDatasetBooleanfalseClear dataset before run
extendOutputFunctionString-Custom JS function to extend output
useProfileCacheBooleanfalseUse KV cache for detail pages
profileCacheTtlDaysNumber0Cache TTL in days (0 = never expire)
profileCacheVersionString""Bump to invalidate old cache

🚀 Performance

  • In a recent LIST_DETAIL benchmark on https://clutch.co/us/agencies/digital-strategy with reviews enabled (max 5 per company), the scraper achieved:
    • Average speed: ~90 companies/minute
    • Total unique companies: 9,750
    • Runtime: ~108 minutes
    • Settings: scrapeMode=LIST_DETAIL, includeReviews=true, reviewSortBy=recent, maxReviewsPerCompany=5, exportFormat=CSV

Actual speed depends on filters, network, anti-bot pressure, and selected options (reviews/portfolio/proxy).

📈 Performance & Statistics

After each run, you'll see detailed statistics:

================================================================================
SCRAPING COMPLETED SUCCESSFULLY
================================================================================
📊 SUMMARY STATISTICS:
• Total unique companies collected: 275
• Total companies processed (with duplicates): 436
• Duplicates removed: 161
• Runtime: 3m 51s
• Average speed: 71 companies/minute
🔍 SCRAPING DETAILS:
• Scrape mode: LIST_DETAIL
• Reviews included: Yes (max 3 per company)
• Export format: CSV
• Exported file: clutch-export-2024-01-15.csv
NO ERRORS DETECTED
🎯 PERFORMANCE METRICS:
• Average time per company: 842ms
• Total runtime: 231614ms
================================================================================

💾 Output Formats

CSV Export

  • Perfectly formatted with all fields as columns
  • Special handling for arrays (services, industries, etc.)
  • HTML entities decoded and cleaned
  • Review and portfolio data in indexed columns

Output Schema (CSV/JSON)

Below is the complete list of fields the scraper can output and how they appear in CSV (flattened) vs JSON (nested):

  • Top-level fields (LIST and LIST_DETAIL)

    • companyName (string)
    • clutchUrl (string)
    • websiteUrl (string)
    • domain (string, without www.)
    • logoUrl (string)
    • tagline (string)
    • rating (number)
    • reviewsCount (number)
    • verificationStatus (string: VERIFIED | PREMIER VERIFIED | GOLD VERIFIED | empty)
    • minProjectSize (number, e.g., 25000) 🆕
    • hourlyRateFrom (number) 🆕
    • hourlyRateTo (number) 🆕
    • employees (string, e.g., "50-249", "2-9") 🆕
    • yearFounded (number, e.g., 2015) 🆕
    • location (string)
  • Social Media Links 🆕

    • JSON: socialLinks: { linkedIn, facebook, x, instagram }
    • CSV: socialLinks_linkedIn, socialLinks_facebook, socialLinks_x, socialLinks_instagram
  • Services (list page chips)

    • JSON: services: [{ name: string, percent|percentage: number }]
    • CSV: services_{Service_Name} = numeric percentage
  • Focus Areas (detail)

    • JSON: focusAreas: [{ name: string, percentage: number }]
    • CSV: focusAreas_{Area_Name} = numeric percentage
  • Industries (detail)

    • JSON: industries: [{ name: string, percentage: number }]
    • CSV: industries_{Industry_Name} = numeric percentage
  • Client Focus (detail)

    • JSON: clientFocus: [{ name: string, percentage: number }]
    • CSV: clientFocus_{Client_Type} = numeric percentage
  • Most Common Project Size (detail)

    • JSON: mostCommonProjectSize (number)
    • CSV: mostCommonProjectSize
  • Client Testimonials (detail) 🆕

    • JSON: clientTestimonial (string, "What Clients Have Said" section)
    • CSV: clientTestimonial
  • Timezone (detail)

    • JSON: timezone (string)
    • CSV: timezone
  • Full Description (detail)

    • JSON: fullDescription (string)
    • CSV: fullDescription
  • Addresses (detail)

    • JSON: addresses: [{ title, street, locality, region, postalCode, country, phone }]
    • CSV: addresses_0_title, addresses_0_street, addresses_0_locality, addresses_0_region, addresses_0_postalCode, addresses_0_country, addresses_0_phone, ... (indexed per address)
  • Portfolio (detail, optional)

    • JSON: portfolio: [{ image, description }]
    • CSV: portfolio_0_image, portfolio_0_description, portfolio_1_image, ...
  • Reviews (detail, optional) 🆕 Enhanced

    • JSON per review item:
      • name (string)
      • rating (number)
      • date (string)
      • comments (string; main review quote)
      • projectSummary (string; detailed project description) 🆕
      • feedbackSummary (string; results and outcomes summary) 🆕
      • detailedRatings (array; Quality, Schedule, Cost ratings with comments) 🆕
      • fullReview (string; complete review text) 🆕
      • reviewerName (string)
      • reviewerPosition (string)
      • projectServices (string; semicolon-separated)
      • projectSize (string)
      • projectLength (string)
    • CSV (indexed and flattened):
      • review_0_name, review_0_rating, review_0_date, review_0_comments, review_0_projectSummary, review_0_feedbackSummary, review_0_detailedRatings_0_category, review_0_detailedRatings_0_rating, review_0_detailedRatings_0_comment, review_0_fullReview, review_0_reviewerName, review_0_reviewerPosition, review_0_projectServices, review_0_projectSize, review_0_projectLength, ...
  • Verification Details (detail) 🆕

    • JSON: verificationDetails: { verificationStatus, businessEntity: { name, status, jurisdictionOfFormation, ID, source, lastUpdated, dateOfFormation }, paymentLegalFilings: { bankruptcy, taxLienFilings, judgementFilings, collectionsCount, source, lastUpdated, fullBusinessCreditReport } }
    • CSV (flattened):
      • verificationDetails_verificationStatus
      • verificationDetails_businessEntity_name, verificationDetails_businessEntity_status, verificationDetails_businessEntity_jurisdictionOfFormation, verificationDetails_businessEntity_ID, verificationDetails_businessEntity_source, verificationDetails_businessEntity_lastUpdated, verificationDetails_businessEntity_dateOfFormation
      • verificationDetails_paymentLegalFilings_bankruptcy, verificationDetails_paymentLegalFilings_taxLienFilings, verificationDetails_paymentLegalFilings_judgementFilings, verificationDetails_paymentLegalFilings_collectionsCount, verificationDetails_paymentLegalFilings_source, verificationDetails_paymentLegalFilings_lastUpdated, verificationDetails_paymentLegalFilings_fullBusinessCreditReport
  • Credit Report Results (detail) 🆕

    • JSON: creditReportResults: { internationalCreditRiskAssessment, source, lastUpdated }
    • CSV: creditReportResults_internationalCreditRiskAssessment, creditReportResults_source, creditReportResults_lastUpdated
  • Verification Last Updated 🆕

    • JSON/CSV: verificationLastUpdated (string, e.g., "August 8, 2025")

Notes:

  • 🆕 Social media extraction from official "Connect with [Company] on Social" sections
  • 🆕 Business verification data including legal entity details and credit assessments
  • CSV экспорт автоматически декодирует HTML-сущности и чистит лишние пробелы
  • Списки (reviews, portfolio, addresses) индексируются: _0_, _1_, ...
  • Процентные поля сохраняются числом; фильтрация по ним в таблицах проста

JSON Output

Complete nested structure with all relationships preserved.

🛠️ Advanced Usage

Extract Companies with Specific Filters

{
"startUrls": [
{ "url": "https://clutch.co/agencies/digital-marketing?employees=10%20-%2049" }
],
"scrapeMode": "LIST_DETAIL",
"includeReviews": true,
"followRedirects": true,
"maxResults": 100
}

Custom Data Extension

{
"extendOutputFunction": "($, data) => ({
customScore: data.rating * data.reviewsCount,
scrapedAt: new Date().toISOString(),
hasPortfolio: data.portfolio?.length > 0
})"
}

🐛 Error Handling

The scraper includes robust error handling:

  • Continues on individual page failures
  • Retries failed requests up to 3 times
  • Handles frame detachment errors
  • Manages navigation timeouts
  • Provides detailed error statistics

🔄 Deduplication

Intelligent deduplication ensures you only get unique companies:

  • Tracks companies globally across all pages
  • Removes duplicates automatically
  • Reports exact number of duplicates removed
  • Maintains data quality and accuracy

🤝 Support

📄 License

ISC License - feel free to use in your projects!


Note: This scraper is not affiliated with Clutch.co. Please respect their Terms of Service and use responsibly.