Apify Scrappey Actor

A powerful web scraping solution that combines Apify's actor infrastructure with Scrappey's advanced anti-detection capabilities. This actor helps you scrape any website while bypassing common anti-bot protections like Cloudflare, Datadome, and PerimeterX.

Scrappey API Integration

🚀 Key Features

  • Advanced Protection Bypass - Handles Cloudflare, Datadome, PerimeterX, and other anti-bot systems
  • Session Management - Maintains persistent browser sessions for efficient scraping
  • Smart Proxy Rotation - Automatic proxy management with country-specific options
  • Browser Fingerprint Randomization - Prevents detection through browser fingerprinting
  • Comprehensive Data Extraction - Captures HTML, cookies, headers, and more
  • Error Handling - Robust error handling with detailed error codes and messages

📋 Input Options

{
  "scrappeyApiKey": "your-api-key",
  "url": "https://example.com",
  "requestType": "browser",   // "browser" or "request"
  "customHeaders": {},        // Custom HTTP headers
  "browserActions": [],       // Automated browser actions
  "session": null,            // Session ID for persistent browsing
  "proxyCountry": null,       // Specific country for proxy
  "cookiejar": null,          // Pre-set cookies
  "includeImages": false,     // Include image URLs in response
  "includeLinks": false       // Include link URLs in response
}
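
If you run the actor programmatically instead of through the Apify console, the same input object can be passed with the apify-client package. A minimal sketch in TypeScript; the Apify token and the actor ID are placeholders, and the field values mirror the options above:

import { ApifyClient } from 'apify-client';

async function main(): Promise<void> {
  // Placeholder token and actor ID; substitute your own values.
  const client = new ApifyClient({ token: 'YOUR_APIFY_TOKEN' });

  const input = {
    scrappeyApiKey: 'your-api-key',
    url: 'https://example.com',
    requestType: 'browser',
  };

  // Start an actor run and wait for it to finish.
  const run = await client.actor('username/scrape-and-bypass-any-url-using-scrappey').call(input);

  // The scraped results land in the run's default dataset.
  const { items } = await client.dataset(run.defaultDatasetId).listItems();
  console.log(items);
}

main().catch(console.error);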

📦 Output Data Structure

The actor stores the following data in the Apify dataset:

{
  "url": "scraped-url",
  "verified": true/false,          // Request verification status
  "cookieString": "cookie-string", // Formatted cookie string
  "responseHeaders": {},           // Response HTTP headers
  "requestHeaders": {},            // Request HTTP headers
  "html": "page-html",             // Raw HTML content
  "innerText": "page-text",        // Page text content
  "cookies": [],                   // Array of cookies
  "ipInfo": {},                    // IP information
  "status": 200,                   // HTTP status code
  "timeElapsed": "1.2s",           // Request duration
  "session": "session-id",         // Session identifier
  "localStorage": {},              // Browser localStorage data
  "timestamp": "ISO-date"          // Timestamp of scrape
}
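
Every scrape produces one dataset item with the fields above. When consuming those items in TypeScript, a small type for the fields you actually use keeps downstream code honest; the interface below is a sketch covering only a subset of the structure shown above:

// Subset of the dataset item fields documented above.
interface ScrappeyResult {
  url: string;
  verified: boolean;
  status: number;
  cookieString: string;
  session: string;
  html: string;
}

// Keep only pages whose request was verified and returned HTTP 200.
function verifiedPages(items: ScrappeyResult[]): ScrappeyResult[] {
  return items.filter((item) => item.verified && item.status === 200);
}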

🛠️ Common Use Cases

  1. E-commerce Scraping

    • Product details from protected stores
    • Price monitoring (see the sketch after this list)
    • Inventory tracking
  2. Login-Protected Content

    • Session management for authenticated scraping
    • Cookie handling for maintaining login state
  3. Anti-Bot Protected Sites

    • Cloudflare challenge bypass
    • Datadome protection handling
    • PerimeterX mitigation
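
For the price-monitoring case referenced above, a common pattern is to loop a list of product URLs through the actor and pull a price out of the returned html field. A hedged sketch; the store URLs, the actor ID, and the price regex are illustrative placeholders you would adapt to the target site:

import { ApifyClient } from 'apify-client';

async function monitorPrices(): Promise<void> {
  const client = new ApifyClient({ token: 'YOUR_APIFY_TOKEN' });

  // Hypothetical product pages on a protected store.
  const productUrls = [
    'https://example-store.com/product/1',
    'https://example-store.com/product/2',
  ];

  for (const url of productUrls) {
    // Placeholder actor ID; use the ID of this actor in your account.
    const run = await client.actor('username/scrape-and-bypass-any-url-using-scrappey').call({
      scrappeyApiKey: 'your-api-key',
      url,
      requestType: 'browser',
    });

    const { items } = await client.dataset(run.defaultDatasetId).listItems();
    const html = String(items[0]?.html ?? '');

    // Hypothetical price pattern; adjust to the markup of the store you monitor.
    const price = html.match(/\$\d+(?:\.\d{2})?/)?.[0] ?? 'not found';
    console.log(url, price);
  }
}

monitorPrices().catch(console.error);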

💡 Usage Examples

Basic Scraping

{
  "scrappeyApiKey": "your-api-key",
  "url": "https://example.com",
  "requestType": "browser"
}

Session-Based Scraping

{
  "scrappeyApiKey": "your-api-key",
  "url": "https://example.com",
  "requestType": "browser",
  "session": "my-session-id",
  "cookiejar": [
    {
      "name": "sessionId",
      "value": "abc123",
      "domain": "example.com",
      "path": "/"
    }
  ]
}

Geo-Targeted Scraping

{
  "scrappeyApiKey": "your-api-key",
  "url": "https://example.com",
  "proxyCountry": "UnitedStates",
  "includeImages": true,
  "includeLinks": true
}

⚠️ Error Handling

The actor handles common error scenarios:

Code      | Description        | Solution
CODE-0001 | Server overload    | Retry with backoff
CODE-0002 | Cloudflare blocked | Try different proxy
CODE-0010 | Datadome blocked   | Change proxy country
CODE-0029 | Too many sessions  | Wait for session cleanup
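
These codes arrive in the actor's error output, so retries can key off them. Below is a hedged sketch of an exponential-backoff wrapper; the retryable codes are taken from the table above, while the error shape (an object carrying a code property) and the retry policy itself are assumptions to adapt to your setup:

// Codes from the table above where a retry (possibly with a different proxy or country) can help.
const RETRYABLE_CODES = new Set(['CODE-0001', 'CODE-0002', 'CODE-0010', 'CODE-0029']);

// Run an async scrape call, retrying with exponential backoff on retryable errors.
async function withBackoff<T>(scrape: () => Promise<T>, maxAttempts = 5): Promise<T> {
  for (let attempt = 1; ; attempt += 1) {
    try {
      return await scrape();
    } catch (err) {
      const code = (err as { code?: string }).code ?? '';
      if (attempt >= maxAttempts || !RETRYABLE_CODES.has(code)) throw err;
      // Wait 1 s, 2 s, 4 s, ... before the next attempt.
      const delayMs = 1000 * 2 ** (attempt - 1);
      await new Promise((resolve) => setTimeout(resolve, delayMs));
    }
  }
}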

🚦 Best Practices

  1. Session Management

    • Use persistent sessions for related requests
    • Clean up sessions when done using sessions.destroy (see the sketch after this list)
  2. Proxy Usage

    • Rotate proxies for high-volume scraping
    • Use country-specific proxies for geo-restricted content
  3. Error Handling

    • Implement exponential backoff for retries
    • Monitor error rates by URL
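
Session cleanup referenced in point 1 has to go to the Scrappey API directly, since the actor input has no destroy option. A minimal sketch; the endpoint URL and payload shape are assumptions based on Scrappey's public HTTP API and should be verified against the documentation on scrappey.com:

// Assumed Scrappey endpoint and payload shape; verify against the scrappey.com docs.
async function destroySession(apiKey: string, session: string): Promise<void> {
  const response = await fetch(`https://publisher.scrappey.com/api/v1?key=${apiKey}`, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ cmd: 'sessions.destroy', session }),
  });
  if (!response.ok) {
    throw new Error(`sessions.destroy failed with HTTP ${response.status}`);
  }
}

// Example: tear down the session used in the Session-Based Scraping example above.
destroySession('your-api-key', 'my-session-id').catch(console.error);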

📚 Getting Started

  1. Setup

    git clone https://github.com/yourusername/apify-scrappey
    cd apify-scrappey
    npm install
  2. Configuration

    • Get your Scrappey API key from scrappey.com
    • Set up your input.json in the Apify console or locally
  3. Running Locally

    apify run
  4. Deployment

    apify login
    apify push

🔗 Resources

🆘 Support

📄 License

ISC License - Feel free to use this actor for your scraping needs!