# Redbubble Scraper (AI POWERED) (`ac_devth/redbubble-intelligent-crawler`) Actor

Automatically research any Redbubble niche with AI. Enter keywords, pick product types, and get back structured JSON analysis — saturation levels, bestseller signals, top design themes, and market gaps — powered by a stealth Firefox browser and LLM.

- **URL**: https://apify.com/ac\_devth/redbubble-intelligent-crawler.md
- **Developed by:** [A- Coding](https://apify.com/ac_devth) (community)
- **Categories:** AI, Automation, E-commerce
- **Stats:** 3 total users, 2 monthly users, 40.0% runs succeeded, 1 bookmarks
- **User rating**: 4.00 out of 5 stars

## Pricing

from $50.00 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Redbubble Market Intelligence Crawler

Automatically research any Redbubble niche with AI. Enter keywords, pick product types, and get back structured JSON analysis — saturation levels, bestseller signals, top design themes, and market gaps — powered by a stealth Firefox browser and LLM extraction.

---

### What It Does

For each keyword + product type combination you provide, the actor:

1. Opens Redbubble in a stealth Firefox browser (Camoufox — undetectable by bot filters)
2. Navigates via residential proxies for IP rotation and Cloudflare bypass
3. Runs global block detection — retries automatically if a challenge page is served
4. Extracts page content and sends it to an LLM
5. Returns a flat JSON record per job with your custom analysis

---

### Input Fields

#### Keywords *(required unless using Raw URLs)*
The niches or topics you want to research. Each keyword is paired with every product type you select, creating one crawl job per combination.

````

electrician
dark academia
nurse life
cottagecore

````

#### Product Types
Which Redbubble categories to search. Defaults to **All Products** for a broad view. Select multiple to compare saturation across categories for the same keyword.

Available options: All Products, T-Shirts, Stickers, Hoodies & Sweatshirts, Phone Cases, Art Prints, Posters, Mugs, Tote Bags, Throw Pillows, Laptop Skins, Notebooks & Journals, Tapestries, Masks.

#### Sort Order
Controls how Redbubble ranks results before the LLM sees them.

| Option | Best for |
|---|---|
| **Top Selling** *(default)* | Market research — shows proven bestsellers |
| Most Relevant | General search intent |
| Newest | Spotting emerging trends early |
| Price: Low / High | Pricing research |

#### LLM Extraction Instruction
The prompt sent to the AI for each page. Write it however you need — the actor adapts to any output shape. Examples:

- *"List every product title visible on this page as a JSON array"*
- *"Extract the price range and number of reviews for each item"*
- *"Identify any recurring color palettes or art styles in the top 20 results"*

The default instruction extracts: total results, saturation level, sales signals in top 10, recurring design themes, niche assessment, and demand/oversupply indicators.

#### Include Mature Content
Toggle adult content inclusion. Off by default.

#### Use Apify Proxy
Routes requests through Apify residential proxies. **Strongly recommended** — Redbubble uses Cloudflare and will block datacenter IPs. Enabled by default.

#### Proxy Groups
Which Apify proxy pool to use. Default: `RESIDENTIAL`.

#### Raw URLs *(advanced)*
Skip the keyword builder entirely and crawl specific URLs directly. Useful when you've already built a Redbubble URL with custom filters, or when crawling a non-Redbubble site.

---

### Example Inputs

**Simple — one keyword, one product type:**
```json
{
  "keywords": ["electrician"],
  "product_types": ["t-shirts"],
  "sort_order": "top selling"
}
````

**Batch research — multiple niches across multiple product types:**

```json
{
  "keywords": ["electrician", "nurse life", "dark academia"],
  "product_types": ["t-shirts", "stickers", "mugs"],
  "sort_order": "top selling"
}
```

This creates 9 crawl jobs (3 keywords × 3 product types).

**Custom extraction instruction:**

```json
{
  "keywords": ["cottagecore"],
  "product_types": ["all-departments"],
  "sort_order": "top selling",
  "instruction": "List the top 10 product titles visible, their approximate price, and any visible review counts. Return as a JSON object."
}
```

**Advanced — raw URLs:**

```json
{
  "urls": [
    "https://www.redbubble.com/shop?query=electrician&sortOrder=top+selling&iaCode=t-shirts",
    "https://www.redbubble.com/shop?query=nurse+life&sortOrder=recent&iaCode=stickers"
  ]
}
```

***

### Dyanmic Output Format (Example)

Each completed job produces one flat JSON record in the dataset. The fields returned by the LLM vary based on your instruction — the four metadata fields are always present:

```json
{
  "keyword": "electrician",
  "product_type": "t-shirts",
  "sort_order": "top selling",
  "url": "https://www.redbubble.com/shop?query=electrician&...",

  "total_results": "1000+",
  "saturation_level": "high",
  "sales_signals_in_top_10": 7,
  "recurring_design_themes": [
    "Funny electrician puns",
    "Vintage lightning bolt graphics",
    "Minimalist wiring diagrams"
  ],
  "niche_assessment": "opportunity",
  "demand_oversupply_indicators": {
    "strong_demand_signals": ["Multiple bestseller badges", "Recurring popular themes"],
    "oversupply_indicators": ["High listing volume"]
  }
}
```

The **Overview tab** in Apify Console shows the stable metadata columns (`keyword`, `product_type`, `sort_order`, `saturation_level`, `niche_assessment`, `url`). The **All Fields tab** shows the complete LLM output.

Failed jobs are non-fatal — the run continues and records the error:

```json
{
  "keyword": "...",
  "product_type": "...",
  "sort_order": "...",
  "url": "https://...",
  "error": "reason for failure"
}
```

***

### Performance & Limits

| Setting | Default | Env var to override |
|---|---|---|
| Rate limit | 35 req/min | `RATE_LIMIT_RPM` |
| Concurrency | 3 parallel browsers | `MAX_CONCURRENCY` |
| Retries | 2 (backoff: 2s, 4s) | `MAX_RETRIES` |
| Timeout | 120s per page | `CRAWL_TIMEOUT_SECONDS` |

Typical cost: **~$0.06 per keyword/product combination.** A batch of 50 keywords × 3 product types (150 jobs) runs for roughly $9.

***

### Calling via API

Start a run:

```bash
curl -X POST \
  "https://api.apify.com/v2/acts/YOUR_USERNAME~camoufox-crawler/runs?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "keywords": ["electrician", "nurse life"],
    "product_types": ["t-shirts", "stickers"],
    "sort_order": "top selling"
  }'
```

Fetch results when done:

```bash
curl "https://api.apify.com/v2/acts/YOUR_USERNAME~camoufox-crawler/runs/last/dataset/items?token=YOUR_API_TOKEN"
```

***

### How Block Detection Works

After every page load the actor checks:

- **Page title** — scanned for Cloudflare signals ("Just a moment", "Attention Required", etc.)
- **Body HTML** — scanned for known challenge fingerprints (`cf_chl_opt`, `recaptcha`, `px-captcha`, etc.)

If a block is detected the request is retried with a fresh browser session and proxy rotation. After all retries are exhausted the job is recorded as failed and the run continues.

# Actor input Schema

## `keywords` (type: `array`):

One or more niche or product keywords to research. Each keyword is combined with every selected product type to form one crawl job.

Examples: electrician, cottagecore, dark academia, motivational quotes, nurse life

## `product_types` (type: `array`):

Which Redbubble product categories to search. Each keyword × each product type = one crawl job. Leave as 'All Products' for a broad market overview, or pick specific types to compare saturation across categories.

## `sort_order` (type: `string`):

How to sort Redbubble search results. 'Top Selling' is recommended for market research as it surfaces proven bestsellers. Use 'Newest' to spot emerging trends.

## `include_mature` (type: `boolean`):

Whether to include mature/adult content in results.

## `instruction` (type: `string`):

Tell the AI exactly what to extract and return from each page. The output will always be a JSON object with a 'results' key. You can customise this to extract anything — saturation signals, design themes, pricing, bestseller patterns, gaps, etc.

## `urls` (type: `array`):

Optional. Bypass the keyword builder and crawl these exact URLs directly. Useful for targeting a specific Redbubble page with custom filters already applied, or for crawling non-Redbubble sites entirely.

## Actor input object example

```json
{
  "keywords": [
    "electrician",
    "cottagecore",
    "dark academia"
  ],
  "product_types": [
    "all-departments"
  ],
  "sort_order": "top selling",
  "include_mature": false,
  "instruction": "Analyze this Redbubble search results page. Return ONLY a valid JSON object. Focus on: exact total number of results, saturation level (low/medium/high), sales signals in top 10 results (reviews, favorites, bestseller badges), 3-5 specific top design styles/themes, gap assessment, and clear indicators of demand or oversupply.",
  "urls": [
    "https://www.redbubble.com/shop?query=electrician&sortOrder=top+selling&iaCode=t-shirts",
    "https://www.redbubble.com/shop?query=nurse+life&sortOrder=recent&iaCode=stickers"
  ]
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("ac_devth/redbubble-intelligent-crawler").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("ac_devth/redbubble-intelligent-crawler").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call ac_devth/redbubble-intelligent-crawler --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=ac_devth/redbubble-intelligent-crawler",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Redbubble Scraper (AI POWERED)",
        "description": "Automatically research any Redbubble niche with AI. Enter keywords, pick product types, and get back structured JSON analysis — saturation levels, bestseller signals, top design themes, and market gaps — powered by a stealth Firefox browser and LLM.",
        "version": "0.0",
        "x-build-id": "6Be1glNtVliGpDuBL"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/ac_devth~redbubble-intelligent-crawler/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-ac_devth-redbubble-intelligent-crawler",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/ac_devth~redbubble-intelligent-crawler/runs": {
            "post": {
                "operationId": "runs-sync-ac_devth-redbubble-intelligent-crawler",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/ac_devth~redbubble-intelligent-crawler/run-sync": {
            "post": {
                "operationId": "run-sync-ac_devth-redbubble-intelligent-crawler",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "keywords": {
                        "title": "Search Keywords",
                        "type": "array",
                        "description": "One or more niche or product keywords to research. Each keyword is combined with every selected product type to form one crawl job.\n\nExamples: electrician, cottagecore, dark academia, motivational quotes, nurse life",
                        "items": {
                            "type": "string"
                        }
                    },
                    "product_types": {
                        "title": "Product Types",
                        "type": "array",
                        "description": "Which Redbubble product categories to search. Each keyword × each product type = one crawl job. Leave as 'All Products' for a broad market overview, or pick specific types to compare saturation across categories.",
                        "items": {
                            "type": "string",
                            "enum": [
                                "all-departments",
                                "t-shirts",
                                "stickers",
                                "hoodies-sweatshirts",
                                "phone-cases",
                                "art-prints",
                                "posters",
                                "mugs",
                                "tote-bags",
                                "throw-pillows",
                                "laptop-skins",
                                "notebooks-journals",
                                "tapestries",
                                "masks"
                            ],
                            "enumTitles": [
                                "All Products",
                                "T-Shirts",
                                "Stickers",
                                "Hoodies & Sweatshirts",
                                "Phone Cases",
                                "Art Prints",
                                "Posters",
                                "Mugs",
                                "Tote Bags",
                                "Throw Pillows",
                                "Laptop Skins",
                                "Notebooks & Journals",
                                "Tapestries",
                                "Masks"
                            ]
                        },
                        "default": [
                            "all-departments"
                        ]
                    },
                    "sort_order": {
                        "title": "Sort Order",
                        "enum": [
                            "top selling",
                            "relevant",
                            "recent",
                            "price-asc",
                            "price-desc"
                        ],
                        "type": "string",
                        "description": "How to sort Redbubble search results. 'Top Selling' is recommended for market research as it surfaces proven bestsellers. Use 'Newest' to spot emerging trends.",
                        "default": "top selling"
                    },
                    "include_mature": {
                        "title": "Include Mature Content",
                        "type": "boolean",
                        "description": "Whether to include mature/adult content in results.",
                        "default": false
                    },
                    "instruction": {
                        "title": "LLM Extraction Instruction",
                        "type": "string",
                        "description": "Tell the AI exactly what to extract and return from each page. The output will always be a JSON object with a 'results' key. You can customise this to extract anything — saturation signals, design themes, pricing, bestseller patterns, gaps, etc.",
                        "default": "Analyze this Redbubble search results page. Return ONLY a valid JSON object. Focus on: exact total number of results, saturation level (low/medium/high), sales signals in top 10 results (reviews, favorites, bestseller badges), 3-5 specific top design styles/themes, gap assessment, and clear indicators of demand or oversupply."
                    },
                    "urls": {
                        "title": "Raw URLs (Advanced)",
                        "type": "array",
                        "description": "Optional. Bypass the keyword builder and crawl these exact URLs directly. Useful for targeting a specific Redbubble page with custom filters already applied, or for crawling non-Redbubble sites entirely.",
                        "items": {
                            "type": "string"
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
