# Python Crawlee Page Profiler (`automation-lab/python-crawlee-page-profiler`) Actor

Pilot Python/Crawlee actor that profiles HTML documents and extracts page metadata, headings, links, and text statistics from supplied URLs.

- **URL**: https://apify.com/automation-lab/python-crawlee-page-profiler.md
- **Developed by:** [Stas Persiianenko](https://apify.com/automation-lab) (community)
- **Categories:** Developer tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per event

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Python Crawlee Page Profiler

Profile HTML documents with a bounded Python/Crawlee pilot actor.

### Overview

Python Crawlee Page Profiler visits supplied URLs, downloads each HTML document with Crawlee for Python, parses it with BeautifulSoup, and writes one dataset item per page.

It extracts titles, meta descriptions, heading counts, link counts, approximate word counts, text samples, and optional sampled links.

This actor is intentionally small and non-critical. Its purpose is to validate whether Python/Crawlee should become a selective internal template for future Apify actors.

### Why this pilot exists

Our standard actor factory is TypeScript-based.

Python can be useful for document processing, file analysis, NLP-lite, PDF tooling, and library-heavy utilities.

Before adopting Python more broadly, we need one bounded pilot with normal Apify schemas, Dockerfile, dependencies, README, pricing events, and smoke-test instructions.

### Who is it for?

This actor is for developers, QA teams, content operators, and internal automation builders who need lightweight HTML page profiles.

It is also for maintainers evaluating whether Python/Crawlee can support selective Apify actor templates.

### What this actor is good for

Use it when you need a quick profile of server-rendered HTML pages.

Typical jobs include:

- checking whether pages have titles and meta descriptions
- counting H1 and H2 headings
- estimating visible text length
- sampling links from documentation or content pages
- comparing small groups of public landing pages
- validating Python actor packaging on Apify

### What this actor is not

This is not a full SEO crawler.

It does not render JavaScript.

It does not bypass anti-bot protection.

It does not deep-crawl discovered links.

It is a Python/Crawlee template validation pilot, not a premium production scraper.

### Input

Provide a list of URLs in `startUrls`.

Set `maxPages` to cap how many supplied URLs are processed.

Set `includeLinks` to include or omit sampled normalized links.

The actor accepts both Apify request-list source objects and plain URL strings.

### Input fields

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| `startUrls` | array | required | HTML document URLs to profile. |
| `maxPages` | integer | `5` | Maximum number of supplied URLs to process, capped at 100. |
| `includeLinks` | boolean | `true` | Include up to 25 normalized links from each page. |

### Example input

```json
{
  "startUrls": [
    { "url": "https://crawlee.dev/python/" },
    { "url": "https://docs.apify.com/sdk/python/" }
  ],
  "maxPages": 2,
  "includeLinks": true
}
````

### Output

Each dataset item represents one profiled page.

Fields include the final loaded URL, basic HTML metadata, heading counts, link counts, text statistics, sampled links, and the profiler identifier.

### Output fields

| Field | Type | Description |
| --- | --- | --- |
| `url` | string | Final loaded URL profiled by Crawlee. |
| `title` | string | Text content of the `<title>` element. |
| `metaDescription` | string | Content of the meta description tag, when present. |
| `statusCode` | number/null | HTTP response status code, when available. |
| `h1Count` | number | Number of H1 headings. |
| `h2Count` | number | Number of H2 headings. |
| `linkCount` | number | Number of usable anchors found. |
| `wordCount` | number | Approximate visible text word count. |
| `textSample` | string | First 500 characters of normalized page text. |
| `links` | array | Up to 25 normalized sampled links. |
| `profiler` | string | Implementation identifier. |

### Example output

```json
{
  "url": "https://example.com/",
  "title": "Example Domain",
  "metaDescription": "",
  "statusCode": 200,
  "h1Count": 1,
  "h2Count": 0,
  "linkCount": 1,
  "wordCount": 28,
  "textSample": "Example Domain This domain is for use in illustrative examples...",
  "links": ["https://www.iana.org/domains/example"],
  "profiler": "python-crawlee-beautifulsoup"
}
```

### Pricing and cost

This pilot uses pay-per-event pricing.

| Event | When it is charged | Current BRONZE price |
| --- | --- | --- |
| `start` | Once when the run starts | `$0.005` |
| `page-profiled` | Once per successfully profiled page | `$0.00021819` |

The values are deliberately low because this is a lightweight HTTP utility with no browser and no residential proxy requirement.

Free-plan estimate: a two-page test run costs about `$0.0054` before platform tier differences.

Larger 100-page validation runs are still expected to be low-cost because the actor only downloads HTML and parses it in memory.

### Cost expectations

The actor does not use a browser.

It does not use residential proxies by default.

Expected memory is 512 MB.

Small runs should complete quickly and produce one item per supplied URL.

Real-world cost depends on target page size and network latency.

### Local smoke run

Install dependencies in a virtual environment:

```bash
python3 -m venv .venv
. .venv/bin/activate
pip install -r requirements.txt
```

Run syntax validation:

```bash
python -m compileall src
```

Run the actor locally with Apify CLI:

```bash
timeout 120 apify run
```

A sample input is committed at `storage/key_value_stores/default/INPUT.json`.

### Docker image

The Dockerfile uses the official Apify Python base image:

```dockerfile
FROM apify/actor-python:3.12
```

Dependencies are pinned in `requirements.txt`.

This keeps the pilot reproducible and avoids floating Python/Crawlee versions.

### Python and Crawlee details

The actor uses:

- `apify==3.4.0`
- `crawlee[beautifulsoup]==1.7.1`
- `beautifulsoup4==4.13.4`
- `lxml==5.4.0`

The crawler class is `BeautifulSoupCrawler`.

The handler receives a `BeautifulSoupCrawlingContext` and reads `context.soup`.

### How it works

The actor normalizes input URLs.

It charges the start event.

It creates a bounded BeautifulSoup crawler with `max_requests_per_crawl` set from `maxPages`.

For each fetched HTML page, it extracts metadata and pushes one dataset item.

It then charges `page-profiled` for that successful item.

### JavaScript rendering

The actor does not render JavaScript.

Use it for pages where relevant data exists in the initial HTML response.

If a page is a JavaScript shell, the title and text fields may be sparse.

### Link sampling

When `includeLinks` is enabled, the actor stores up to 25 normalized links per page.

It skips empty links and `javascript:`, `mailto:`, and `tel:` URLs.

The `linkCount` field counts all usable links, not just the sampled subset.

### Integrations and workflows

You can use the dataset in:

- content inventory workflows
- lightweight documentation audits
- HTML migration checks
- QA smoke tests for public pages
- Python actor template validation runs
- dashboards that compare titles, headings, and text sizes

The actor pairs well with downstream data tools that consume Apify datasets as JSON, CSV, or via API.

### API usage

You can run the actor from Apify Console, the Apify API, Apify client libraries, cURL, or MCP.

The examples below use the public actor identifier `automation-lab/python-crawlee-page-profiler`.

### Node.js API example

```javascript
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: process.env.APIFY_TOKEN });

const run = await client.actor('automation-lab/python-crawlee-page-profiler').call({
  startUrls: [
    { url: 'https://crawlee.dev/python/' },
    { url: 'https://docs.apify.com/sdk/python/' },
  ],
  maxPages: 2,
  includeLinks: true,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);
```

### Python API example

```python
from apify_client import ApifyClient

client = ApifyClient("<APIFY_TOKEN>")
run = client.actor("automation-lab/python-crawlee-page-profiler").call(run_input={
    "startUrls": [{"url": "https://crawlee.dev/python/"}],
    "maxPages": 1,
    "includeLinks": True,
})

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)
```

### cURL example

```bash
curl -X POST "https://api.apify.com/v2/acts/automation-lab~python-crawlee-page-profiler/runs?token=$APIFY_TOKEN" \
  -H 'Content-Type: application/json' \
  -d '{"startUrls":[{"url":"https://crawlee.dev/python/"}],"maxPages":1,"includeLinks":true}'
```

### MCP usage

Apify MCP can expose this actor as a tool to compatible clients.

Use the current hosted MCP endpoint pattern:

```text
https://mcp.apify.com?tools=automation-lab/python-crawlee-page-profiler
```

Add it from a CLI-based client with:

```bash
claude mcp add apify "https://mcp.apify.com?tools=automation-lab/python-crawlee-page-profiler"
```

Or use a JSON config block:

```json
{
  "mcpServers": {
    "apify": {
      "url": "https://mcp.apify.com?tools=automation-lab/python-crawlee-page-profiler",
      "headers": {
        "Authorization": "Bearer YOUR_APIFY_TOKEN"
      }
    }
  }
}
```

Example prompts after connecting MCP:

- "Profile the Crawlee Python docs and summarize the title, heading counts, and sampled links."
- "Run Python Crawlee Page Profiler on these two documentation URLs with maxPages set to 2."
- "Compare word counts and meta descriptions for two public HTML pages."

### MCP client configuration

For Claude Desktop or Claude Code style clients, add the hosted Apify MCP endpoint with this pattern:

```bash
claude mcp add apify "https://mcp.apify.com?tools=automation-lab/python-crawlee-page-profiler"
```

If your MCP client accepts JSON server configuration, add:

```json
{
  "mcpServers": {
    "apify": {
      "url": "https://mcp.apify.com?tools=automation-lab/python-crawlee-page-profiler",
      "headers": {
        "Authorization": "Bearer YOUR_APIFY_TOKEN"
      }
    }
  }
}
```

### Data quality notes

The actor reports counts from parsed HTML.

Word counts are approximate and based on BeautifulSoup text extraction.

Links are normalized with the page URL as base.

Only the first 25 links are included when `includeLinks` is enabled.

Sites may serve different HTML to different geographies or user agents.

### Limits

The input schema caps `maxPages` at 100.

This keeps the pilot bounded and makes cost/memory behavior easy to evaluate.

If a future Python template is approved, production actors can choose limits appropriate to their target use case.

### Troubleshooting

If the output is empty, check that your input URLs are valid.

If a page needs JavaScript rendering, this HTTP-only pilot will not see rendered content.

If a site blocks datacenter requests, try a simple public URL for the pilot run.

If `links` is empty, confirm `includeLinks` is set to `true` and the page contains anchors with `href` attributes.

### Legality

Only crawl URLs you are allowed to access.

Respect website terms, robots policies, and applicable laws.

This actor is designed for small, transparent metadata profiling.

### Related scrapers and actors

Other `automation-lab` actors that may fit adjacent workflows:

- [Substack Scraper](https://apify.com/automation-lab/substack-scraper) for newsletter and post metadata.
- [LinkedIn Company Scraper](https://apify.com/automation-lab/linkedin-company-scraper) for company profile enrichment.
- [YouTube Channel Email Scraper](https://apify.com/automation-lab/youtube-channel-email-scraper) for creator discovery workflows.

This pilot is intentionally narrower: it profiles supplied HTML pages rather than scraping a specific platform.

### FAQ

#### Does this actor render JavaScript?

No. It uses `BeautifulSoupCrawler` and parses server responses.

#### Does it follow links?

No. It profiles only supplied URLs in this pilot version.

#### Can it process PDFs?

No. This pilot profiles HTML documents. Future Python actors may target PDF or file-analysis libraries.

#### Is this a production Python template?

Not by itself. The pilot recommendation is to promote Python/Crawlee only as a selective template after QA validates build, pricing, memory, schema, and maintainability.

### Template recommendation

Recommendation: promote Python/Crawlee only as a selective pilot template, not as the default actor scaffolding.

Python is promising for library-heavy utilities such as document processing, file analysis, NLP-lite extraction, and data validation.

TypeScript should remain the default for broad web scraping until more Python utility actors validate maintenance, pricing, and Store-readiness at normal standards.

### Publication status

This actor remains unpublished until normal QA and publisher flow approves it.

The pilot can be evaluated technically without being promoted to the public Store.

### Changelog

- 0.1: Initial Python/Crawlee pilot actor.

### Internal QA checklist

- Python syntax validation passes.
- Local Apify run produces dataset items.
- Dataset items match `.actor/dataset_schema.json`.
- Dockerfile uses pinned official Apify Python base image.
- `dev-precheck.mjs` can distinguish Python actors from TypeScript actors.
- The actor remains unpublished until normal QA/publisher flow approves it.

# Actor input Schema

## `startUrls` (type: `array`):

HTML document URLs to profile. Keep test runs small; the pilot intentionally does not deep-crawl by default.

## `maxPages` (type: `integer`):

Maximum number of pages to process from the supplied URLs.

## `includeLinks` (type: `boolean`):

Include up to 25 normalized href values from each page in the output.

## Actor input object example

```json
{
  "startUrls": [
    {
      "url": "https://crawlee.dev/python/"
    }
  ],
  "maxPages": 3,
  "includeLinks": true
}
```

# Actor output Schema

## `overview` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "startUrls": [
        {
            "url": "https://crawlee.dev/python/"
        }
    ],
    "maxPages": 3,
    "includeLinks": true
};

// Run the Actor and wait for it to finish
const run = await client.actor("automation-lab/python-crawlee-page-profiler").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "startUrls": [{ "url": "https://crawlee.dev/python/" }],
    "maxPages": 3,
    "includeLinks": True,
}

# Run the Actor and wait for it to finish
run = client.actor("automation-lab/python-crawlee-page-profiler").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "startUrls": [
    {
      "url": "https://crawlee.dev/python/"
    }
  ],
  "maxPages": 3,
  "includeLinks": true
}' |
apify call automation-lab/python-crawlee-page-profiler --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=automation-lab/python-crawlee-page-profiler",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Python Crawlee Page Profiler",
        "description": "Pilot Python/Crawlee actor that profiles HTML documents and extracts page metadata, headings, links, and text statistics from supplied URLs.",
        "version": "0.1",
        "x-build-id": "apltunXtkMI2RV9oB"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/automation-lab~python-crawlee-page-profiler/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-automation-lab-python-crawlee-page-profiler",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/automation-lab~python-crawlee-page-profiler/runs": {
            "post": {
                "operationId": "runs-sync-automation-lab-python-crawlee-page-profiler",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/automation-lab~python-crawlee-page-profiler/run-sync": {
            "post": {
                "operationId": "run-sync-automation-lab-python-crawlee-page-profiler",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "startUrls"
                ],
                "properties": {
                    "startUrls": {
                        "title": "Start URLs",
                        "type": "array",
                        "description": "HTML document URLs to profile. Keep test runs small; the pilot intentionally does not deep-crawl by default.",
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    },
                    "maxPages": {
                        "title": "Maximum pages",
                        "minimum": 1,
                        "maximum": 100,
                        "type": "integer",
                        "description": "Maximum number of pages to process from the supplied URLs.",
                        "default": 5
                    },
                    "includeLinks": {
                        "title": "Include sampled links",
                        "type": "boolean",
                        "description": "Include up to 25 normalized href values from each page in the output.",
                        "default": true
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```