# Built In Job Scraper (`nomad-dev/builtin-scraper`) Actor

Collect tech and startup job listings from Built In's public category pages as clean structured data. Filter by category, cap output, and optionally include full job descriptions. No login or API key required.

- **URL**: https://apify.com/nomad-dev/builtin-scraper.md
- **Developed by:** [Nomad.Dev](https://apify.com/nomad-dev) (community)
- **Categories:** Jobs
- **Stats:** 2 total users, 0 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage, which gets cheaper the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Built In Job Scraper

Collect tech and startup job listings from [Built In](https://builtin.com) as
clean, structured data — **no login, no API key, no setup**.

Monitor hundreds of current openings across engineering, data science, DevOps,
design, and more. Export to JSON/CSV/Excel or pull via the Apify API.

### Why use it

- **Always current** — reflects live listings from the Built In platform.
- **Tech and startup focused** — curated for software engineering, data, and
  product roles at growth-stage companies.
- **Remote-ready defaults** — the default category set targets remote-eligible
  listings so you get globally-accessible roles without extra configuration.
- **Rich output** — optional full job descriptions give your pipeline everything
  it needs for AI matching, summarisation, or filtering.

### What you get

One dataset item per listing:

| Field | Description |
|---|---|
| `jobId` | Stable numeric listing id |
| `title` | Job title |
| `company` | Hiring company name |
| `location` | Location / remote badge as shown on the listing card |
| `salary` | Salary range when displayed (e.g. "130K–160K Annually") |
| `postedAt` | Relative posted time as shown (e.g. "3 Days Ago") |
| `url` | Direct link to the full listing |
| `snippet` | Card-level preview or full description (see `includeDescription`) |
| `source` | Always `builtin` |

### Input

| Field | Type | Default | Notes |
|---|---|---|---|
| `categoryPages` | array | *(curated set)* | Category page paths or full URLs to scrape. Leave empty for defaults. |
| `maxItems` | integer | `50` | Cap on total results; `0` = no limit. |
| `maxPagesPerCategory` | integer | `1` | Pagination depth per category (1–5). |
| `includeDescription` | boolean | `true` | Fetch full descriptions from detail pages. Richer output, more requests. |

#### Default categories

When `categoryPages` is empty the scraper targets these remote-friendly paths:

- `/jobs/remote/dev-engineering/front-end`
- `/jobs/remote/dev-engineering/javascript`
- `/jobs/remote/data-analytics/machine-learning`
- `/jobs/remote/dev-engineering/devops`
- `/jobs/remote/data-analytics/data-engineering`

You can override these with any Built In category path or full URL.

#### Example input

```json
{
  "categoryPages": [
    "/jobs/remote/dev-engineering/front-end",
    "/jobs/remote/data-analytics/data-engineering"
  ],
  "maxItems": 100,
  "maxPagesPerCategory": 2,
  "includeDescription": true
}
````

Returns up to 100 remote front-end and data engineering listings, each with
the full description text, pulling up to 2 pages per category.

### Use cases

- Job seekers tracking new tech openings across multiple categories in one run.
- Recruitment and talent teams sourcing candidates and benchmarking roles.
- Job-board aggregators and newsletters curating startup and tech listings.
- Researchers and analysts studying compensation, skills, and hiring trends.

# Actor input Schema

## `categoryPages` (type: `array`):

List of Built In category page paths or full URLs to scrape (e.g. "/jobs/remote/dev-engineering/front-end"). Leave empty to use a curated set of remote-friendly tech categories.

## `maxItems` (type: `integer`):

Maximum number of listings to return across all category pages. Set to 0 for no limit.

## `maxPagesPerCategory` (type: `integer`):

How many pagination pages to fetch per category URL. Page 1 is always fetched; subsequent pages use the ?page=N parameter. Set to 1 to fetch only the first page of each category.

## `includeDescription` (type: `boolean`):

Fetch the individual job detail page to retrieve the full listing description. Produces richer output but requires one additional request per result. Turn off for faster, lighter runs.

## `cacheTtlSeconds` (type: `integer`):

Cache the upstream fetch in the key-value store for this many seconds; re-runs within the window skip the network call. Set 0 to disable.

## Actor input object example

```json
{
  "categoryPages": [
    "/jobs/remote/dev-engineering/front-end",
    "/jobs/remote/data-analytics/machine-learning"
  ],
  "maxItems": 50,
  "maxPagesPerCategory": 1,
  "includeDescription": true,
  "cacheTtlSeconds": 1800
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("nomad-dev/builtin-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("nomad-dev/builtin-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call nomad-dev/builtin-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=nomad-dev/builtin-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Built In Job Scraper",
        "description": "Collect tech and startup job listings from Built In's public category pages as clean structured data. Filter by category, cap output, and optionally include full job descriptions. No login or API key required.",
        "version": "0.1",
        "x-build-id": "Hn89ga4kh0yYpnEhg"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/nomad-dev~builtin-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-nomad-dev-builtin-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/nomad-dev~builtin-scraper/runs": {
            "post": {
                "operationId": "runs-sync-nomad-dev-builtin-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/nomad-dev~builtin-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-nomad-dev-builtin-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "categoryPages": {
                        "title": "Category page paths",
                        "type": "array",
                        "description": "List of Built In category page paths or full URLs to scrape (e.g. \"/jobs/remote/dev-engineering/front-end\"). Leave empty to use a curated set of remote-friendly tech categories.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxItems": {
                        "title": "Max items",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Maximum number of listings to return across all category pages. Set to 0 for no limit.",
                        "default": 50
                    },
                    "maxPagesPerCategory": {
                        "title": "Max pages per category",
                        "minimum": 1,
                        "maximum": 5,
                        "type": "integer",
                        "description": "How many pagination pages to fetch per category URL. Page 1 is always fetched; subsequent pages use the ?page=N parameter. Set to 1 to fetch only the first page of each category.",
                        "default": 1
                    },
                    "includeDescription": {
                        "title": "Include full description",
                        "type": "boolean",
                        "description": "Fetch the individual job detail page to retrieve the full listing description. Produces richer output but requires one additional request per result. Turn off for faster, lighter runs.",
                        "default": true
                    },
                    "cacheTtlSeconds": {
                        "title": "Cache TTL (seconds)",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Cache the upstream fetch in the key-value store for this many seconds; re-runs within the window skip the network call. Set 0 to disable.",
                        "default": 1800
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
