# Built In Jobs Scraper — US Tech & Startup Jobs (`nomad-agent/builtin-scraper`) Actor

Extract live US tech vacancies from Built In (builtin.com): software, data, product, design and sales roles across startup hubs. Records include title, company, location, remote flag, salary band and apply URL. A staple source for US tech-hiring data.

- **URL**: https://apify.com/nomad-agent/builtin-scraper.md
- **Developed by:** [Nomad.Dev](https://apify.com/nomad-agent) (community)
- **Categories:** Jobs
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

Pay per usage

This Actor is billed by platform usage. The Actor itself is free to use; you pay only for the Apify platform resources it consumes, and those rates drop on higher subscription plans.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-usage

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action that can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server that can be used as a website, API, or MCP server.
Actors are written with a capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows (PowerShell)
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Built In Jobs Scraper — US Tech & Startup Jobs

Scrape current US tech and startup openings from Built In, including remote flags and salary bands where posted.

### What Built In jobs data does this scraper extract?

Each result is one flat JSON record per job posting:

| Field | Meaning |
|---|---|
| `id` | Stable source-side identifier |
| `jobId` | Same value as `id` — kept for backward compatibility |
| `title` | Job title as posted |
| `company` | Hiring company / organisation (`null` if not shown on the card) |
| `location` | Location / duty station (may include remote hints) |
| `url` | Direct link to the posting |
| `postedAt` | Absolute UTC ISO 8601 timestamp, parsed from the source's relative age badge (e.g. "3 Days Ago"), `null` if the badge is absent or unparseable |
| `salary` | Salary text where the source provides it, otherwise `null` |
| `snippet` | Short description excerpt from the listing card, otherwise `null` |
| `description` | Full job-description text from the detail page (schema.org JobPosting JSON-LD, with a class-matched HTML fallback). Only populated when `includeDescription: true`; `null` otherwise |
| `source` | Always `"builtin"` |

Missing/absent fields are always `null` — never an empty string.
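
The `postedAt` conversion described in the table can be pictured with a short sketch. This is an illustrative reimplementation, not the Actor's own code: the function name `parse_age_badge`, the regular expression, and the set of supported units are assumptions.

```python
import re
from datetime import datetime, timedelta, timezone
from typing import Optional

# Seconds per unit for badges such as "3 Days Ago" (the unit list is an assumption).
_UNIT_SECONDS = {"minute": 60, "hour": 3600, "day": 86400, "week": 604800}
_BADGE = re.compile(r"(\d+)\s+(minute|hour|day|week)s?\s+ago", re.IGNORECASE)


def parse_age_badge(badge: Optional[str], now: datetime) -> Optional[str]:
    """Return an ISO 8601 UTC timestamp for a relative badge, or None if unparseable."""
    if not badge:
        return None
    match = _BADGE.search(badge)
    if match is None:
        return None
    amount, unit = int(match.group(1)), match.group(2).lower()
    posted = now.astimezone(timezone.utc) - timedelta(seconds=amount * _UNIT_SECONDS[unit])
    return posted.strftime("%Y-%m-%dT%H:%M:%SZ")


now = datetime(2026, 7, 3, 12, 0, tzinfo=timezone.utc)
print(parse_age_badge("3 Days Ago", now))  # 2026-06-30T12:00:00Z
print(parse_age_badge("Reposted", now))    # None
```

Because the timestamp is derived from a relative badge, treat it as approximate: its precision can be no finer than the unit shown on the badge.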

### How to scrape Built In jobs with this Actor

1. Click **Try for free** / **Run** — no login to the target site, no cookies, no proxies to configure.
2. Pick job categories (or leave empty for a curated set of remote-friendly tech categories) and adjust `maxItems`.
3. Run it and export the dataset as JSON, CSV or Excel, or read it over the [API](https://docs.apify.com/api/v2).

Run it from your own code:

```python
from apify_client import ApifyClient

client = ApifyClient("<YOUR_APIFY_TOKEN>")
run = client.actor("nomad-agent/builtin-scraper").call(run_input={"maxItems": 50})
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item["title"], "—", item["company"], item["url"])
````

Or a single HTTP call that runs the Actor and returns items in one response:

```bash
curl -X POST \
  "https://api.apify.com/v2/acts/nomad-agent~builtin-scraper/run-sync-get-dataset-items?token=<YOUR_APIFY_TOKEN>" \
  -H "Content-Type: application/json" \
  -d '{"maxItems": 50}'
```

### Input

| Field | Type | Default | Notes |
|---|---|---|---|
| `categories` | array (multi-select) | *(empty)* | Friendly Built In job categories to scrape (Software Engineering, DevOps/SysAdmin, Data & Analytics, Data Science, Product Management, Design, Marketing, Sales, Operations, HR). Leave empty to use a curated set of remote-friendly tech categories. Merges with `categoryPages` if both are set. |
| `maxItems` | integer | `50` | Maximum number of listings to return across all category pages. Set to 0 for no limit. |
| `maxPagesPerCategory` | integer | `1` | How many pagination pages to fetch per category URL (allowed range: 1 to 5). Page 1 is always fetched; subsequent pages use the `?page=N` parameter. |
| `includeDescription` | boolean | `true` | Fetch the individual job detail page and populate `description` with the full JD body. Turn off for faster, lighter runs — `description` stays `null` and `snippet` is unaffected either way. |
| `categoryPages` *(Advanced)* | array | *(empty)* | Legacy/low-level form: raw Built In category page paths or full URLs (e.g. `/jobs/remote/dev-engineering/front-end`). Prefer `categories` for common cases; use this for one-off or niche categories not covered by the enum. Merges with `categories`, doesn't replace it. |
| `cacheTtlSeconds` *(Advanced)* | integer | `1800` | Reuse a page fetched this many seconds ago instead of hitting builtin.com again on rapid re-runs. Set 0 to always fetch live. |
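
Checking values locally before starting a run can catch typos early. The sketch below mirrors the constraints published in this Actor's input schema (the `categories` enum values, `maxItems` of 0 or more, `maxPagesPerCategory` between 1 and 5). The helper name `build_run_input` is hypothetical and not part of any Apify library.

```python
from typing import Any, Dict, List, Optional

# Enum values copied from the `categories` field of the Actor's input schema.
ALLOWED_CATEGORIES = frozenset({
    "dev-engineering", "dev-engineering/devops", "data-analytics",
    "data-analytics/data-science", "product", "design-ux",
    "marketing", "sales", "operations", "hr",
})


def build_run_input(
    categories: Optional[List[str]] = None,
    max_items: int = 50,
    max_pages_per_category: int = 1,
    include_description: bool = True,
    cache_ttl_seconds: int = 1800,
) -> Dict[str, Any]:
    """Validate values locally and return a dict usable as the Actor's run input."""
    chosen = list(categories or [])
    unknown = sorted(set(chosen) - ALLOWED_CATEGORIES)
    if unknown:
        raise ValueError(f"unknown categories: {unknown}")
    if max_items < 0:
        raise ValueError("maxItems must be 0 (no limit) or a positive integer")
    if not 1 <= max_pages_per_category <= 5:
        raise ValueError("maxPagesPerCategory must be between 1 and 5")
    if cache_ttl_seconds < 0:
        raise ValueError("cacheTtlSeconds must be 0 or greater")
    return {
        "categories": chosen,
        "maxItems": max_items,
        "maxPagesPerCategory": max_pages_per_category,
        "includeDescription": include_description,
        "cacheTtlSeconds": cache_ttl_seconds,
    }


run_input = build_run_input(categories=["data-analytics", "product"], max_items=100)
print(run_input)
```

The returned dict can be passed as `run_input` to the Python client's `call()` method.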

### Output example

```json
{
  "jobId": "2237714",
  "id": "2237714",
  "title": "Senior Data Engineer",
  "company": "Motive",
  "location": "Remote · United States",
  "url": "https://builtin.com/job/senior-data-engineer/2237714",
  "postedAt": "2026-06-30T12:00:00Z",
  "salary": "$150,000–$180,000",
  "snippet": "Motive is hiring a Senior Data Engineer...",
  "description": "Motive is hiring a Senior Data Engineer to build the data platform powering our fleet analytics... (full JD, up to 5000 chars)",
  "source": "builtin"
}
```

`id` and `jobId` always carry the same value — `id` is the standardized alias, `jobId` is kept so existing integrations don't break.
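
Since both fields carry the same identifier, either one can serve as a deduplication key when merging results from several runs. A minimal sketch, assuming items are plain dicts as returned by the dataset API; the helper name `dedupe_jobs` is hypothetical:

```python
from typing import Any, Dict, Iterable, List


def dedupe_jobs(items: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Keep the first occurrence of each posting, keyed on `id` (falling back to `jobId`)."""
    seen = set()
    unique = []
    for item in items:
        key = item.get("id") or item.get("jobId")
        if key is not None:
            if key in seen:
                continue  # already collected in an earlier run or category
            seen.add(key)
        unique.append(item)  # items without any identifier are kept as-is
    return unique


first_run = [{"id": "2237714", "jobId": "2237714", "title": "Senior Data Engineer"}]
second_run = [
    {"jobId": "2237714", "title": "Senior Data Engineer"},  # older integration shape
    {"id": "2240001", "jobId": "2240001", "title": "Product Designer"},
]
print([job["title"] for job in dedupe_jobs(first_run + second_run)])
# ['Senior Data Engineer', 'Product Designer']
```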

### Integrations

Export the dataset as JSON, CSV or Excel from the Console, pull it over the [Apify API](https://docs.apify.com/api/v2) (including `run-sync-get-dataset-items` for a single blocking call), wire it into Make/Zapier/n8n, or drive it from an AI agent via the [Apify MCP server](https://apify.com/apify/actors-mcp-server).

### Pricing

Pay per event: **$0.05 per Actor start** and **$0.004 per job returned**.
100 jobs ≈ $0.45. No subscription, no rental — you pay only for what you fetch.
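
The arithmetic behind that estimate, as a small sketch. It assumes the two event prices quoted above are the only charges and uses `Decimal` to avoid floating-point rounding; the function name is hypothetical.

```python
from decimal import Decimal

START_FEE_USD = Decimal("0.05")     # per Actor start, as quoted above
PER_JOB_FEE_USD = Decimal("0.004")  # per job returned, as quoted above


def estimate_cost_usd(jobs_returned: int, starts: int = 1) -> Decimal:
    """Estimate the pay-per-event charge for a number of runs and returned jobs."""
    if jobs_returned < 0 or starts < 0:
        raise ValueError("counts must be non-negative")
    return START_FEE_USD * starts + PER_JOB_FEE_USD * jobs_returned


print(estimate_cost_usd(100))  # 0.450
```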

### Use cases

- US tech job boards and alert bots
- Startup-hiring market research
- Sourcing pipelines for US tech roles
- Salary benchmarking by city

### FAQ

**Is it legal to scrape Built In jobs?**
This Actor reads only publicly available job postings — data any visitor can see without logging in. No personal data behind authentication is touched. Review the target site's terms and your local regulations for your specific use case.

**Do I need an account on the target site?**
No. Postings are fetched from public pages/APIs — no login, cookies or session tokens.

**How fresh is the data?**
Each run fetches live listings unless the same page was fetched within the last `cacheTtlSeconds` seconds (default 1800, i.e. 30 minutes), in which case the cached page is reused. Set it to 0 to always hit the source live.
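
To make the TTL semantics concrete, here is a minimal sketch of a time-based page cache. It only illustrates the documented behaviour (reuse within the TTL, always fetch when the TTL is 0); the `PageCache` class and its injected `fetch` callable are hypothetical and say nothing about how the Actor actually stores pages.

```python
import time
from typing import Callable, Dict, Tuple


class PageCache:
    """Reuse a fetched page while it is younger than ttl_seconds; 0 disables reuse."""

    def __init__(self, ttl_seconds: float, clock: Callable[[], float] = time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: Dict[str, Tuple[float, str]] = {}

    def get(self, url: str, fetch: Callable[[str], str]) -> str:
        now = self._clock()
        cached = self._entries.get(url)
        if self._ttl > 0 and cached is not None and now - cached[0] < self._ttl:
            return cached[1]  # still fresh: skip the network call
        body = fetch(url)     # stale, missing, or caching disabled: fetch live
        self._entries[url] = (now, body)
        return body


calls = []

def fake_fetch(url: str) -> str:
    calls.append(url)
    return f"<html>{len(calls)}</html>"

cache = PageCache(ttl_seconds=1800)
cache.get("https://builtin.com/jobs/remote/dev-engineering", fake_fetch)
cache.get("https://builtin.com/jobs/remote/dev-engineering", fake_fetch)
print(len(calls))  # 1
```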

**How many jobs can I get?**
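`maxItems` caps the run; set it to 0 for no cap. The total is also bounded by the categories selected and by `maxPagesPerCategory` (1 to 5 pages per category). Most sources paginate from newest to oldest.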

**What's the difference between `categories` and `categoryPages`?**
`categories` is a friendly multi-select of common Built In job categories — pick from a list, no need to know the site's internal URL structure. `categoryPages` (Advanced) is the raw underlying mechanism: literal Built In category page paths or URLs, for categories not covered by the friendly list. Both feed the same fetch and can be combined.
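
One way to picture how the two inputs combine is the sketch below. It assumes that each `categories` value maps to a path under `/jobs/remote/` (as the schema description suggests) and that pages after the first append `?page=N`. The function name and the exact URL layout are assumptions, not the Actor's confirmed implementation.

```python
from typing import Iterable, List
from urllib.parse import urlsplit

BASE_URL = "https://builtin.com"
REMOTE_PREFIX = "/jobs/remote/"  # assumption based on the schema description


def category_urls(
    categories: Iterable[str],
    category_pages: Iterable[str],
    max_pages: int = 1,
) -> List[str]:
    """Merge both inputs into a de-duplicated, ordered list of page URLs."""
    paths = [REMOTE_PREFIX + slug.strip("/") for slug in categories]
    for page in category_pages:
        # Accept either a raw path or a full URL; keep only the path part.
        path = urlsplit(page).path if page.startswith("http") else page
        paths.append("/" + path.strip("/"))
    urls = []
    for path in dict.fromkeys(paths):  # preserves order, drops duplicates
        for page_no in range(1, max_pages + 1):
            suffix = "" if page_no == 1 else f"?page={page_no}"
            urls.append(f"{BASE_URL}{path}{suffix}")
    return urls


for url in category_urls(["dev-engineering"], ["/jobs/remote/dev-engineering/front-end"], max_pages=2):
    print(url)
```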

**Something broken or missing?**
Open an issue on the Actor's **Issues** tab — it is monitored and reliability fixes ship fast.

### Related Actors

- [Web Developer Jobs Scraper — 10 Boards in One](https://apify.com/nomad-agent/web-dev-bundle)
- [AI & ML Engineer Jobs Scraper — 8 Boards in One](https://apify.com/nomad-agent/ml-ai-dev-bundle)
- [Y Combinator Jobs Scraper — Work at a Startup](https://apify.com/nomad-agent/ycombinator-was-scraper)
- [LinkedIn Jobs Scraper — No Login, No Cookies](https://apify.com/nomad-agent/linkedin-scraper)

# Actor input Schema

## `categories` (type: `array`):

Friendly Built In job categories to scrape (maps to the site's own category pages under /jobs/remote/...). Leave empty to use a curated set of remote-friendly tech categories. Merges with "Category page paths" (Advanced) if both are set.

## `maxItems` (type: `integer`):

Maximum number of listings to return across all category pages. Set to 0 for no limit.

## `maxPagesPerCategory` (type: `integer`):

How many pagination pages to fetch per category URL. Page 1 is always fetched; subsequent pages use the ?page=N parameter. Set to 1 to fetch only the first page of each category.

## `includeDescription` (type: `boolean`):

Fetch the individual job detail page to retrieve the full listing description. Produces richer output but requires one additional request per result. Turn off for faster, lighter runs.

## `categoryPages` (type: `array`):

Advanced / low-level form of category selection: raw Built In category page paths or full URLs to scrape (e.g. "/jobs/remote/dev-engineering/front-end"). Prefer the friendly "Job categories" field above for common categories; use this for one-off or niche categories not covered there. Values here are merged with (not replaced by) "Job categories". Leave both empty to use a curated set of remote-friendly tech categories.

## `cacheTtlSeconds` (type: `integer`):

Reuse a page fetched this many seconds ago instead of hitting builtin.com again on rapid re-runs. Set 0 to always fetch live.

## Actor input object example

```json
{
  "maxItems": 50,
  "maxPagesPerCategory": 1,
  "includeDescription": true,
  "categoryPages": [
    "/jobs/remote/dev-engineering/front-end",
    "/jobs/remote/data-analytics/machine-learning"
  ],
  "cacheTtlSeconds": 1800
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("nomad-agent/builtin-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("nomad-agent/builtin-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call nomad-agent/builtin-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=nomad-agent/builtin-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Built In Jobs Scraper — US Tech & Startup Jobs",
        "description": "Extract live US tech vacancies from Built In (builtin.com): software, data, product, design and sales roles across startup hubs. Records include title, company, location, remote flag, salary band and apply URL. A staple source for US tech-hiring data.",
        "version": "0.1",
        "x-build-id": "W6xZxSnmx8vF1Ngy6"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/nomad-agent~builtin-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-nomad-agent-builtin-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/nomad-agent~builtin-scraper/runs": {
            "post": {
                "operationId": "runs-sync-nomad-agent-builtin-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/nomad-agent~builtin-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-nomad-agent-builtin-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "categories": {
                        "title": "Job categories",
                        "type": "array",
                        "description": "Friendly Built In job categories to scrape (maps to the site's own category pages under /jobs/remote/...). Leave empty to use a curated set of remote-friendly tech categories. Merges with \"Category page paths\" (Advanced) if both are set.",
                        "items": {
                            "type": "string",
                            "enum": [
                                "dev-engineering",
                                "dev-engineering/devops",
                                "data-analytics",
                                "data-analytics/data-science",
                                "product",
                                "design-ux",
                                "marketing",
                                "sales",
                                "operations",
                                "hr"
                            ],
                            "enumTitles": [
                                "Software Engineering (Dev + Engineering)",
                                "DevOps / SysAdmin",
                                "Data & Analytics",
                                "Data Science",
                                "Product Management",
                                "Design (UX/UI)",
                                "Marketing",
                                "Sales",
                                "Operations",
                                "HR / People"
                            ]
                        }
                    },
                    "maxItems": {
                        "title": "Max items",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Maximum number of listings to return across all category pages. Set to 0 for no limit.",
                        "default": 50
                    },
                    "maxPagesPerCategory": {
                        "title": "Max pages per category",
                        "minimum": 1,
                        "maximum": 5,
                        "type": "integer",
                        "description": "How many pagination pages to fetch per category URL. Page 1 is always fetched; subsequent pages use the ?page=N parameter. Set to 1 to fetch only the first page of each category.",
                        "default": 1
                    },
                    "includeDescription": {
                        "title": "Include full description",
                        "type": "boolean",
                        "description": "Fetch the individual job detail page to retrieve the full listing description. Produces richer output but requires one additional request per result. Turn off for faster, lighter runs.",
                        "default": true
                    },
                    "categoryPages": {
                        "title": "Category page paths (legacy)",
                        "type": "array",
                        "description": "Advanced / low-level form of category selection: raw Built In category page paths or full URLs to scrape (e.g. \"/jobs/remote/dev-engineering/front-end\"). Prefer the friendly \"Job categories\" field above for common categories; use this for one-off or niche categories not covered there. Values here are merged with (not replaced by) \"Job categories\". Leave both empty to use a curated set of remote-friendly tech categories.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "cacheTtlSeconds": {
                        "title": "Cache TTL (seconds)",
                        "minimum": 0,
                        "type": "integer",
                        "description": "Reuse a page fetched this many seconds ago instead of hitting builtin.com again on rapid re-runs. Set 0 to always fetch live.",
                        "default": 1800
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
