# SAM.gov Expiring Registration Monitor (`lead.gen.labs/sam-gov-expiring-registration-monitor`) Actor

Build targeted government supplier intelligence datasets from open.gsa.gov. Extract company, UEI, registration expiration, NAICS, location for outreach, research, and opportunity discovery.

- **URL**: https://apify.com/lead.gen.labs/sam-gov-expiring-registration-monitor.md
- **Developed by:** [LeadGen Labs](https://apify.com/lead.gen.labs) (community)
- **Categories:** Lead generation, Automation
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $19.00 / 1,000 record scrapeds

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

### What does SAM.gov Expiring Registration Monitor do?
**SAM.gov Expiring Registration Monitor** helps you build actionable lead datasets from public sources. Build targeted government supplier intelligence datasets from open.gsa.gov. Extract company, UEI, registration expiration, NAICS, location for outreach, research, and opportunity discovery.
Use it through Apify Console, the API, schedules, integrations, and monitoring workflows.

Default public sources:
- [https://open.gsa.gov/api/entity-api/](https://open.gsa.gov/api/entity-api/)

### Why use SAM.gov Expiring Registration Monitor?
This Actor is designed for Compliance consultants; proposal firms; government sales agencies. It turns public source pages into normalized
records that are easier to export, compare, and enrich. The generated first pass includes bounded
same-domain discovery, retry logic, optional proxy support, public email extraction, phone-number
extraction, and Apify pay-per-event charging.

### How to use SAM.gov Expiring Registration Monitor
1. Open the **Input** tab.
2. Review the default start URLs and replace or extend them when needed.
3. Set a small profile limit for your first run.
4. Start the Actor and inspect the dataset.
5. Increase the limits after confirming the source pages you want to collect.

### Input
| Field | Description |
| --- | --- |
| `start_urls` | Public source URLs where the crawl begins. |
| `max_pages` | Maximum fetched pages. |
| `max_profiles` | Maximum normalized saved profiles. |
| `max_depth` | Same-domain link-discovery depth. |
| `extract_emails` | Enables distinct public email extraction. |
| `request_delay_millis` | Delay between requests. |
| `custom_proxy_url` | Optional custom proxy URL. |

### Output
```json
{
  "title": "Example public profile",
  "emails": ["hello@example.com"],
  "phone_numbers": ["12025550123"],
  "source_url": "https://open.gsa.gov/api/entity-api/",
  "actor_concept": "SAM.gov Expiring Registration Monitor"
}
````

You can download the dataset in formats such as JSON, HTML, CSV, or Excel.

### Data table

| Field | Description |
| --- | --- |
| `title` | Page or profile title. |
| `description` | Public page summary when available. |
| `emails` | Distinct public emails found on the fetched page. |
| `phone_numbers` | Plausible public phone numbers found on the fetched page. |
| `source_url` | Public source URL. |
| `company` | Public company signal when available |
| `UEI` | Public UEI signal when available |
| `registration_expiration` | Public registration expiration signal when available |
| `NAICS` | Public NAICS signal when available |
| `location` | Public location signal when available |
| `entity_URL` | Public entity URL signal when available |
| `change_detected_date` | Public change detected date signal when available |

### Pricing / Cost estimation

The recommended starting point is **$0.03 per monitored entity** for saved profiles or monitored
signals. Public emails are charged separately when found. Configure the synthetic Actor-start event
in Apify Publication settings rather than charging it manually in code.

### Tips and advanced options

Start with a low page limit and a request delay. Some websites change layout or apply automated
traffic controls. Add a permitted custom proxy only when needed. Review the dataset after the first
run before increasing limits.

### FAQ, disclaimers, and support

#### Is this Actor production-validated?

This generated Actor is a runnable first pass. Its current hardening stage is:
`official-api-adapter-and-live-validation-required`. Validate the target source selectors, policies, and output quality before
publishing it as a Store product.

#### Does it extract emails?

Yes. It extracts distinct public email addresses exposed on fetched pages when
`extract_emails` is enabled.

#### Is scraping legal?

Only collect public data you are permitted to use. Follow the target website's terms, applicable
laws, and reasonable request limits. Do not collect sensitive or restricted personal information.

#### Can I request a custom integration?

For Custom Solutions and Integrations - reach out to me lead.gen.labs.1@gmail.com

# Actor input Schema

## `start_urls` (type: `array`):

Public source URLs to crawl. Replace or extend the defaults for your target market.

## `max_pages` (type: `integer`):

Maximum number of public pages to fetch.

## `max_profiles` (type: `integer`):

Maximum number of normalized public profiles to save.

## `max_depth` (type: `integer`):

Maximum same-domain discovery depth from each start URL.

## `extract_emails` (type: `boolean`):

Extract distinct public email addresses found on fetched pages.

## `request_delay_millis` (type: `integer`):

Delay between requests in milliseconds.

## `custom_proxy_url` (type: `string`):

Optional proxy URL. Leave empty unless the public source requires it.

## Actor input object example

```json
{
  "start_urls": [
    {
      "url": "https://open.gsa.gov/api/entity-api/"
    }
  ],
  "max_pages": 10,
  "max_profiles": 10,
  "max_depth": 1,
  "extract_emails": true,
  "request_delay_millis": 500,
  "custom_proxy_url": ""
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "start_urls": [
        {
            "url": "https://open.gsa.gov/api/entity-api/"
        }
    ]
};

// Run the Actor and wait for it to finish
const run = await client.actor("lead.gen.labs/sam-gov-expiring-registration-monitor").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "start_urls": [{ "url": "https://open.gsa.gov/api/entity-api/" }] }

# Run the Actor and wait for it to finish
run = client.actor("lead.gen.labs/sam-gov-expiring-registration-monitor").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "start_urls": [
    {
      "url": "https://open.gsa.gov/api/entity-api/"
    }
  ]
}' |
apify call lead.gen.labs/sam-gov-expiring-registration-monitor --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=lead.gen.labs/sam-gov-expiring-registration-monitor",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "SAM.gov Expiring Registration Monitor",
        "description": "Build targeted government supplier intelligence datasets from open.gsa.gov. Extract company, UEI, registration expiration, NAICS, location for outreach, research, and opportunity discovery.",
        "version": "0.0",
        "x-build-id": "qTuc0hktqjsp1oIXG"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/lead.gen.labs~sam-gov-expiring-registration-monitor/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-lead.gen.labs-sam-gov-expiring-registration-monitor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/lead.gen.labs~sam-gov-expiring-registration-monitor/runs": {
            "post": {
                "operationId": "runs-sync-lead.gen.labs-sam-gov-expiring-registration-monitor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/lead.gen.labs~sam-gov-expiring-registration-monitor/run-sync": {
            "post": {
                "operationId": "run-sync-lead.gen.labs-sam-gov-expiring-registration-monitor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "start_urls",
                    "max_pages",
                    "max_profiles"
                ],
                "properties": {
                    "start_urls": {
                        "title": "Start URLs",
                        "type": "array",
                        "description": "Public source URLs to crawl. Replace or extend the defaults for your target market.",
                        "default": [
                            {
                                "url": "https://open.gsa.gov/api/entity-api/"
                            }
                        ],
                        "items": {
                            "type": "object",
                            "required": [
                                "url"
                            ],
                            "properties": {
                                "url": {
                                    "type": "string",
                                    "title": "URL of a web page",
                                    "format": "uri"
                                }
                            }
                        }
                    },
                    "max_pages": {
                        "title": "Maximum Pages",
                        "minimum": 1,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Maximum number of public pages to fetch.",
                        "default": 10
                    },
                    "max_profiles": {
                        "title": "Maximum Profiles",
                        "minimum": 1,
                        "maximum": 10000,
                        "type": "integer",
                        "description": "Maximum number of normalized public profiles to save.",
                        "default": 10
                    },
                    "max_depth": {
                        "title": "Maximum Link Depth",
                        "minimum": 0,
                        "maximum": 5,
                        "type": "integer",
                        "description": "Maximum same-domain discovery depth from each start URL.",
                        "default": 1
                    },
                    "extract_emails": {
                        "title": "Extract Public Emails",
                        "type": "boolean",
                        "description": "Extract distinct public email addresses found on fetched pages.",
                        "default": true
                    },
                    "request_delay_millis": {
                        "title": "Request Delay",
                        "minimum": 0,
                        "maximum": 30000,
                        "type": "integer",
                        "description": "Delay between requests in milliseconds.",
                        "default": 500
                    },
                    "custom_proxy_url": {
                        "title": "Custom Proxy URL",
                        "type": "string",
                        "description": "Optional proxy URL. Leave empty unless the public source requires it.",
                        "default": ""
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
