Website Content Crawler
Pricing: from $0.50 / 1,000 results
Rating: 0.0 (0)
Developer: Virtual Footprint LLC
Maintained by Community
Actor stats: 0 bookmarked, 1 total user, 0 monthly active users
Last modified: 2 days ago
Extract web content and document-derived structured data with Website Content Crawler for knowledge workflows.
Features
- Collect structured content and metadata fields
- Support direct URL and query-driven extraction modes
- Return normalized records suitable for RAG and analytics
- Batch-friendly processing for multiple sources
- Designed for automated content pipelines
- Output is structured for downstream use in Apify workflows
Common Use Cases
- Content intelligence
- Article monitoring
- Knowledge base ingestion
- Research workflows
- Data enrichment
- Internal reporting
Example Input
{"query": "market research","queries": ["market research"],"urls": ["https://www.example.org"],"maxResults": 25,"includeRaw": false,"maxCostPerRun": 5}
Example Output
{"query": "market research","url": "https://www.example.org","actorSlug": "record value","source": "record value","title": "record value","description": "record value","scrapedAt": "record value"}
Input Parameters
| Field | Type | Required | Description |
|---|---|---|---|
| query | string | No | Primary keyword, URL, profile, company, product, or identifier to collect |
| queries | array | No | Optional batch list of query strings. Used when query is empty or when batching is… |
| urls | array | No | Optional direct URLs to process. These take priority over discovery when provided |
| maxResults | integer | No | Maximum number of dataset items to emit |
| includeRaw | boolean | No | Include collection diagnostics and raw source metadata where available |
| maxCostPerRun | number | No | Optional guardrail in USD. The actor caps output before exceeding this amount |
| proxyConfiguration | object | No | Apify proxy settings for production runs |
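As an illustration of how the fields in the table fit together, here is a minimal sketch of a helper that assembles an input object. The function name `build_input` and its validation rules are assumptions made for this example; the listing marks every field as optional and does not say how the actor handles an empty input.

```python
import json
from typing import Any, Optional


def build_input(
    query: Optional[str] = None,
    queries: Optional[list[str]] = None,
    urls: Optional[list[str]] = None,
    max_results: int = 25,
    include_raw: bool = False,
    max_cost_per_run: Optional[float] = None,
) -> dict[str, Any]:
    """Assemble an input object using the field names from the table above."""
    # Assumption: a run with no query, queries, or urls has nothing to collect,
    # so this helper rejects it. The actor itself may behave differently.
    if not (query or queries or urls):
        raise ValueError("provide at least one of query, queries, or urls")
    if max_results < 1:
        raise ValueError("max_results must be a positive integer")

    payload: dict[str, Any] = {"maxResults": max_results, "includeRaw": include_raw}
    if query:
        payload["query"] = query
    if queries:
        payload["queries"] = list(queries)
    if urls:
        payload["urls"] = list(urls)
    if max_cost_per_run is not None:
        payload["maxCostPerRun"] = max_cost_per_run
    return payload


if __name__ == "__main__":
    example = build_input(urls=["https://www.example.org"], max_cost_per_run=5)
    print(json.dumps(example, indent=2))
```

Optional fields are left out of the payload when unset, so the actor's own defaults (whatever they are) still apply.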
Output Fields
| Field | Type | Description |
|---|---|---|
| query | string | Normalized query value |
| url | string | Normalized url value |
| actorSlug | string | Normalized actorSlug value |
| source | string | Normalized source value |
| title | string | Normalized title value |
| description | string | Normalized description value |
| scrapedAt | string | Normalized scrapedAt value |
| runId | string | Normalized runId value |
| rank | string | Normalized rank value |
| content | string | Normalized content value |
| summary | string | Normalized summary value |
| author | string | Normalized author value |
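For downstream code, the output fields can be mirrored in a small typed record. The sketch below is one possible shape, not the actor's own schema: the class name `CrawlRecord` and the helper `parse_record` are hypothetical, and every field is treated as optional because the listing does not say which fields are always present.

```python
from dataclasses import dataclass, fields
from typing import Any, Optional


@dataclass
class CrawlRecord:
    # Attribute names match the output keys exactly so items map one to one.
    query: Optional[str] = None
    url: Optional[str] = None
    actorSlug: Optional[str] = None
    source: Optional[str] = None
    title: Optional[str] = None
    description: Optional[str] = None
    scrapedAt: Optional[str] = None
    runId: Optional[str] = None
    rank: Optional[str] = None
    content: Optional[str] = None
    summary: Optional[str] = None
    author: Optional[str] = None


def parse_record(item: dict[str, Any]) -> CrawlRecord:
    """Build a CrawlRecord from one dataset item, ignoring unknown keys."""
    known = {f.name for f in fields(CrawlRecord)}
    return CrawlRecord(**{k: v for k, v in item.items() if k in known})


if __name__ == "__main__":
    record = parse_record({"url": "https://www.example.org", "title": "Example"})
    print(record.url, record.title, record.content)
```

Ignoring unknown keys keeps the parser tolerant of extra fields, such as diagnostics that may appear when `includeRaw` is enabled.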
Export Formats
- JSON
- CSV
- Excel
- XML
- RSS
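Exports like these are typically fetched from the run's dataset through the Apify API, which takes a `format` query parameter on the dataset items endpoint. The following sketch only builds the download URL; the helper name `dataset_export_url` is hypothetical, and the dataset ID is a placeholder you would take from a finished run.

```python
from urllib.parse import urlencode

# Map the format names listed above to the values used by the `format`
# query parameter of the dataset items endpoint ("Excel" is xlsx).
FORMATS = {"JSON": "json", "CSV": "csv", "Excel": "xlsx", "XML": "xml", "RSS": "rss"}


def dataset_export_url(dataset_id: str, fmt: str) -> str:
    """Return the items URL for a dataset in the requested export format."""
    if fmt not in FORMATS:
        raise ValueError(f"unsupported format: {fmt!r}")
    query = urlencode({"format": FORMATS[fmt]})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"


if __name__ == "__main__":
    # "YOUR_DATASET_ID" is a placeholder, not a real dataset.
    print(dataset_export_url("YOUR_DATASET_ID", "CSV"))
```

Datasets that are not public also need an API token when the URL is requested.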
Pricing
Pricing Model: PAY_PER_EVENT
$3.00 per 1,000 dataset items.
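To make the rate concrete, the sketch below estimates what a run costs at $3.00 per 1,000 dataset items and how many items fit under a `maxCostPerRun` budget. It assumes dataset items are the only charged event and that the cap rounds down to whole items; the listing does not document the actor's exact capping logic, and the function names are hypothetical.

```python
from decimal import Decimal, ROUND_FLOOR

# Rate quoted in the Pricing section above.
PRICE_PER_1000 = Decimal("3.00")


def estimate_cost(items: int, price_per_1000: Decimal = PRICE_PER_1000) -> Decimal:
    """Cost in USD of emitting `items` dataset items."""
    return Decimal(items) * price_per_1000 / 1000


def max_items_within(budget_usd: float, price_per_1000: Decimal = PRICE_PER_1000) -> int:
    """Largest whole number of items whose cost stays within the budget."""
    items = Decimal(str(budget_usd)) * 1000 / price_per_1000
    return int(items.to_integral_value(rounding=ROUND_FLOOR))


if __name__ == "__main__":
    print(estimate_cost(25))    # cost of the Example Input's maxResults
    print(max_items_within(5))  # items that fit under maxCostPerRun = 5
```

With the Example Input values, 25 items cost $0.075, so the $5 guardrail would only come into play at 1,666 items or more.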
FAQ
Does this actor support batch processing?
Yes.
Can I export results to CSV?
Yes.
Can I schedule runs?
Yes, through Apify schedules.
Can I run this actor via API?
Yes, via the Apify API.
Does it support direct URLs?
Yes.
Can I integrate this actor with n8n or Make?
Yes.
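For the API question above, a run can be started with a single HTTP request. This is a minimal sketch using only the standard library, assuming the Apify API's synchronous `run-sync-get-dataset-items` endpoint; the actor ID and token are placeholders, since the listing does not state the actor's ID, and the helper name `build_run_request` is hypothetical.

```python
import json
import urllib.request
from typing import Any

API_BASE = "https://api.apify.com/v2"


def build_run_request(
    actor_id: str, token: str, run_input: dict[str, Any]
) -> urllib.request.Request:
    """Prepare a POST request that runs the actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Placeholders: substitute the real actor ID and your own API token.
    request = build_run_request(
        "username~actor-name",
        "YOUR_APIFY_TOKEN",
        {"urls": ["https://www.example.org"], "maxResults": 25},
    )
    print(request.get_method(), request.full_url)
    # To send it: urllib.request.urlopen(request) and parse the JSON response.
```

The request is built but not sent, so the snippet runs without credentials; for long crawls an asynchronous run followed by a dataset download may be a better fit than the synchronous endpoint.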