Website Content Crawler avatar

Website Content Crawler

Pricing

from $0.50 / 1,000 results

Go to Apify Store
Website Content Crawler

Website Content Crawler

Full website crawling

Pricing

from $0.50 / 1,000 results

Rating

0.0

(0)

Developer

Virtual Footprint LLC

Virtual Footprint LLC

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

2 days ago

Last modified

Share

Extract web content and document-derived structured data with Website Content Crawler for knowledge workflows.

Features

  • Collect structured content and metadata fields
  • Support direct URL and query-driven extraction modes
  • Return normalized records suitable for RAG and analytics
  • Batch-friendly processing for multiple sources
  • Designed for automated content pipelines
  • Output is optimized for Website Content Crawler buyer workflows on Apify

Common Use Cases

  • Content intelligence
  • Article monitoring
  • Knowledge base ingestion
  • Research workflows
  • Data enrichment
  • Internal reporting

Example Input

{
"query": "market research",
"queries": [
"market research"
],
"urls": [
"https://www.example.org"
],
"maxResults": 25,
"includeRaw": false,
"maxCostPerRun": 5
}

Example Output

{
"query": "market research",
"url": "https://www.example.org",
"actorSlug": "record value",
"source": "record value",
"title": "record value",
"description": "record value",
"scrapedAt": "record value"
}

Input Parameters

FieldTypeRequiredDescription
querystringNoPrimary keyword, URL, profile, company, product, or identifier to collect
queriesarrayNoOptional batch list of query strings. Used when query is empty or when batching is…
urlsarrayNoOptional direct URLs to process. These take priority over discovery when provided
maxResultsintegerNoMaximum number of dataset items to emit
includeRawbooleanNoInclude collection diagnostics and raw source metadata where available
maxCostPerRunnumberNoOptional guardrail in USD. The actor caps output before exceeding this amount
proxyConfigurationobjectNoApify proxy settings for production runs

Output Fields

FieldTypeDescription
querystringNormalized query value
urlstringNormalized url value
actorSlugstringNormalized actorSlug value
sourcestringNormalized source value
titlestringNormalized title value
descriptionstringNormalized description value
scrapedAtstringNormalized scrapedAt value
runIdstringNormalized runId value
rankstringNormalized rank value
contentstringNormalized content value
summarystringNormalized summary value
authorstringNormalized author value

Export Formats

  • JSON
  • CSV
  • Excel
  • XML
  • RSS

Pricing

Pricing Model: PAY_PER_EVENT

$3.00 per 1,000 dataset items.

FAQ

Does this actor support batch processing?

Yes.

Can I export results to CSV?

Yes.

Can I schedule runs?

Yes, through Apify schedules.

Can I run this actor via API?

Yes, via the Apify API.

Does it support direct URLs?

Yes.

Can I integrate this actor with n8n or Make?

Yes.