Website Content Extractor avatar
Website Content Extractor

Under maintenance

Pricing

$9.00/month + usage

Go to Store
Website Content Extractor

Website Content Extractor

Under maintenance

Developed by

fastidious_drawer

Maintained by Community

This extractor lets you extract content from any website with a single or multiple URLs. Use selectors to choose specific sections like the body and exclude elements like headers or navigation. It also extracts images and links, providing data in JSON and DataTable formats for easy processing.

0.0 (0)

Pricing

$9.00/month + usage

0

Total users

22

Monthly users

20

Runs succeeded

>99%

Last modified

12 days ago

You can access the Website Content Extractor programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

1from apify_client import ApifyClient
2
3# Initialize the ApifyClient with your Apify API token
4# Replace '<YOUR_API_TOKEN>' with your token.
5client = ApifyClient("<YOUR_API_TOKEN>")
6
7# Prepare the Actor input
8run_input = {}
9
10# Run the Actor and wait for it to finish
11run = client.actor("fastidious_drawer/website-content-extractor").call(run_input=run_input)
12
13# Fetch and print Actor results from the run's dataset (if there are any)
14print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
15for item in client.dataset(run["defaultDatasetId"]).iterate_items():
16    print(item)
17
18# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

Website Content Extractor API in Python

The Apify API client for Python is the official library that allows you to use Website Content Extractor API in Python, providing convenience functions and automatic retries on errors.

Install the apify-client

pip install apify-client

Other API clients include: