HTML/Web Media Scraper avatar

HTML/Web Media Scraper

Try for free

Pay $4.00 for 1,000 results

Go to Store
HTML/Web Media Scraper

HTML/Web Media Scraper

aweworkz/html-web-media-scraper
Try for free

Pay $4.00 for 1,000 results

Extracts various media files, such as images, videos, audio, and other related media elements, from multiple websites. It then provides the corresponding descriptions or the alt="" content. You may need to use proxies to run this actor for some websites with bot blocking features.

You can access the HTML/Web Media Scraper programmatically from your own Python applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

1from apify_client import ApifyClient
2
3# Initialize the ApifyClient with your Apify API token
4# Replace '<YOUR_API_TOKEN>' with your token.
5client = ApifyClient("<YOUR_API_TOKEN>")
6
7# Prepare the Actor input
8run_input = { "startUrls": [{ "url": "https://crawlee.dev" }] }
9
10# Run the Actor and wait for it to finish
11run = client.actor("aweworkz/html-web-media-scraper").call(run_input=run_input)
12
13# Fetch and print Actor results from the run's dataset (if there are any)
14print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
15for item in client.dataset(run["defaultDatasetId"]).iterate_items():
16    print(item)
17
18# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

HTML/Web Media Scraper API in Python

The Apify API client for Python is the official library that allows you to use HTML/Web Media Scraper API in Python, providing convenience functions and automatic retries on errors.

Install the apify-client

pip install apify-client

Other API clients include:

Developer
Maintained by Community

Actor Metrics

  • 12 monthly users

  • 4 stars

  • 95% runs succeeded

  • Created in Mar 2024

  • Modified 4 months ago