Website Media Link Scraper

Pricing

$5.00 / 1,000 results

Developed by

thenetaji

Maintained by Community

Extract video, audio, and image links from websites with this efficient web crawler. Uses lightweight HTTP requests to discover media URLs across entire sites without browser overhead. Fast, reliable, and resource-friendly.

Rating: 4.1 (2)

Total users: 51
Monthly users: 24
Runs succeeded: >99%
Last modified: a day ago

🔍 Media Link Crawler

TL;DR: Extract links to videos, images, documents, and other media from a website. Automated crawling follows links across multiple pages and collects media URLs that would otherwise take hours to gather by hand.

✅ Features

  • Extracts 11 media categories (videos, audio, images, PDFs, documents, archives, eBooks, fonts, text files, APKs, and contact information), plus an "All" option that covers every category, for 12 selectable types in total
  • Crawls through multiple pages to find media that is not linked from the start page
  • Customizable depth, concurrency, and targeting options
  • Supports proxies for accessing restricted content

🎯 Supported Media Types

| Media Type | Supported Formats |
|---|---|
| Video Files | mp4, webm, mkv, mov, avi, flv, 3gp, etc. |
| Audio Files | mp3, wav, ogg, aac, flac, m4a, etc. |
| Image Files | jpg, jpeg, png, gif, webp, svg, bmp, apng, heic, heif, tiff, ico, etc. |
| PDF Files | pdf |
| Document Files | doc, docx, ppt, pptx, xls, xlsx, csv, odt, ods, odp, rtf, md, txt, json, xml |
| Archive Files | zip, rar, tar, gz, 7z, bz2 |
| eBook Files | epub, mobi, azw3, fb2 |
| Font Files | ttf, otf, woff, woff2, eot |
| Text Files | txt, md, rtf, json, xml, csv |
| Android Packages | apk, xapk |
| Contact Information | Emails, phone numbers, social media profiles, addresses |
| All | All supported media types above |
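
To make the table concrete, here is a minimal Python sketch of extension-based classification. It is an illustration only, not the actor's actual implementation: the `EXTENSIONS_BY_TYPE` map and the `classify_url` helper are hypothetical, the map covers just a subset of the formats listed above, and only the `video` and `image` labels are confirmed by the output example in this README (the other labels are guesses).

```python
from pathlib import PurePosixPath
from typing import Optional
from urllib.parse import urlparse

# Hypothetical map built from a subset of the table above.
# Only "video" and "image" are labels confirmed by this README; the rest are assumed.
EXTENSIONS_BY_TYPE = {
    "video": {"mp4", "webm", "mkv", "mov", "avi", "flv", "3gp"},
    "audio": {"mp3", "wav", "ogg", "aac", "flac", "m4a"},
    "image": {"jpg", "jpeg", "png", "gif", "webp", "svg", "bmp"},
    "pdf": {"pdf"},
    "archive": {"zip", "rar", "tar", "gz", "7z", "bz2"},
}


def classify_url(url: str) -> Optional[str]:
    """Return the media type for a URL based on its path extension, or None."""
    # Use only the path so query strings and fragments do not hide the extension.
    path = urlparse(url).path
    suffix = PurePosixPath(path).suffix.lower().lstrip(".")
    for media_type, extensions in EXTENSIONS_BY_TYPE.items():
        if suffix in extensions:
            return media_type
    return None
```

Matching on the URL path alone is cheap, which fits a crawler that avoids browser overhead, but it misses media served from extensionless URLs.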

⚙️ How to Use

Basic Setup

  1. Paste your target URL(s)
  2. Select media type to extract
  3. Set crawl depth (how many clicks deep)
  4. Run the actor

Input Configuration

{
  "startUrls": [{ "url": "https://example.com" }],
  "mediaType": "video",
  "maxCrawlDepth": 2
}
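
If you generate the input programmatically, a small helper can catch obvious mistakes before a run. The sketch below assumes the field names shown in this README (`startUrls`, `mediaType`, `maxCrawlDepth`, and `maxUrlsToCrawl`); the `build_input` function itself is hypothetical, and it assumes `maxUrlsToCrawl` takes an integer, which this README does not state.

```python
import json
from urllib.parse import urlparse


def build_input(start_urls, media_type="all", max_crawl_depth=2, max_urls_to_crawl=None):
    """Build an input dict shaped like the example above (hypothetical helper)."""
    for url in start_urls:
        parsed = urlparse(url)
        # Reject relative or non-HTTP URLs early instead of failing mid-run.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"Not an absolute http(s) URL: {url!r}")
    if max_crawl_depth < 0:
        raise ValueError("max_crawl_depth must be non-negative")

    run_input = {
        "startUrls": [{"url": url} for url in start_urls],
        "mediaType": media_type,
        "maxCrawlDepth": max_crawl_depth,
    }
    # Assumption: maxUrlsToCrawl is an optional integer cap.
    if max_urls_to_crawl is not None:
        run_input["maxUrlsToCrawl"] = max_urls_to_crawl
    return run_input


if __name__ == "__main__":
    print(json.dumps(build_input(["https://example.com"], media_type="video"), indent=2))
```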

💡 Pro Tips

  • Use maxCrawlDepth: 1 for quick media scanning of a single page
  • Try mediaType: "all" first to discover what's available
  • Set maxUrlsToCrawl to cap the number of pages crawled on large websites
  • Enable proxy for region-restricted content

📊 Output Example

{
  "sourceUrl": "https://example.com/gallery",
  "pageTitle": "Photo Gallery",
  "mediaLinks": [
    {
      "url": "https://example.com/images/photo1.jpg",
      "type": "image",
      "foundAt": "2025-05-20T11:17:17Z"
    },
    {
      "url": "https://example.com/videos/presentation.mp4",
      "type": "video",
      "foundAt": "2025-05-20T11:17:17Z"
    }
  ]
}
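
Once you have exported results, a few lines of Python can regroup the links by type. This is a minimal sketch that assumes every record has the shape shown above; `group_links_by_type` is a hypothetical helper, and `SAMPLE_RECORD` simply repeats the example record.

```python
import json
from collections import defaultdict

# The example record from this README, used as sample data.
SAMPLE_RECORD = json.loads("""
{
  "sourceUrl": "https://example.com/gallery",
  "pageTitle": "Photo Gallery",
  "mediaLinks": [
    {"url": "https://example.com/images/photo1.jpg", "type": "image",
     "foundAt": "2025-05-20T11:17:17Z"},
    {"url": "https://example.com/videos/presentation.mp4", "type": "video",
     "foundAt": "2025-05-20T11:17:17Z"}
  ]
}
""")


def group_links_by_type(records):
    """Collect media URLs from all records into a dict keyed by media type."""
    grouped = defaultdict(list)
    for record in records:
        # Tolerate records without a mediaLinks field.
        for link in record.get("mediaLinks", []):
            grouped[link["type"]].append(link["url"])
    return dict(grouped)


if __name__ == "__main__":
    for media_type, urls in group_links_by_type([SAMPLE_RECORD]).items():
        print(media_type, len(urls))
```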

❓ FAQ

How deep should I crawl?

Start with depth 2 for most sites. Higher depths find more content but take longer.
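
To see why higher depths cost more, consider this sketch of a depth-limited breadth-first crawl over a made-up link graph. The `LINKS` graph and `pages_within_depth` function are hypothetical, and the sketch assumes the start page counts as depth 0; the actor's own counting convention may differ.

```python
from collections import deque

# Hypothetical site structure: each page maps to the pages it links to.
LINKS = {
    "/": ["/gallery", "/blog"],
    "/gallery": ["/gallery/2024"],
    "/blog": ["/blog/post-1"],
    "/gallery/2024": ["/gallery/2024/summer"],
    "/blog/post-1": [],
    "/gallery/2024/summer": [],
}


def pages_within_depth(start, max_depth):
    """Return pages reachable from start in at most max_depth clicks, in visit order."""
    seen = {start}
    queue = deque([(start, 0)])
    visited = []
    while queue:
        page, depth = queue.popleft()
        visited.append(page)
        if depth == max_depth:
            continue  # Do not follow links beyond the depth limit.
        for target in LINKS.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append((target, depth + 1))
    return visited


if __name__ == "__main__":
    for depth in range(4):
        print(depth, len(pages_within_depth("/", depth)))
```

On real sites the number of pages usually grows much faster with depth than in this toy graph, which is why a modest starting depth is a sensible default.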

Can I extract only specific file formats?

Yes! Choose one of the 12 media type options (11 categories plus All), or customize the selection patterns.

Is proxy necessary?

No. A proxy is only needed for restricted content or to avoid rate limiting on large crawls.

⚠️ Limitations

  • Some websites block automated crawling
  • JavaScript-heavy sites may require additional configuration
  • Very large websites might need pagination handling
  • Media embedded in complex players might not be detected
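
The JavaScript and player limitations follow from crawling with plain HTTP requests. The sketch below is not the actor's code: it uses Python's standard `html.parser` with a hypothetical `MediaLinkParser` class and sample `HTML`, and shows that links present in static markup are found while a URL assembled inside a script is not.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class MediaLinkParser(HTMLParser):
    """Collect absolute URLs from src and href attributes in static HTML."""

    URL_ATTRS = {"src", "href"}

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in self.URL_ATTRS and value:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, value))


# Hypothetical page: two static links plus one URL built at runtime by a script.
HTML = """
<img src="/images/photo1.jpg">
<a href="videos/presentation.mp4">Watch</a>
<script>
  player.load("/videos/" + clipId + ".mp4");
</script>
"""


def extract_links(html, base_url):
    parser = MediaLinkParser(base_url)
    parser.feed(html)
    return parser.links


if __name__ == "__main__":
    for link in extract_links(HTML, "https://example.com/gallery"):
        print(link)
```

The URL inside the `<script>` element never appears as an attribute, so a static parser cannot see it. Finding it would require executing the page in a browser.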

🚀 Try It Now

Paste your URL, select media type, and start extracting in seconds!

📮 Need Help?

Contact @thenetaji through the Apify platform for support, implementation questions, or feature requests.