All-In-One Content Scraper avatar

All-In-One Content Scraper

Pricing

Pay per event

Go to Apify Store
All-In-One Content Scraper

All-In-One Content Scraper

The Ultimate All-in-One Web Scraping Solution - Extract ANY media from ANY website with professional-grade precision and speed.

Pricing

Pay per event

Rating

0.0

(0)

Developer

Jeff Halverson

Jeff Halverson

Maintained by Community

Actor stats

1

Bookmarked

7

Total users

0

Monthly active users

2 days ago

Last modified

Share

ALL in One Dats Scraper-Revamped

One actor. Four scraping surfaces. One pricing surface.

Universal Data Harvester is designed as a Swiss-army-knife actor for users who need more than one scraping workflow without paying for a different actor every time. Instead of forcing separate purchases for local leads, Apollo-style sales research, Instagram pulls, and general web extraction, this actor puts those modes behind one input schema.

The current build focuses on returning complete, useful records instead of thin proof-of-life responses:

  • web returns article blocks, fallback text previews, contact points, social links, metadata, media, and links
  • google_maps can follow each discovered place into its detail pane for enriched lead fields such as address, website, phone, hours, plus code, and review counts
  • apollo returns company cards, structured company profiles, FAQs, product links, and lead-style outputs from Apollo pages
  • instagram uses Instagram's public profile surface for richer profile metadata and recent post records on public accounts

Modes

  • web: general website and content extraction with articles, metadata, media, and links
  • google_maps: local business and place extraction from Google Maps searches
  • apollo: Apollo-focused discovery and Apollo page scraping for company and people data
  • instagram: Instagram profiles, posts, hashtags, places, and search surfaces

What It Returns

Every dataset item uses the same top-level contract:

  • mode
  • contentType
  • url
  • title
  • query
  • summary
  • exports
  • plus mode-specific fields such as places, companies, people, profile, posts, images, videos, links, contactPoints, textPreview, companyProfile, featuredFaqs, and highlights

Input Examples

Web

{
"scrapingMode": "web",
"startUrls": [
{ "url": "https://example.com" }
],
"maxRequestsPerCrawl": 5,
"includeImages": true,
"includeVideos": true
}

Google Maps

{
"scrapingMode": "google_maps",
"searchTerms": [
"dentists seattle",
"roofing contractors tacoma"
],
"googleMapsResultLimit": 10,
"followGoogleMapsPlaceDetails": true
}

Apollo

{
"scrapingMode": "apollo",
"apolloQueries": [
"openai",
"b2b saas sales intelligence"
],
"apolloDiscoveryLimit": 5,
"followDiscoveredApolloUrls": true
}

Or scrape direct Apollo URLs:

{
"scrapingMode": "apollo",
"apolloStartUrls": [
{ "url": "https://www.apollo.io/companies/example" }
]
}

Instagram

{
"scrapingMode": "instagram",
"instagramInputType": "profiles",
"instagramHandles": [
"instagram"
],
"instagramResultLimit": 12
}

Hashtags, places, posts, and search are also supported through instagramInputType.

Session Cookies

Apollo and Instagram can expose more data when run against authenticated pages. If you have a valid session, pass cookies in sessionCookiesJson as a JSON array.

Product Intent

This actor is built around one idea: if your workflow needs two or more scraping categories, you should not have to buy two or more separate actors with unrelated pricing logic just to finish one job.