Szwego Scraper avatar
Szwego Scraper
Under maintenance

Pricing

Pay per event

Go to Store
Szwego Scraper

Szwego Scraper

Under maintenance

Developed by

Muhammad Hanzala

Muhammad Hanzala

Maintained by Community

0.0 (0)

Pricing

Pay per event

0

1

1

Last modified

4 days ago

Szwego Images/Videos Scraper

This Apify actor scrapes videos and images from szwego.com stores.

Input

FieldTypeRequiredDescription
urlStringYesThe URL of the szwego.com store to scrape
storeStringYesA name or ID for output file separation
tabStringNoTab to scrape: all, new, video, or photos. Default: all
batchIntegerNoNumber of posts to process per run. Default: 100

Output

The actor outputs data to the default Apify dataset. Each item contains:

  • unique_id: The unique identifier for the post
  • photo_urls: Comma-separated list of image URLs (for all, new, photos tabs)
  • video_urls: Comma-separated list of video URLs (for all, new, photos tabs) or single video URL (for video tab)

Example Input

{
"url": "https://a202310130410455772001072.szwego.com/weshop/store/_dvuqfhCtRn_dKQlVJasDKjjwTrZCD3fXTY_xKMA",
"store": "regina",
"tab": "video",
"batch": 10
}

Usage

  1. Go to your actor's page on Apify Console
  2. Click "Run"
  3. Provide the required input fields
  4. Monitor the run and view results in the dataset

Tabs

  • All: Extracts all images and videos from posts
  • New: Extracts images and videos from modal after clicking posts
  • Video: Extracts video URLs from video posts
  • Photos: Extracts description, images, and videos from photo posts

Features

  • Batch Processing: Process posts in configurable batches
  • Duplicate Prevention: Skips already processed posts
  • Multi-tab Support: Scrapes different content types (All, New, Video, Photos)
  • Robust Error Handling: Continues processing even if individual posts fail
  • Headless Browser: Uses Selenium with Chrome in headless mode for cloud execution

Technical Details

  • Built with Python and Selenium WebDriver
  • Uses Apify SDK for cloud execution
  • Supports both English and Chinese tab names
  • Automatically handles scrolling and modal interactions
  • Filters out non-HTTP/HTTPS URLs

Local Development

To run this actor locally:

  1. Install dependencies:

    $pip install -r requirements.txt
  2. Run the actor:

    $python3 main.py

The actor will use default values for local testing if no input is provided.