Shopify Products Scraper - Automate Product Extraction

3 days trial then $9.99/month - No credit card required now

dainty_screw/shopify-products-scraper

Automate price monitoring on Shopify, the most popular solution for building online stores and selling products online. Crawl arbitrary Shopify-powered stores and extract a list of all products in structured form, including product title, price, description, and more.

Start URLs

startUrls (array, optional)

Shopify shop URLs to use as the starting points of the crawl.

Default value of this property is []

Max items

maxRequestsPerCrawl (integer, optional)

Maximum number of items to scrape. Set it to 0 to scrape everything.

Default value of this property is 10

Proxy Configuration

proxyConfig (object, optional)

Use automatic Apify proxies, residential proxies, or your own proxy servers.

Default value of this property is {"useApifyProxy":true}

Check for Shopify in robots.txt

checkForBanner (boolean, optional)

Ensure that the remote robots.txt file contains the Shopify keyword.

Default value of this property is true
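
The check described above can be sketched roughly as follows. This is an illustration only, not the Actor's actual code: the helper names `robotsUrlFor` and `robotsMentionsShopify`, the case-insensitive match, and the sample robots.txt body are all assumptions.

```javascript
// Hypothetical helper: builds the robots.txt URL for a given shop URL.
function robotsUrlFor(startUrl) {
  return new URL('/robots.txt', startUrl).href;
}

// Hypothetical helper: true when the robots.txt body mentions "Shopify".
// Case-insensitive matching is an assumption, not documented behavior.
function robotsMentionsShopify(robotsTxt) {
  return /shopify/i.test(robotsTxt);
}

// Illustrative robots.txt body (placeholder content).
const sampleRobotsTxt = [
  '# we use Shopify as our ecommerce platform',
  'User-agent: *',
  'Disallow: /admin',
].join('\n');

console.log(robotsUrlFor('https://example-shop.com/collections/all'));
console.log(robotsMentionsShopify(sampleRobotsTxt));
```

Setting checkForBanner to false would skip a check of this kind, which may help for stores that serve a customized robots.txt.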

Extend Output Function

extendOutputFunction (string, optional)

Add or remove properties on the output object, or omit the item from the output by returning null.

Default value of this property is "async ({ data, item, product, images, fns, name, request, variants, context, customData, input, Apify }) => {\n return item;\n}"
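
A minimal sketch of what such a function might look like. The item fields `price` and `description`, and the custom data keys `minPrice` and `tag`, are assumptions for illustration; the real shape of `item` depends on what the Actor produces.

```javascript
// Sketch of an Extend Output Function. It receives the same argument
// object as the default function; only `item` and `customData` are used here.
const extendOutputFunction = async ({ item, customData }) => {
  // `item.price` and `customData.minPrice` are assumed names.
  const price = Number(item.price);
  if (Number.isFinite(price) && price < (customData.minPrice ?? 0)) {
    return null; // returning null omits this item from the output
  }
  // Remove one property (`description`, assumed) and add another (`tag`).
  const { description, ...rest } = item;
  return { ...rest, tag: customData.tag ?? null };
};

// Local demo with made-up data.
extendOutputFunction({
  item: { title: 'Mug', price: '12.50', description: 'A ceramic mug' },
  customData: { minPrice: 10, tag: 'demo' },
}).then((result) => console.log(result));
```

Because the property is a string, the function source itself is what goes into the input, for example via `extendOutputFunction.toString()`.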

Extend Scraper Function

extendScraperFunction (string, optional)

Advanced function that extends the default scraper functionality and lets you manually perform actions on the page.

Default value of this property is "async ({ fns, customData, Apify, label }) => {\n \n}"

Custom data

customData (object, optional)

Any data that you want to have available inside the Extend Output Function or Extend Scraper Function.

Default value of this property is {}

Fetch HTML

fetchHtml (boolean, optional)

Fetching the HTML of the pages makes the run take twice as long. Enable this only if you need it.

Default value of this property is false

Max concurrency

maxConcurrency (integer, optional)

Maximum number of requests to process in parallel.

Default value of this property is 10

Max request retries

maxRequestRetries (integer, optional)

Maximum number of times a failed request is retried.

Default value of this property is 3
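
To make the retry count concrete, here is a generic retry loop. It is a sketch, not the Actor's internals, and it assumes that retries come on top of the first attempt, so a value of 3 allows up to 4 attempts in total. The helper name `withRetries` is hypothetical.

```javascript
// Generic retry helper: one initial attempt plus up to
// `maxRequestRetries` further attempts. Rethrows the last error.
async function withRetries(task, maxRequestRetries = 3) {
  let lastError;
  for (let attempt = 0; attempt <= maxRequestRetries; attempt++) {
    try {
      return await task(attempt);
    } catch (err) {
      lastError = err; // remember the failure and try again
    }
  }
  throw lastError;
}

// Demo: a task that fails twice, then succeeds on the third attempt.
let calls = 0;
withRetries(async () => {
  calls += 1;
  if (calls < 3) throw new Error('temporary failure');
  return 'ok';
}).then((value) => console.log(value, 'after', calls, 'attempts'));
```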

Debug Log

debugLog (boolean, optional)

Enable more verbose logging to help you understand what is happening during scraping.

Default value of this property is false
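
Putting the properties above together, a full input object with the documented defaults might look like the sketch below. The shop URL is a placeholder, and the `{ url }` shape of each startUrls entry is an assumption, since the page only states that the property is an array.

```javascript
// Example run input using the default values listed above.
const input = {
  // Placeholder URL; the `{ url }` entry shape is an assumption.
  startUrls: [{ url: 'https://example-shop.com' }],
  maxRequestsPerCrawl: 10, // 0 would scrape everything
  proxyConfig: { useApifyProxy: true },
  checkForBanner: true,
  extendOutputFunction:
    'async ({ data, item, product, images, fns, name, request, variants, context, customData, input, Apify }) => {\n return item;\n}',
  extendScraperFunction:
    'async ({ fns, customData, Apify, label }) => {\n \n}',
  customData: {},
  fetchHtml: false,
  maxConcurrency: 10,
  maxRequestRetries: 3,
  debugLog: false,
};

// Serialize to JSON, the form in which an Actor input is submitted.
console.log(JSON.stringify(input, null, 2));
```

All properties are optional, so in practice only the values that differ from the defaults need to be supplied.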

Developer
Maintained by Community
Actor metrics
  • 2 monthly users
  • 2 stars
  • 100.0% runs succeeded
  • Created in Apr 2024
  • Modified 5 days ago