Actor picture

Etsy Scraper

tugkan/etsy-scraper

Scrape data from over 4 million sellers on Etsy. Crawl and extract descriptions, images, seller details, listed date, and all other product details. You can specify search terms, filters, and much more.

Free trial for 7 days. Then $5/month.
Author's avatarTu臒kan Cengiz
  • Modified
  • Used by6 users
  • Used32 times
Actor picture
Etsy Scraper
Free trial for 7 days. Then $5/month.

Etsy scraper

Since Etsy doesn't provide an API, this actor should help you to retrieve data from it.

The Etsy data scraper supports the following features:

  • Scrape product details - You can scrape attributes like images, seller information, photos, time of listing and many more. You can find details below.

  • Scrape search results - You can scrape for a specific search result by keyword.

  • Scrape and filter any categories - You can provide any category with any kind of filter that you want.

  • Define maximum number of pages that needs to be scraped - If you only want to scrape first 3 pages, there is an option for that.

Etsy specific

Don't worry when you get little bit different products than you saw in browser page. Etsy is ordering products differently for each user.

Bugs, fixes, updates and changelog

This scraper is under active development. If you have any feature requests you can create an issue from here.

Incoming Changes

  • Fetching product reviews
  • Fetch seller profile
  • Fetch seller items

Input Parameters

The input of this scraper should be JSON containing the list of pages on Etsy that should be visited. Required fields are:

Field Type Description
startUrls Array (optional) List of Etsy URLs. You should only provide category detail, product detail or search URLs
search String (optional) Keyword that can be searched in Etsy search engine.
endPage Integer (optional) Final number of page that you want to scrape. Default is Infinite.
includeDescription Boolean (optional) If you want to fetch description HTML (or text) you can enable this option. Default value is false.
maxItems Integer (optional) You can limit scraped products. This should be useful when you search through the big subcategories.
proxy Object Proxy configuration
extendOutputFunction String (optional) Function that takes a JQuery handle ($) as argument and returns object with data
This solution requires the use of Proxy servers, either your own proxy servers or you can use Apify Proxy.
Tip

When you want to have a filtering over a category URL; go to Etsy, create filters over the category and copy and paste the link as one of the startUrl.

If you would like to scrape only the first page of a search list or category list, then put the link for the page and have the endPage as 1.

With the last approach that explained above you can also fetch any interval of pages. If you provide the 5th page of a category and define the endPage parameter as 6 then you'll have the 5th and 6th pages only.

Compute Unit Consumption

The actor optimized to run blazing fast and scrape many as product as possible. Therefore, it forefronts all product detail requests. If actor doesn't block very often it'll scrape 50 products in 1 minute with ~0.04-0.05 compute units.

Etsy Scraper Input example

{
    "startUrls":[
        {
            "url": "https://www.etsy.com/c/clothing-and-shoes?ref=pagination&page=4"
        }
    ],
    "search": "apple watch",
    "endPage":6,
    "includeDescription": true,
    "maxItems": 100
}

During the Run

During the run, the actor will output messages letting you know what is going on. Each message always contains a short label specifying which page from the provided list is currently specified. When items are loaded from the page, you should see a message about this event with a loaded item count and total item count for each page.

If you provide incorrect input to the actor, it will immediately stop with failure state and output an explanation of what is wrong.

Etsy Export

During the run, the actor stores results into a dataset. Each item is a separate item in the dataset.

You can manage the results in any languague (Python, PHP, Node JS/NPM). See the FAQ or our API reference to learn more about getting results from this Etsy actor.

Scraped Etsy Products

The structure of each item in Etsy products looks like this:

{
  "id": "498417899",
  "url": "https://www.etsy.com/listing/498417899/edc-slim-valet-pocket-organizer?ga_order=most_relevant&ga_search_type=all&ga_view_type=gallery&ga_search_query=&ref=sr_gallery-4-29&frs=1&cns=1&col=1",
  "name": "EDC Slim Valet - Pocket Organizer\n        Update your settings",
  "images": [
    "https://i.etsystatic.com/13963159/r/il/2b7207/1198502839/il_794xN.1198502839_olzc.jpg",
    "https://i.etsystatic.com/13963159/r/il/a3c7cf/1617748881/il_794xN.1617748881_1o26.jpg",
    "https://i.etsystatic.com/13963159/r/il/9eb645/1570294448/il_794xN.1570294448_oe8w.jpg",
    "https://i.etsystatic.com/13963159/r/il/6b2a6a/1570293332/il_794xN.1570293332_8z6i.jpg",
    "https://i.etsystatic.com/13963159/r/il/fe2c32/1198502863/il_794xN.1198502863_2baa.jpg",
    "https://i.etsystatic.com/13963159/r/il/a735e6/1151892222/il_794xN.1151892222_95a2.jpg"
  ],
  "seller": {
    "name": "RutsuDesigns",
    "url": "https://www.etsy.com/shop/RutsuDesigns?ref=simple-shop-header-name&listing_id=498417899",
    "sales": 357,
    "rating": 5,
    "numberOfReviews": 86
  },
  "variations": [
    {
      "label": "Color",
      "values": [
        "A. Dark Chocolate ($49.00)",
        "B. Mahogany ($51.00)"
      ]
    }
  ],
  "highlights": [
    "Handmade",
    "Materials: leather, metal rivets"
  ],
  "availability": "Low in stock",
  "listedOn": "Mar 17, 2021",
  "favorites": 716
}