Thisisl Urls Spider avatar
Thisisl Urls Spider

Pricing

from $9.00 / 1,000 results

Go to Apify Store
Thisisl Urls Spider

Thisisl Urls Spider

This Apify actor efficiently crawls Thisisl URLs to extract detailed product data on tampons and related items, including names, descriptions, images, and metadata....

Pricing

from $9.00 / 1,000 results

Rating

0.0

(0)

Developer

GetDataForMe

GetDataForMe

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

19 days ago

Last modified

Share

This Apify Actor is designed to efficiently crawl and extract detailed product information from specified URLs on the Thisisl website, focusing on tampons and related products. It provides structured data including product names, descriptions, images, and metadata, making it ideal for e-commerce data collection and analysis. By automating the scraping process, it saves time and ensures reliable, up-to-date information for various business and research needs.

Features

  • Targeted Scraping: Precisely extracts product details from Thisisl URLs, including names, titles, descriptions, and breadcrumbs.
  • Image Handling: Captures featured images with titles, descriptions, URLs, and file sizes for comprehensive media data.
  • Metadata Inclusion: Adds crawled dates, actor IDs, and run IDs for traceability and auditing.
  • Flexible Input: Accepts a list of URLs for batch processing, supporting multiple pages in a single run.
  • High Reliability: Built with robust error handling to manage varying page structures and network issues.
  • Fast Performance: Optimized for quick execution, minimizing resource usage while delivering accurate results.
  • JSON Output: Produces clean, structured JSON data ready for integration into databases or analytics tools.

Input Parameters

ParameterTypeRequiredDescriptionExample
UrlsarrayYesA list of URLs to scrape from the Thisisl website. Each URL must start with http:// or https:// and point to product pages.["https://www.thisisl.com/l-tampons-light-regular/"]

Example Usage

To run the actor, provide the input in JSON format as shown below:

{
"Urls": [
"https://www.thisisl.com/l-tampons-light-regular/"
]
}

Example output in JSON format:

[
{
"Product_Name": "PDP - L. Light/Regular Tampons",
"Title": "L. | Light/Regular Tampons",
"Description": "Organic light and regular tampons with a 100% cotton core ",
"URL": "https://www.thisisl.com/l-tampons-light-regular/",
"Breadcrumbs": "L. | Light/Regular Tampons",
"Featured_Image": {
"Title": "00073010715639 C1N1",
"Description": "L. Tampons Light/Regular\n",
"URL": "https://images.ctfassets.net/hk5leik3t8gi/5Ag6W6r7cN5HsKLJE3LgNm/baf25ea5b1adfcd1f6e193eaaea02d26/00073010715639_C1N1.png",
"Size": 733582
},
"Crawled_Date": "2026-01-14",
"actor_id": "iMIxDmr3c8Y2TbGar",
"run_id": "Gp5x58xhw5kbD4ZAg"
}
]

Use Cases

  • Market Research: Gather detailed product information to analyze trends in organic tampon offerings.
  • Competitive Intelligence: Compare Thisisl products with competitors by extracting descriptions and images.
  • Price Monitoring: Track product details for pricing strategies, though prices aren't directly scraped here.
  • Content Aggregation: Build a database of product catalogs for e-commerce platforms.
  • Academic Research: Study consumer products in the feminine hygiene sector with structured data.
  • Business Automation: Automate data collection for inventory management or marketing campaigns.

Installation and Usage

  1. Search for "Thisisl Urls Spider" in the Apify Store
  2. Click "Try for free" or "Run"
  3. Configure input parameters
  4. Click "Start" to begin extraction
  5. Monitor progress in the log
  6. Export results in your preferred format (JSON, CSV, Excel)

Output Format

The actor outputs an array of JSON objects, each representing a scraped product. Key fields include:

  • Product_Name: The full product name.
  • Title: A shortened title.
  • Description: Detailed product description.
  • URL: The source URL.
  • Breadcrumbs: Navigation path.
  • Featured_Image: An object with image title, description, URL, and size in bytes.
  • Crawled_Date: Date of scraping (YYYY-MM-DD).
  • actor_id and run_id: Unique identifiers for the actor run.

This structure ensures easy parsing and integration.

Support

For custom/simplified outputs or bug reports, please contact:

We're here to help you get the most out of this Actor!