Parsera avatar

Parsera

Try for free

This Actor is paid per event

Go to Store
Parsera

Parsera

parsera-labs/parsera
Try for free

This Actor is paid per event

Extract Data from ANY website with Parsera.org

Developer
Maintained by Community

Actor Metrics

  • 4 Monthly users

  • No reviews yet

  • 4 bookmarks

  • >99% runs succeeded

  • 56 days response time

  • Created in Nov 2024

  • Modified 15 days ago

Categories

Run Parsera Actor on Apify

Parsera Actor

Extract structured data from any website using Parsera's AI-powered data extraction API.

Example

Input url you want to scrape in Basic Configuration > Target URL, and list columns to extract in Extraction Settings > Extraction Attributes. For example, you can extract list of articles from https://news.ycombinator.com/ by putting this value into Target URL and filling Extraction Attributes with:

1[
2    {
3        "description": "News title",
4        "name": "title"
5    },
6    {
7        "description": "Number of points",
8        "name": "points"
9    },
10    {
11        "description": "Number of comments",
12        "name": "nr_comments"
13    }
14]

At end you'll get a table that looks like this:

nr_commentspointstitle
1141The Inevitability of the Borrow Checker
119When Louis Armstrong Conquered Chicago
448689Meta torrented & seeded 81.7 TB dataset containing copyrighted data
.........

📝 Input Configuration

The actor accepts the following input parameters:

FieldTypeRequiredDescription
urlStringYesThe target URL to extract data from
attributesArrayYesList of data attributes to extract
proxyCountryStringNoCountry for proxy IP (defaults to United States)
cookiesArrayNoCookies to inject into the request
precisionModeBooleanNoEnable high-precision extraction mode

Attributes Structure

Each attribute in the attributes array should have:

  • name: Identifier for the extracted data
  • description: Natural language description of what to extract

💡 Tips

  • Use precise, detailed descriptions in your attributes for better extraction accuracy
  • Enable precisionMode for highest accuracy (uses more credits)
  • Test your extraction pattern on a few pages before running large-scale scrapes
  • The speed of the response depends mainly on the LLM output so if you're collecting a lot of data, the response time will increase. We're working on a code generation sytem to provide back data instantly, so stay tuned and sign up for news at https://parsera.org!

📊 Usage Limits

🤝 Support