LLM Dataset Processor

dusan.vystrcil/llm-dataset-processor

This Actor is under maintenance and may be unreliable.
Allows you to process a whole dataset with a single LLM prompt. It's useful if you need to enrich data, summarize content, extract specific information, or manipulate data in a structured way using AI.

Input Dataset ID

inputDatasetId (string, Required)

The ID of the dataset to process.

Large Language Model

model (enum, Required)

The LLM to use for processing. Each model has different capabilities and pricing. GPT-4o-mini and Claude 3.5 Haiku are recommended for cost-effective processing, while models like Claude 3 Opus or GPT-4o offer higher quality but at a higher cost.

Value options:

  • "gpt-4o-mini"
  • "gpt-4o"
  • "claude-3-5-haiku-latest"
  • "claude-3-5-sonnet-latest"
  • "claude-3-opus-latest"
  • "gemini-1.5-flash"
  • "gemini-1.5-flash-8b"
  • "gemini-1.5-pro"

LLM Provider API Token

llmApiToken (string, Required)

Your API token for the LLM Provider (e.g., OpenAI).

Temperature

temperature (string, Required)

Sampling temperature for the LLM API (controls randomness). We recommend a value closer to 0 for exact results and a value closer to 1 for more 'creative' results.

Default value of this property is "0.1"

Multiple columns in output

multipleColumns (boolean, Optional)

When enabled, instructs the LLM to return responses as JSON objects, creating multiple columns in the output dataset. The columns need to be named and described in the prompt. If disabled, responses are stored in a single llmresponse column.

Default value of this property is false
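The difference between the two modes can be sketched as follows. This is an illustrative sketch of the mapping described above, not the Actor's actual code; the function name is hypothetical.

```python
import json

def to_output_record(raw_response: str, multiple_columns: bool) -> dict:
    """Map one LLM response to an output dataset record (illustrative sketch)."""
    if not multiple_columns:
        # Single-column mode: the raw text lands in one llmresponse field.
        return {"llmresponse": raw_response}
    # Multiple-columns mode: the LLM is instructed to return a JSON object,
    # whose keys become the columns of the output dataset.
    return json.loads(raw_response)

print(to_output_record('{"sentiment": "positive", "summary": "Great product"}', True))
print(to_output_record("A short plain-text answer.", False))
```

In multiple-columns mode the prompt must describe each expected key, otherwise the model may not return parseable JSON.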

Prompt

prompt (string, Required)

The prompt template to send to the LLM API.

Use {{fieldName}} placeholders to insert values from the input dataset (e.g., Summarize this text: {{content.text}}). For multiple-columns output, ensure your prompt names and describes each desired output column.

See README for more details.
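Placeholder substitution, including dotted paths like content.text, can be sketched as below. This is an assumption about the behavior described above (the Actor resolves placeholders internally); render_prompt is a hypothetical helper.

```python
import re

def render_prompt(template: str, item: dict) -> str:
    """Replace {{fieldName}} placeholders with values from a dataset item.
    Supports dotted paths such as content.text (illustrative sketch)."""
    def resolve(match: re.Match) -> str:
        value = item
        for part in match.group(1).strip().split("."):
            # Walk one level deeper per path segment; missing keys become "".
            value = value.get(part, "") if isinstance(value, dict) else ""
        return str(value)
    return re.sub(r"\{\{([^}]+)\}\}", resolve, template)

item = {"content": {"text": "Apify makes scraping easy."}}
print(render_prompt("Summarize this text: {{content.text}}", item))
# → Summarize this text: Apify makes scraping easy.
```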

Skip item if one or more {{field}} are empty

skipItemIfEmpty (boolean, Optional)

When enabled, items will be skipped if any {{field}} referenced in the prompt is empty, null, undefined, or contains only whitespace. This helps prevent processing incomplete data.

Default value of this property is true
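The skip condition described above can be sketched as a predicate over the prompt template and one dataset item. This is a sketch under the stated rules (empty, null/None, missing, or whitespace-only), not the Actor's code; should_skip is a hypothetical name.

```python
import re

def should_skip(template: str, item: dict) -> bool:
    """Return True if any {{field}} referenced in the prompt is missing,
    None, or whitespace-only (mirrors the skipItemIfEmpty rule, as a sketch)."""
    for path in re.findall(r"\{\{([^}]+)\}\}", template):
        value = item
        for part in path.strip().split("."):
            value = value.get(part) if isinstance(value, dict) else None
        if value is None or (isinstance(value, str) and not value.strip()):
            return True  # at least one referenced field is effectively empty
    return False

print(should_skip("Summarize: {{content.text}}", {"content": {"text": "   "}}))  # → True
```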

Max Tokens

maxTokens (integer, Required)

Maximum number of tokens in the LLM API response.

Default value of this property is 150

Test Prompt Mode

testPrompt (boolean, Optional)

Test mode that processes only a limited number of items (defined by testItemsCount). Use this to validate your prompt and configuration before running on the full dataset. We highly recommend enabling this option first, since LLM responses can be ambiguous.

Default value of this property is true

Test Items Count

testItemsCount (integer, Optional)

Number of items to process when Test Prompt Mode is enabled.

Default value of this property is 3
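Putting the fields together, an example run input might look like the following. All values are illustrative placeholders, and the apify-client call at the end is a hedged sketch of how such an input could be submitted, not a tested invocation.

```python
# Example run input assembled from the fields documented above.
run_input = {
    "inputDatasetId": "YOUR_DATASET_ID",    # placeholder, not a real dataset ID
    "model": "gpt-4o-mini",                 # cost-effective default
    "llmApiToken": "YOUR_PROVIDER_API_KEY", # placeholder
    "temperature": "0.1",
    "multipleColumns": False,
    "prompt": "Summarize this text: {{content.text}}",
    "skipItemIfEmpty": True,
    "maxTokens": 150,
    "testPrompt": True,                     # validate on a few items first
    "testItemsCount": 3,
}

# With the apify-client package, this input could be passed to the Actor, e.g.:
# from apify_client import ApifyClient
# ApifyClient("APIFY_TOKEN").actor("dusan.vystrcil/llm-dataset-processor").call(run_input=run_input)
```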

Developer
Maintained by Community

Actor Metrics

  • 0 monthly users

  • No stars yet

  • >99% runs succeeded

  • Created in Dec 2024

  • Modified 2 days ago
