Auto Dataset Analysis & HTML Reports
Pricing
from $10.00 / 1,000 results
Auto Dataset Analysis & HTML Reports
Simply input any tabular dataset, and you'll receive a detailed and elegantly formatted profiling report in return—no coding skills necessary. This user-friendly process allows you to effortlessly explore the dataset's characteristics and insights, making data analysis accessible to everyone.
Pricing
from $10.00 / 1,000 results
Rating
0.0
(0)
Developer
Jamshaid Arif
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
8 days ago
Last modified
Categories
Share
📊 Automated dataset analysis and HTML report generation
Automated dataset analysis and HTML report generation, inspired by ydata-profiling but built natively for the Apify platform.
Feed it any tabular dataset and get back a comprehensive, beautifully styled profiling report — no code required.
What It Does
| Feature | Details |
|---|---|
| Overview | Row/column counts, memory usage, missing-cell %, duplicate rows |
| Per-Column Stats | Mean, median, std, quartiles, skewness, kurtosis (numeric); top values, avg string length (categorical); min/max/range (datetime) |
| Histograms | Auto-binned SVG charts for every numeric column |
| Correlations | Full heatmap + auto-alerts for highly correlated pairs |
| Data Quality Alerts | Flags high-missing columns and high-correlation pairs |
| Sample Preview | First & last 5 rows rendered in the report |
Data Sources
| Source | Input Field |
|---|---|
| CSV URL | sourceUrl — any publicly accessible .csv link |
| JSON URL | sourceUrl — any publicly accessible .json link |
| Raw JSON | rawJson — paste a JSON array of objects directly |
| Apify Dataset | apifyDatasetId — ID of an existing Apify dataset |
Input Example
{"dataSource": "csvUrl","sourceUrl": "https://people.sc.fsu.edu/~jburkardt/data/csv/airtravel.csv","reportTitle": "Air Travel Dataset Profile","sampleRows": 0,"correlationThreshold": 0.8,"missingThreshold": 10}
Output
1. HTML Report
Saved to the Key-Value Store as profiling_report.html. Open it directly in your browser from the actor run's Storage tab.
2. Structured Summary
Pushed to the Dataset output for programmatic access — includes overview stats, column profiles, and all alerts as JSON.
Configuration Reference
| Parameter | Type | Default | Description |
|---|---|---|---|
dataSource | enum | csvUrl | One of: csvUrl, jsonUrl, rawJson, apifyDatasetId |
sourceUrl | string | — | URL to CSV or JSON file |
rawJson | string | — | Raw JSON array of objects |
apifyDatasetId | string | — | Existing Apify dataset ID |
reportTitle | string | Data Profiling Report | Title in the HTML report header |
sampleRows | integer | 0 | Max rows to analyze (0 = all) |
correlationThreshold | number | 0.8 | Alert when |
missingThreshold | number | 10 | Alert when missing % exceeds this |
License
ISC


