Auto Dataset Analysis & HTML Reports avatar

Auto Dataset Analysis & HTML Reports

Pricing

from $10.00 / 1,000 results

Go to Apify Store
Auto Dataset Analysis & HTML Reports

Auto Dataset Analysis & HTML Reports

Simply input any tabular dataset, and you'll receive a detailed and elegantly formatted profiling report in return—no coding skills necessary. This user-friendly process allows you to effortlessly explore the dataset's characteristics and insights, making data analysis accessible to everyone.

Pricing

from $10.00 / 1,000 results

Rating

0.0

(0)

Developer

Jamshaid Arif

Jamshaid Arif

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

8 days ago

Last modified

Share

📊 Automated dataset analysis and HTML report generation

Automated dataset analysis and HTML report generation, inspired by ydata-profiling but built natively for the Apify platform.

Feed it any tabular dataset and get back a comprehensive, beautifully styled profiling report — no code required.


What It Does

FeatureDetails
OverviewRow/column counts, memory usage, missing-cell %, duplicate rows
Per-Column StatsMean, median, std, quartiles, skewness, kurtosis (numeric); top values, avg string length (categorical); min/max/range (datetime)
HistogramsAuto-binned SVG charts for every numeric column
CorrelationsFull heatmap + auto-alerts for highly correlated pairs
Data Quality AlertsFlags high-missing columns and high-correlation pairs
Sample PreviewFirst & last 5 rows rendered in the report

Data Sources

SourceInput Field
CSV URLsourceUrl — any publicly accessible .csv link
JSON URLsourceUrl — any publicly accessible .json link
Raw JSONrawJson — paste a JSON array of objects directly
Apify DatasetapifyDatasetId — ID of an existing Apify dataset

Input Example

{
"dataSource": "csvUrl",
"sourceUrl": "https://people.sc.fsu.edu/~jburkardt/data/csv/airtravel.csv",
"reportTitle": "Air Travel Dataset Profile",
"sampleRows": 0,
"correlationThreshold": 0.8,
"missingThreshold": 10
}

Output

1. HTML Report

Saved to the Key-Value Store as profiling_report.html. Open it directly in your browser from the actor run's Storage tab.

2. Structured Summary

Pushed to the Dataset output for programmatic access — includes overview stats, column profiles, and all alerts as JSON.


Configuration Reference

ParameterTypeDefaultDescription
dataSourceenumcsvUrlOne of: csvUrl, jsonUrl, rawJson, apifyDatasetId
sourceUrlstringURL to CSV or JSON file
rawJsonstringRaw JSON array of objects
apifyDatasetIdstringExisting Apify dataset ID
reportTitlestringData Profiling ReportTitle in the HTML report header
sampleRowsinteger0Max rows to analyze (0 = all)
correlationThresholdnumber0.8Alert when
missingThresholdnumber10Alert when missing % exceeds this

License

ISC