Smart Data Converters
Pricing
from $10.00 / 1,000 results
Smart Data Converters
A versatile 5-in-1 data conversion actor that intelligently transforms data between formats. Every converter includes sensible defaults in the input schema so you can run it instantly without any configuration.
Pricing
from $10.00 / 1,000 results
Rating
0.0
(0)
Developer
Jamshaid Arif
Actor stats
0
Bookmarked
1
Total users
0
Monthly active users
3 days ago
Last modified
Categories
Share
π§ Smart Data Converters β Apify Actor
A versatile 5-in-1 data conversion actor that intelligently transforms data between formats. Every converter includes sensible defaults in the input schema so you can run it instantly without any configuration.
Converters
| # | Converter | Input | Output |
|---|---|---|---|
| 1 | HTML β Clean Text | Raw HTML / URL | Plain text (no tags, ads, scripts) |
| 2 | HTML β Article Summary | Raw HTML / URL | Title, metadata, extractive summary |
| 3 | JSON β Human-Readable Report | JSON string | Formatted report with tables & stats |
| 4 | Logs β Structured JSON | Raw log text | Parsed entries + summary analytics |
| 5 | Form Input β API Request | Form fields JSON | Full HTTP request payload |
Quick Start
- Select a converter via the
converter_typedropdown. - Provide input (or use the pre-filled defaults).
- Run the actor β results are pushed to the default dataset.
All fields have defaults, so you can run the actor immediately after selecting a converter type.
Input Schema Overview
Global
| Field | Type | Default | Description |
|---|---|---|---|
converter_type | select | html_to_clean_text | Which converter to run |
url_to_scrape | string | "" | Optional URL to fetch HTML from (overrides html_input) |
Converter 1 & 2 β HTML Input
| Field | Type | Default |
|---|---|---|
html_input | textarea | Sample AI article HTML |
preserve_links | boolean | true |
preserve_line_breaks | boolean | true |
summary_sentences | integer | 5 |
Converter 3 β JSON Input
| Field | Type | Default |
|---|---|---|
json_input | textarea | Sample company data JSON |
report_max_width | integer | 100 |
Converter 4 β Log Input
| Field | Type | Default |
|---|---|---|
log_input | textarea | 10 sample app log lines |
Converter 5 β Form + API Config
| Field | Type | Default |
|---|---|---|
form_input | textarea | Sample user registration fields |
field_mapping | textarea | Dot-notation mapping to nested API paths |
api_base_url | string | https://api.example.com |
api_endpoint | string | /v2/users/register |
api_method | select | POST |
api_auth_type | select | bearer |
validation_rules | textarea | Sample required/min/max rules |
Output
Results are pushed to the default dataset as structured JSON. Key outputs are also saved to the key-value store as plain text:
| Converter | KV Store Key | Content |
|---|---|---|
html_to_clean_text | clean_text.txt | Cleaned plain text |
html_to_article_summary | summary.txt | Formatted summary |
json_to_report | report.txt | ASCII-formatted report |
Project Structure
apify-smart-converters/βββ .actor/β βββ actor.json # Actor metadata & configβ βββ input_schema.json # Full input schema with defaultsβββ src/β βββ __init__.pyβ βββ main.py # Actor entry point & routingβ βββ converters.py # All 5 converter classesβββ Dockerfileβββ requirements.txtβββ README.md
Running Locally
# Install dependenciespip install -r requirements.txt# Set input via environment or INPUT.jsonecho '{"converter_type": "html_to_clean_text"}' > INPUT.json# Runpython -m src.main
Supported Log Formats (Converter 4)
The log parser auto-detects:
- Apache Combined β
127.0.0.1 - - [10/Oct/2000:13:55:36 -0700] "GET /index.html ..." - Nginx Error β
2024/06/15 08:30:02 [error] 1234#0: ... - Syslog β
Jun 15 08:30:01 myhost sshd[1234]: ... - Generic App β
2024-06-15T08:30:01.123Z [INFO] [Source] message - JSON Lines β
{"timestamp": "...", "level": "INFO", "message": "..."}
License
MIT