Smart Data Converters

Pricing

from $10.00 / 1,000 results

A versatile 5-in-1 data conversion actor that intelligently transforms data between formats. Every converter includes sensible defaults in the input schema so you can run it instantly without any configuration.

Developer

Jamshaid Arif

Maintained by Community

Last modified

3 days ago

🧠 Smart Data Converters - Apify Actor


Converters

| # | Converter | Input | Output |
|---|-----------|-------|--------|
| 1 | HTML → Clean Text | Raw HTML / URL | Plain text (no tags, ads, scripts) |
| 2 | HTML → Article Summary | Raw HTML / URL | Title, metadata, extractive summary |
| 3 | JSON → Human-Readable Report | JSON string | Formatted report with tables & stats |
| 4 | Logs → Structured JSON | Raw log text | Parsed entries + summary analytics |
| 5 | Form Input → API Request | Form fields JSON | Full HTTP request payload |

Quick Start

  1. Select a converter via the converter_type dropdown.
  2. Provide input (or use the pre-filled defaults).
  3. Run the actor; results are pushed to the default dataset.

All fields have defaults, so you can run the actor immediately after selecting a converter type.


Input Schema Overview

Global

| Field | Type | Default | Description |
|-------|------|---------|-------------|
| converter_type | select | html_to_clean_text | Which converter to run |
| url_to_scrape | string | "" | Optional URL to fetch HTML from (overrides html_input) |
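
The override rule for url_to_scrape can be sketched as follows. This is a minimal illustration, not the actor's actual code: the helper names fetch_url and resolve_html are hypothetical, and only the field names url_to_scrape and html_input come from the schema above.

```python
from typing import Callable
from urllib.request import urlopen


def fetch_url(url: str, timeout: float = 10.0) -> str:
    """Download a page and decode it as text (hypothetical helper)."""
    with urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def resolve_html(run_input: dict, fetch: Callable[[str], str] = fetch_url) -> str:
    """Return the HTML to convert: a non-empty url_to_scrape wins over html_input."""
    url = (run_input.get("url_to_scrape") or "").strip()
    if url:
        return fetch(url)
    return run_input.get("html_input", "")
```

Passing the fetcher as a parameter keeps the override logic testable without network access.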

Converter 1 & 2 - HTML Input

| Field | Type | Default |
|-------|------|---------|
| html_input | textarea | Sample AI article HTML |
| preserve_links | boolean | true |
| preserve_line_breaks | boolean | true |
| summary_sentences | integer | 5 |
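
To illustrate what preserve_links and preserve_line_breaks might control, here is a rough sketch of an HTML to clean text pass built on the standard library's html.parser. It is an assumption about the behaviour, not the actor's implementation: the class and function names are hypothetical, links are rendered as "text (href)" by choice, and ad removal is not covered.

```python
from html.parser import HTMLParser

# Tags whose content is dropped entirely, and tags that start a new line.
SKIP_TAGS = {"script", "style", "noscript", "template"}
BLOCK_TAGS = {"p", "div", "br", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6",
              "section", "article", "header", "footer", "blockquote"}


class CleanTextParser(HTMLParser):
    def __init__(self, preserve_links: bool = True):
        super().__init__(convert_charrefs=True)
        self.preserve_links = preserve_links
        self.parts = []        # collected text fragments
        self.skip_depth = 0    # > 0 while inside a skipped tag
        self.href_stack = []   # hrefs of currently open <a> tags

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1
        elif tag == "a":
            self.href_stack.append(dict(attrs).get("href"))
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif tag == "a" and self.href_stack:
            href = self.href_stack.pop()
            if href and self.preserve_links and not self.skip_depth:
                self.parts.append(f" ({href})")
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        if not self.skip_depth:
            self.parts.append(data)


def html_to_clean_text(html: str, preserve_links: bool = True,
                       preserve_line_breaks: bool = True) -> str:
    parser = CleanTextParser(preserve_links)
    parser.feed(html)
    parser.close()
    # Collapse whitespace inside each line and drop empty lines.
    lines = [" ".join(line.split()) for line in "".join(parser.parts).split("\n")]
    lines = [line for line in lines if line]
    return ("\n" if preserve_line_breaks else " ").join(lines)
```

With both flags off, the same input collapses to a single line without URLs.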

Converter 3 - JSON Input

| Field | Type | Default |
|-------|------|---------|
| json_input | textarea | Sample company data JSON |
| report_max_width | integer | 100 |
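
One plausible reading of report_max_width is a hard cap on the length of each report line. The sketch below flattens the JSON into dotted paths, prints one row per leaf, and appends simple numeric stats. The layout, the function names, and the clipping rule are all assumptions made for illustration; the actor's real report format may differ.

```python
import json
from statistics import mean


def flatten(value, prefix=""):
    """Yield (dotted_path, scalar) pairs for every leaf of a JSON value."""
    if isinstance(value, dict):
        for key, child in value.items():
            yield from flatten(child, f"{prefix}.{key}" if prefix else str(key))
    elif isinstance(value, list):
        for index, child in enumerate(value):
            yield from flatten(child, f"{prefix}[{index}]")
    else:
        yield prefix, value


def clip(line: str, width: int) -> str:
    """Cut a line to at most `width` characters, marking the cut with '...'."""
    if len(line) <= width:
        return line
    return line[: max(width - 3, 0)] + "..."


def json_to_report(json_input: str, report_max_width: int = 100) -> str:
    leaves = list(flatten(json.loads(json_input)))
    key_width = max((len(path) for path, _ in leaves), default=0)
    lines = ["REPORT", "=" * min(report_max_width, 40)]
    lines += [f"{path.ljust(key_width)} | {value}" for path, value in leaves]
    # Booleans are ints in Python, so exclude them from the numeric stats.
    numbers = [v for _, v in leaves
               if isinstance(v, (int, float)) and not isinstance(v, bool)]
    if numbers:
        lines.append(f"numeric fields: {len(numbers)}, min={min(numbers)}, "
                     f"max={max(numbers)}, mean={mean(numbers):.2f}")
    return "\n".join(clip(line, report_max_width) for line in lines)
```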

Converter 4 - Log Input

| Field | Type | Default |
|-------|------|---------|
| log_input | textarea | 10 sample app log lines |

Converter 5 - Form + API Config

| Field | Type | Default |
|-------|------|---------|
| form_input | textarea | Sample user registration fields |
| field_mapping | textarea | Dot-notation mapping to nested API paths |
| api_base_url | string | https://api.example.com |
| api_endpoint | string | /v2/users/register |
| api_method | select | POST |
| api_auth_type | select | bearer |
| validation_rules | textarea | Sample required/min/max rules |
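
The following sketch shows how a dot-notation mapping could turn flat form fields into a nested request body. The exact JSON shape of field_mapping and validation_rules is not documented here, so the shapes used below (form field name to dotted path, and per-field required/min/max) are assumptions, as are the function names and the "<TOKEN>" placeholder.

```python
def set_path(target: dict, dotted_path: str, value) -> None:
    """Write value into nested dicts along a dot-separated path."""
    keys = dotted_path.split(".")
    node = target
    for key in keys[:-1]:
        node = node.setdefault(key, {})
    node[keys[-1]] = value


def validate(form: dict, rules: dict) -> list:
    """Check assumed required/min/max rules and return error strings."""
    errors = []
    for field, rule in rules.items():
        value = form.get(field)
        if value in (None, ""):
            if rule.get("required"):
                errors.append(f"{field}: required")
            continue
        # Strings are measured by length, numbers by value.
        size = len(value) if isinstance(value, str) else value
        if "min" in rule and size < rule["min"]:
            errors.append(f"{field}: below minimum {rule['min']}")
        if "max" in rule and size > rule["max"]:
            errors.append(f"{field}: above maximum {rule['max']}")
    return errors


def build_request(form: dict, field_mapping: dict, validation_rules=None,
                  api_base_url="https://api.example.com",
                  api_endpoint="/v2/users/register",
                  api_method="POST", api_auth_type="bearer",
                  token="<TOKEN>") -> dict:
    body = {}
    for field, value in form.items():
        # Unmapped fields keep their own name at the top level (an assumption).
        set_path(body, field_mapping.get(field, field), value)
    headers = {"Content-Type": "application/json"}
    if api_auth_type == "bearer":
        headers["Authorization"] = f"Bearer {token}"
    return {
        "method": api_method.upper(),
        "url": api_base_url.rstrip("/") + "/" + api_endpoint.lstrip("/"),
        "headers": headers,
        "body": body,
        "validation_errors": validate(form, validation_rules or {}),
    }
```

Returning validation errors alongside the payload, rather than raising, lets a caller inspect the request even when the form is incomplete.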

Output

Results are pushed to the default dataset as structured JSON. Key outputs are also saved to the key-value store as plain text:

| Converter | KV Store Key | Content |
|-----------|--------------|---------|
| html_to_clean_text | clean_text.txt | Cleaned plain text |
| html_to_article_summary | summary.txt | Formatted summary |
| json_to_report | report.txt | ASCII-formatted report |

Project Structure

apify-smart-converters/
├── .actor/
│   ├── actor.json           # Actor metadata & config
│   └── input_schema.json    # Full input schema with defaults
├── src/
│   ├── __init__.py
│   ├── main.py              # Actor entry point & routing
│   └── converters.py        # All 5 converter classes
├── Dockerfile
├── requirements.txt
└── README.md

Running Locally

# Install dependencies
pip install -r requirements.txt
# Set input via environment or INPUT.json
echo '{"converter_type": "html_to_clean_text"}' > INPUT.json
# Run
python -m src.main

Supported Log Formats (Converter 4)

The log parser auto-detects:

  • Apache Combined: 127.0.0.1 - - [10/Oct/2000:13:55:36 -0700] "GET /index.html ..."
  • Nginx Error: 2024/06/15 08:30:02 [error] 1234#0: ...
  • Syslog: Jun 15 08:30:01 myhost sshd[1234]: ...
  • Generic App: 2024-06-15T08:30:01.123Z [INFO] [Source] message
  • JSON Lines: {"timestamp": "...", "level": "INFO", "message": "..."}
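
Auto-detection over the formats listed above could work by trying one pattern per format on each line. The sketch below is an approximation under that assumption: the regular expressions, the format labels, and the function names are illustrative and are not taken from the actor's source.

```python
import json
import re
from collections import Counter

# One illustrative pattern per format; the first match wins.
PATTERNS = [
    ("apache_combined", re.compile(
        r'^(?P<ip>\S+) \S+ \S+ \[(?P<timestamp>[^\]]+)\] '
        r'"(?P<method>[A-Z]+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) (?P<size>\S+)')),
    ("nginx_error", re.compile(
        r"^(?P<timestamp>\d{4}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) "
        r"\[(?P<level>\w+)\] (?P<pid>\d+)#(?P<tid>\d+): (?P<message>.*)$")),
    ("syslog", re.compile(
        r"^(?P<timestamp>[A-Z][a-z]{2}\s+\d{1,2} \d{2}:\d{2}:\d{2}) (?P<host>\S+) "
        r"(?P<process>[^\[:\s]+)(?:\[(?P<pid>\d+)\])?: (?P<message>.*)$")),
    ("generic_app", re.compile(
        r"^(?P<timestamp>\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(?:\.\d+)?Z?) "
        r"\[(?P<level>\w+)\] \[(?P<source>[^\]]+)\] (?P<message>.*)$")),
]


def parse_line(line: str) -> dict:
    """Return a dict with a 'format' key plus whatever fields were recognised."""
    line = line.strip()
    if line.startswith("{"):
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            record = None
        if isinstance(record, dict):
            return {**record, "format": "json_lines"}
    for name, pattern in PATTERNS:
        match = pattern.match(line)
        if match:
            return {"format": name, **match.groupdict()}
    return {"format": "unknown", "raw": line}


def parse_logs(log_input: str) -> dict:
    """Parse every non-empty line and count entries per detected format."""
    entries = [parse_line(line) for line in log_input.splitlines() if line.strip()]
    by_format = dict(Counter(entry["format"] for entry in entries))
    return {"entries": entries, "summary": {"total": len(entries), "by_format": by_format}}
```

Lines that match no pattern are kept with format "unknown" so nothing is silently dropped from the summary counts.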

License

MIT