Superclean Urls

Pricing

from $0.35 / 1,000 results

Clean messy URLs from lead exports. Remove 60+ tracking parameters (utm_*, fbclid, gclid), normalize format, extract domains, and optionally verify URLs are reachable. Perfect for cold email personalization and CRM data hygiene.

Clean messy URLs from lead exports. Remove tracking parameters, normalize format, and extract domains. Optimized for cold email personalization workflows.


Features

  • Remove tracking parameters - Strip UTM, fbclid, gclid, and 50+ other tracking params
  • Normalize format - Consistent protocol, lowercase host, and no trailing slash (for example, http://ACME.COM/about/ becomes https://acme.com/about)
  • Extract domains - Pull clean domain names from full URLs
  • Validate URLs - Identify and flag invalid URL formats
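
The normalization feature can be illustrated with a minimal Python sketch. This is not the actor's implementation; the function name `normalize` and the exact rules (add https:// when no protocol is present, upgrade http to https, lowercase the host, drop trailing slashes) are assumptions inferred from the examples on this page.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize(url: str, force_https: bool = True) -> str:
    """Hypothetical normalizer: consistent protocol, lowercase host, no trailing slash."""
    candidate = url.strip()
    # Assumption: a URL without a protocol gets https:// prepended.
    if "://" not in candidate:
        candidate = "https://" + candidate
    parts = urlsplit(candidate)
    # urlsplit already lowercases the scheme, so comparing against "http" is safe.
    scheme = "https" if force_https and parts.scheme == "http" else parts.scheme
    # Lowercases the whole network location (host, plus port or userinfo if present).
    host = parts.netloc.lower()
    path = parts.path.rstrip("/")
    return urlunsplit((scheme, host, path, parts.query, parts.fragment))
```

With these assumptions, `normalize("http://ACME.COM/about/")` gives `https://acme.com/about`, which matches the second row of the example output.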

Input Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| items | array | Yes | - | List of URLs to clean |
| style | string | No | full | Output format: full (cleaned URL) or domain (domain only) |
| forceHttps | boolean | No | true | Convert http:// to https:// |
| removeTracking | boolean | No | true | Remove tracking parameters (utm_*, fbclid, etc.) |
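
As a rough illustration of how the parameters fit together, the Python sketch below assembles an input object with the documented names and defaults. The helper `build_run_input` and its validation rules are hypothetical; the actor itself may validate input differently.

```python
from typing import Any

ALLOWED_STYLES = {"full", "domain"}


def build_run_input(
    items: list[str],
    style: str = "full",
    force_https: bool = True,
    remove_tracking: bool = True,
) -> dict[str, Any]:
    """Hypothetical helper: build an input dict using the documented keys and defaults."""
    # Assumption: an empty list is rejected because items is marked as required.
    if not items:
        raise ValueError("items is required and must contain at least one URL")
    if style not in ALLOWED_STYLES:
        raise ValueError(f"style must be one of {sorted(ALLOWED_STYLES)}")
    return {
        "items": list(items),
        "style": style,
        "forceHttps": force_https,
        "removeTracking": remove_tracking,
    }
```

The returned dictionary has the same shape as the JSON input in the Example section, so it could be serialized with `json.dumps` and submitted as the run input.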

Output Styles

  • full (default) - Full cleaned URL in output field
  • domain - Just the domain name in output field

Component fields (protocol, path, query, hash) are always included regardless of style.


Output Fields

| Field | Type | Description |
| --- | --- | --- |
| id | integer | Row number (1-based) |
| input | string | Original URL |
| output | string | Cleaned result (format depends on style) |
| domain | string | Extracted registrable domain (e.g., example.com) |
| protocol | string | Protocol (http or https) |
| path | string | URL path (e.g., /about/contact) |
| query | string | Query string without ? (e.g., foo=bar&baz=1) |
| hash | string | Fragment/anchor without # |
| valid | boolean | Whether the URL format is valid |
| confidence | number | Confidence score (0-1) |
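
To make the component fields concrete, here is a small Python sketch that derives them from an absolute URL with the standard library. The function `split_components` is hypothetical, and its domain logic is a deliberate simplification: it only strips a leading "www.", whereas a true registrable domain (for hosts such as example.co.uk) requires the Public Suffix List.

```python
from urllib.parse import urlsplit


def split_components(url: str) -> dict[str, str]:
    """Hypothetical helper: split an absolute URL into the component fields."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    # Simplification: strip only a leading "www."; this is not a full
    # registrable-domain lookup.
    domain = host[4:] if host.startswith("www.") else host
    return {
        "domain": domain,
        "protocol": parts.scheme,
        "path": parts.path,
        "query": parts.query,    # urlsplit omits the leading "?"
        "hash": parts.fragment,  # urlsplit omits the leading "#"
    }
```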

Confidence Scoring

  • 1.0 - Valid URL, no changes needed
  • 0.9 - Valid URL, tracking removed or normalized
  • 0.7 - URL fixed (protocol added)
  • 0.3 - Partially valid (domain extracted but issues remain)
  • 0.0 - Invalid URL (couldn't parse)
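
The tiers above can be read as a simple decision ladder. The Python sketch below is one possible interpretation, not the actor's scoring code: the function name, the four boolean inputs, and the order in which the rules are checked are all assumptions. It does reproduce the confidence values shown in the Example section.

```python
def confidence(
    valid: bool,
    domain_extracted: bool,
    protocol_added: bool,
    modified: bool,
) -> float:
    """Hypothetical mapping from cleaning outcome to the documented tiers."""
    if not valid:
        # Assumption: "partially valid" means parsing failed but a domain was recovered.
        return 0.3 if domain_extracted else 0.0
    if protocol_added:
        # Assumption: a missing protocol outranks other changes.
        return 0.7
    if modified:
        # Tracking parameters removed or format normalized.
        return 0.9
    return 1.0
```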

Example

Input

{
  "items": [
    "https://www.example.com/?utm_source=linkedin&fbclid=abc123",
    "http://ACME.COM/about/",
    "example.com/contact",
    "not a valid url"
  ],
  "style": "full"
}

Output (protocol, path, query, and hash fields not shown)

[
  {
    "id": 1,
    "input": "https://www.example.com/?utm_source=linkedin&fbclid=abc123",
    "output": "https://www.example.com",
    "domain": "example.com",
    "valid": true,
    "confidence": 0.9
  },
  {
    "id": 2,
    "input": "http://ACME.COM/about/",
    "output": "https://acme.com/about",
    "domain": "acme.com",
    "valid": true,
    "confidence": 0.9
  },
  {
    "id": 3,
    "input": "example.com/contact",
    "output": "https://example.com/contact",
    "domain": "example.com",
    "valid": true,
    "confidence": 0.7
  },
  {
    "id": 4,
    "input": "not a valid url",
    "output": "not a valid url",
    "domain": "",
    "valid": false,
    "confidence": 0
  }
]

Use Cases

  • Cold email personalization - Clean company URLs before using in email templates
  • Lead enrichment - Normalize URLs from scraped or imported lead lists
  • Data hygiene - Remove tracking params before storing in CRM
  • Domain extraction - Pull domains for company research or deduplication
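
For the deduplication use case, the output rows can be post-processed in a few lines. The Python sketch below is a hypothetical helper that keeps the first occurrence of each domain among rows that are valid and meet a confidence threshold; the default threshold of 0.7 is an arbitrary choice for illustration, not a recommendation from the actor.

```python
def unique_domains(rows: list[dict], min_confidence: float = 0.7) -> list[str]:
    """Hypothetical helper: ordered, de-duplicated domains from output rows."""
    seen: set[str] = set()
    result: list[str] = []
    for row in rows:
        # Skip rows flagged invalid or scored below the threshold.
        if not row.get("valid") or row.get("confidence", 0) < min_confidence:
            continue
        domain = row.get("domain", "")
        if domain and domain not in seen:
            seen.add(domain)
            result.append(domain)
    return result
```

Applied to the four rows in the Example section, this returns `["example.com", "acme.com"]`.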

Tracking Parameters Removed

The actor removes 60+ tracking parameters including:

  • UTM - utm_source, utm_medium, utm_campaign, utm_term, utm_content
  • Facebook - fbclid, fb_action_ids, fb_source
  • Google - gclid, gclsrc, dclid, gbraid, wbraid
  • Microsoft - msclkid
  • LinkedIn - li_fat_id, li_tc
  • Email marketing - mc_eid, mc_cid, _hsenc, _hsmi, mkt_tok
  • Analytics - _ga, _gl, ref, spm, clickid
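
As an illustration of the technique, the Python sketch below filters query parameters against the names listed above using only the standard library. It is not the actor's code: the `TRACKING_PARAMS` set covers only the parameters named on this page (not the full 60+), and matching `utm_` as a prefix is an assumption based on the `utm_*` notation.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Only the non-UTM names listed on this page; the actor's full list is longer.
TRACKING_PARAMS = {
    "fbclid", "fb_action_ids", "fb_source",
    "gclid", "gclsrc", "dclid", "gbraid", "wbraid",
    "msclkid",
    "li_fat_id", "li_tc",
    "mc_eid", "mc_cid", "_hsenc", "_hsmi", "mkt_tok",
    "_ga", "_gl", "ref", "spm", "clickid",
}


def strip_tracking(url: str) -> str:
    """Hypothetical helper: drop known tracking parameters, keep everything else."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        # Assumption: any parameter starting with "utm_" counts as tracking.
        if not key.lower().startswith("utm_") and key.lower() not in TRACKING_PARAMS
    ]
    # urlencode re-encodes the surviving pairs, so escaping may be normalized.
    return urlunsplit(parts._replace(query=urlencode(kept)))
```

Parameters that are not on the list, such as `plan=pro`, pass through unchanged, so functional query strings are preserved.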

Built by Superlative — Clean data in. Better emails out.