Smart Article Extractor
📰 Smart Article Extractor extracts articles from any scientific, academic, or news website with just one click. The extractor crawls the whole website and automatically distinguishes articles from other web pages. Download your data as an HTML table, JSON, Excel, RSS feed, and more.
Hi Lukas,
For some reason, the crawler reports that this article has 19 words, which is not true: https://community.aws/content/2eYoqeFRqaVnk900emsknDfzhfW/the-ultimate-cheat-sheet-for-using-amazon-q-developer-in-your-ide
Is there something I could do?
Hello,
Thanks for the report. Sadly, the automatic extraction is not perfect, and this article has a somewhat non-standard structure.
There is a way to override the parser using the Extend Output Function, like this:

($) => {
    const result = {};
    // Take the article text from the matched elements instead of the automatic parser
    result.text = $('article div, article h2, article h3').text();

    return result;
}
See this run, although the result is still not perfect: https://console.apify.com/view/runs/fgyTfOIjyt5chg8yT
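One possible refinement, as a minimal sketch rather than a confirmed fix: calling `.text()` on the whole matched set concatenates every element's text with no separator, and a wrapper `div` that contains other matched elements contributes the same text again. The version below collects the text block by block instead. It assumes `$` is a jQuery-style (Cheerio-compatible) handle, as in the snippet above; `joinBlocks`, `countWords`, and the `wordCount` field are hypothetical names introduced only for illustration, and text placed directly inside a wrapper `div` would be dropped.

```javascript
// Selector taken from the reply above; adjust it to the page's markup.
const SELECTOR = 'article div, article h2, article h3';
// Same element types without the `article` prefix, used for nested lookups.
const INNER_SELECTOR = 'div, h2, h3';

// Hypothetical helper: collapse whitespace inside each block, drop empty
// blocks, and join the remaining blocks with newlines.
const joinBlocks = (blocks) =>
    blocks
        .map((block) => block.replace(/\s+/g, ' ').trim())
        .filter((block) => block.length > 0)
        .join('\n');

// Hypothetical helper: count whitespace-separated words.
const countWords = (text) => text.split(/\s+/).filter(Boolean).length;

// Sketch of an Extend Output Function built on the helpers above.
const extendOutputFunction = ($) => {
    const blocks = $(SELECTOR)
        .toArray()
        // Assumption: an element that contains another matched element is
        // only a wrapper, so skipping it avoids counting nested text twice.
        .filter((el) => $(el).find(INNER_SELECTOR).length === 0)
        .map((el) => $(el).text());

    const result = {};
    result.text = joinBlocks(blocks);
    // Hypothetical extra field, handy for checking the reported word count.
    result.wordCount = countWords(result.text);
    return result;
};
```

The helpers and the function body would need to be pasted together into the Extend Output Function field as a single `($) => { ... }` function, as in the original snippet. Whether the selectors fit this particular page is untested.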
Actor Metrics
199 monthly users
65 stars
>99% runs succeeded
1.2 days response time
Created in Nov 2019
Modified 4 months ago