Smart Article Extractor

Pricing

Pay per usage

Developed by

Lukáš Křivka

Maintained by Apify

📰 Smart Article Extractor extracts articles from any scientific, academic, or news website with just one click. The extractor crawls the whole website and automatically distinguishes articles from other web pages. Download your data as an HTML table, JSON, Excel, RSS feed, and more.

4.7 (6)

124

Total users: 5.1K

Monthly users: 383

Runs succeeded: >99%

Issues response: 7.7 days

Last modified: 2 months ago

Took too much time, probably glitched

Closed

facelessaiagency opened this issue
24 days ago

It glitched for 2 hours, excuse me, wtf?

facelessaiagency

24 days ago

(SOLVED) This Actor tends to "glitch" or get stuck on some articles, but only very rarely. I worked around it by setting the timeout to 300 seconds. The default timeout of 604,800 seconds (168 hours) is INSANE and shouldn't be that big...
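For readers who start runs through the Apify API rather than the Console, the workaround above can be sketched roughly as follows. This is a minimal sketch, not the poster's actual setup: it only checks the arithmetic behind the quoted default and builds the run URL with the API's `timeout` query parameter (in seconds). The actor ID used in the usage note is a hypothetical placeholder, and the request itself (with an API token) is left out.

```python
from urllib.parse import urlencode

# Values quoted in this thread.
DEFAULT_TIMEOUT_SECS = 604_800   # default run timeout mentioned by the poster
WORKAROUND_TIMEOUT_SECS = 300    # timeout the poster switched to


def secs_to_hours(secs: int) -> float:
    """Convert seconds to hours."""
    return secs / 3600


def build_run_url(actor_id: str, timeout_secs: int) -> str:
    """Build the 'run Actor' endpoint URL with an explicit run timeout.

    `actor_id` uses the API form 'username~actor-name'; the value passed
    in by the caller below is a placeholder, not a confirmed ID.
    """
    query = urlencode({"timeout": timeout_secs})
    return f"https://api.apify.com/v2/acts/{actor_id}/runs?{query}"


if __name__ == "__main__":
    print(secs_to_hours(DEFAULT_TIMEOUT_SECS))  # 168.0 hours, i.e. 7 days
    # Hypothetical actor ID, for illustration only.
    print(build_run_url("some-user~some-actor", WORKAROUND_TIMEOUT_SECS))
```

A POST to the resulting URL would start a run that the platform stops after 300 seconds. Whether 300 seconds is enough depends on how many pages the run is expected to crawl.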

lukaskrivka

Hello,

Thanks for the report. Indeed, this is a very weird page that got the Actor stuck. It is quite hard to fix because the Actor gets stuck during HTML parsing, which cannot be easily intercepted. We haven't seen this problem before in the few years this Actor has existed, so we will see whether it recurs on other pages.

The timeout is large because users often scrape entire news portals.
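To illustrate why a hang inside a blocking parse step is hard to intercept from within the same process, and why an outer timeout is the practical guard, here is a minimal sketch in Python. It is not how the Actor is implemented. The helper name `run_with_hard_timeout` is hypothetical, and the "stuck parser" is simulated with an infinite loop. The work runs in a child process so the parent can kill it once the time limit passes.

```python
import subprocess
import sys


def run_with_hard_timeout(code: str, timeout_secs: float) -> bool:
    """Run `code` in a child Python process.

    Returns True if the child finished in time, False if it had to be
    killed because it exceeded `timeout_secs`.
    """
    try:
        # subprocess.run kills the child itself when the timeout expires,
        # then raises TimeoutExpired in the parent.
        subprocess.run(
            [sys.executable, "-c", code],
            timeout=timeout_secs,
            check=True,
        )
        return True
    except subprocess.TimeoutExpired:
        return False


if __name__ == "__main__":
    # A well-behaved "parse" finishes; a stuck one is killed after 1 second.
    print(run_with_hard_timeout("pass", 10))
    print(run_with_hard_timeout("while True: pass", 1))
```

The trade-off matches the one in this thread: a short limit stops a stuck page quickly, while a long one leaves room for legitimately large jobs.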