Pricing

Pay per usage

Smart Article Extractor

📰 Smart Article Extractor extracts articles from any scientific, academic, or news website with just one click. The extractor crawls the whole website and automatically distinguishes articles from other web pages. Download your data as an HTML table, JSON, Excel, RSS feed, and more.

Rating: 4.1 (9)

Developer: Lukáš Křivka

Actor stats

Bookmarked: 190

Total users: 7.6K

Monthly active users: 322

Last modified: 18 hours ago

2024-03-21

Features

Add navigationWaitUntil input option for browser to allow faster or slower loading depending on the use-case

2023-09-12

Features

Add maxArticlesPerStartUrl to input to limit the number of articles per start URL
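
To illustrate what a per-start-URL limit means in practice, here is a minimal sketch that counts accepted articles per start URL and drops the rest. It is not the actor's own code: the helper name `cap_articles` and the record shape (dicts carrying a `startUrl` key) are assumptions made for this example.

```python
from collections import defaultdict


def cap_articles(articles, max_per_start_url):
    """Keep at most max_per_start_url articles for each start URL, in input order."""
    kept = []
    counts = defaultdict(int)
    for article in articles:
        # Assumed record shape: each article dict carries the start URL it came from.
        start_url = article["startUrl"]
        if counts[start_url] < max_per_start_url:
            counts[start_url] += 1
            kept.append(article)
    return kept
```

Counting per start URL rather than globally means one large site cannot use up the quota of the others.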

2023-08-03

Features

Add onlyArticlesForLastDays to input for easier dynamic date filtering
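
The point of a "last N days" filter is that the cutoff is recomputed on every run instead of being a fixed date. A minimal sketch of that idea follows; the function name `is_recent` and the use of timezone-aware datetimes are assumptions for the example, not the actor's implementation.

```python
from datetime import datetime, timedelta, timezone


def is_recent(article_date, last_days, now=None):
    """Return True if article_date falls within the last `last_days` days."""
    # The cutoff moves with the current time, so scheduled runs need no input changes.
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(days=last_days)
    return article_date >= cutoff
```

Passing `now` explicitly is only there to make the behavior reproducible in tests.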

2023-03-27

Changes

snapshotUrls output has been replaced by screenshotUrl
extendOutputFunction is run after all fields were assigned for full control

Fixes

extendOutputFunction now correctly works with undefined fields for browser

2023-03-20

Features

Add crawlWholeSubdomain to input so you don't need to set pseudoUrls or linkSelector
Add onlySubdomainArticles to input to limit articles and enqueueing to the subdomain of the start URL
Add saveHtmlAsLink to input to save HTML of articles as a link in the output
Add referrer, startUrl and depth to output
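
One plausible way to decide whether a link stays on the subdomain of the start URL is to compare hostnames. The sketch below assumes that "same subdomain" means an exact hostname match; how the actor actually defines it is not stated here, and `same_subdomain` is a hypothetical helper.

```python
from urllib.parse import urlsplit


def same_subdomain(start_url, candidate_url):
    """Return True if both URLs have exactly the same hostname (case-insensitive)."""
    # urlsplit(...).hostname is lowercased, and None when the URL has no host part.
    start_host = urlsplit(start_url).hostname
    candidate_host = urlsplit(candidate_url).hostname
    return start_host is not None and start_host == candidate_host
```

Under this assumption, a crawl started at blog.example.com would skip links to www.example.com.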

2023-03-01

Features

Update SDK to version 3

2022-10-13

Features

Deprecate the saveSnapshotsOfInvalidArticles input field in favor of the new saveSnapshots input field, which saves snapshots for all articles.
Deprecate pageWaitSelector and add pageWaitSelectorCategory and pageWaitSelectorArticle inputs instead

2022-09-29

Features

Added infinite scroll feature for browsers with 3 inputs: scrollToBottom, scrollToBottomButtonSelector, scrollToBottomMaxSecs
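
The general shape of a time-limited infinite scroll can be sketched without a browser: keep scrolling until the page height stops growing or the time budget runs out. This is an assumption about how such a feature typically works, not the actor's code; `get_height` and `scroll_step` are hypothetical callables standing in for browser operations, and `max_secs` plays the role of scrollToBottomMaxSecs.

```python
import time


def scroll_to_bottom(get_height, scroll_step, max_secs, clock=time.monotonic):
    """Scroll until the page height stops growing or max_secs elapse; return the scroll count."""
    deadline = clock() + max_secs
    scrolls = 0
    last_height = get_height()
    while clock() < deadline:
        scroll_step()
        scrolls += 1
        height = get_height()
        if height == last_height:
            break  # nothing new was loaded, so assume the bottom was reached
        last_height = height
    return scrolls
```

In a real browser run, `scroll_step` would also need to wait for content to load, and a "load more" button (the case scrollToBottomButtonSelector appears to cover) would be clicked there.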

2022-09-21

Features

Nicer messages explaining why an article was marked as invalid
Added saveSnapshotsOfInvalidArticles option to input

2021-06-17

Features

Added enqueueFromArticles option to enqueue articles from article pages to get even more articles from the website. You need to enable it in input.
Added scanSitemaps and sitemapUrls parameters. scanSitemaps automatically searches sitemaps for articles for each start URL, and sitemapUrls allows you to add the sitemaps manually if necessary. Be careful: scanSitemaps may dump a huge number of (sometimes old) article URLs into the scraping process.
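
To see why a sitemap can flood a crawl with old URLs, it helps to look at what a sitemap contains. The sketch below reads a standard sitemap `<urlset>` with Python's standard library and returns each URL with its optional `lastmod` value. It is an illustration of the sitemap format only; `sitemap_urls` is a hypothetical helper and says nothing about how the actor parses sitemaps.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(xml_text):
    """Return (loc, lastmod) pairs from a sitemap <urlset>; lastmod is None when absent."""
    root = ET.fromstring(xml_text)
    entries = []
    for url_el in root.findall("sm:url", SITEMAP_NS):
        loc = url_el.findtext("sm:loc", default=None, namespaces=SITEMAP_NS)
        lastmod = url_el.findtext("sm:lastmod", default=None, namespaces=SITEMAP_NS)
        if loc:
            entries.append((loc.strip(), lastmod.strip() if lastmod else None))
    return entries
```

Because `lastmod` is optional, many entries carry no date at all, which is one reason old articles are hard to exclude at the sitemap stage.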

2021-03-12

Fixes

onlyNewArticles and onlyNewArticlesPerDomain were loading duplicate items, which caused excess dataset reads.

2021-03-31

Features

Added new input option onlyNewArticlesPerDomain. This is a much more efficient way to deduplicate articles, so use it instead of onlyNewArticles.
onlyNewArticlesPerDomain also works on local datasets
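
A rough sketch of per-domain deduplication: previously seen URLs are grouped by hostname, so a run only has to consult the group for the domain it is scraping. The state layout (a dict mapping hostname to a set of URLs) and the helper name `filter_new_articles` are assumptions for illustration; the actor's real storage format is not described here.

```python
from urllib.parse import urlsplit


def filter_new_articles(article_urls, seen_by_domain):
    """Return URLs not seen before and record them in seen_by_domain (hostname -> set of URLs)."""
    new_urls = []
    for url in article_urls:
        domain = urlsplit(url).hostname or ""
        seen = seen_by_domain.setdefault(domain, set())
        if url not in seen:
            seen.add(url)
            new_urls.append(url)
    return new_urls
```

Keeping `seen_by_domain` between runs is what makes the second run return only articles that were not scraped before.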

2021-01-21

Fix: Now works with Start URLs from a public spreadsheet

2020-09-28

Upgraded from Apify version 0.21.0, which sometimes crashed at the start of the run
Added currentItem param to extendOutputFunction
Improved logs
Increased request timeouts to work better on very slow sites

2020-07-07

Added option to run with browser (Puppeteer)
Added option to wait for page load or for selector (browser only)
Added articleUrls as an input option to parse the given article pages directly
