![Smart Article Extractor avatar](https://images.apifyusercontent.com/pF3q3fioe8IKJon6EjK4LJdR1BNiFjSiTBnagUQbPLA/rs:fill:250:250/cb:1/aHR0cHM6Ly9hcGlmeS1pbWFnZS11cGxvYWRzLXByb2QuczMuYW1hem9uYXdzLmNvbS9oeTVUWWlDQndROW84dVJLRy9DRDIyeHVTNXg0V3N3REdBbi10eXBlLWFydGljbGUucG5n.webp)
Smart Article Extractor
No credit card required
![Smart Article Extractor](https://images.apifyusercontent.com/pF3q3fioe8IKJon6EjK4LJdR1BNiFjSiTBnagUQbPLA/rs:fill:250:250/cb:1/aHR0cHM6Ly9hcGlmeS1pbWFnZS11cGxvYWRzLXByb2QuczMuYW1hem9uYXdzLmNvbS9oeTVUWWlDQndROW84dVJLRy9DRDIyeHVTNXg0V3N3REdBbi10eXBlLWFydGljbGUucG5n.webp)
Smart Article Extractor
No credit card required
📰 Smart Article Extractor extracts articles from any scientific, academic, or news website with just one click. The extractor crawls the whole website and automatically distinguishes articles from other web pages. Download your data as HTML table, JSON, Excel, RSS feed, and more.
How do i get article with certain keywords?
What do i need to modify in the actor inputs? Thanks!
![milunnn avatar](https://www.gravatar.com/avatar/e2744c89aa15dec7f2c8ced2f53a6d59?d=https%3A%2F%2Fcdn.apify.com%2Fimg%2Ficons%2Fanonymous_user_picture.png)
Hi,
Could you please clarify what do you wish to accomplish? Do you already have sites from which to scrape, but just need a keyword filter for the articles?
BSD_24
Hi, sorry I just read your response. Suppose that I want to scrape from 2 sites, like newsweb.com and newsb.com. From each of these sites, I want to find articles only with certain keywords. For example in newsweb.com I want to get articles with "football" keyword. I tried filtering the raw data externally in python, but it still makes each scraping process highly expensive. Thanks
![ondrejklinovsky avatar](https://apify-image-uploads-prod.s3.amazonaws.com/BystFRdYdjKhJvYvJ/ep744ynpmgG7GX3aJ-profile-1000x1000.png)
Ondrej Klinovský (ondrejklinovsky)
Hey,
unfortunately this actor can't do that. But you could use Google Search Results Scraper for that. See this example run: I used site:
and intext:
search operators to specify site and what word should the page contain: "site:cnn.com intext:football"
. You can then pass the results to this actor.
BSD_24
Hi, I have tried that and its working pretty good! Thanks for the info!
Actor Metrics
308 monthly users
-
87 stars
>99% runs succeeded
15 days response time
Created in Nov 2019
Modified 2 months ago