Reddit Scraper Lite

Pay $4.00 for 1,000 results

trudax/reddit-scraper-lite

Pay Per Result, unlimited Reddit web scraper to crawl posts, comments, communities, and users without login. Limit web scraping by number of posts or items and extract all data in a dataset in multiple formats.

Scraping outside of the URL?

Closed

chinolex opened this issue
6 months ago

I don't think this Reddit scraper is working right.

  1. Here are the URLs I wanted to scrape: https://console.apify.com/actors/oAuCIx3ItNrs2okjQ/runs/02ZQthreq4q3wymAB#output
  2. Here's what I tried: https://chat.openai.com/share/5fbfa1ba-3fa3-413c-8814-5ae4385694ea
  3. The problem is that a) it's getting really expensive and I don't know how to turn it off, and b) it's scraping things outside of the URL inputs I set.

chinolex

6 months ago

Following up here, thanks! Let me know if there is anything I can help with.

User avatar

If you use a search URL, it will go through each page of the results until no more results are found or it reaches the limit specified in the input. Do you have an example of something that was scraped outside of the scope you wanted?
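
To illustrate the behavior described in this reply, here is a minimal sketch of "follow each results page until a page is empty or the limit is reached". It is not the Actor's actual code: `paginate`, `fetch_page`, `max_items`, and the fake data are hypothetical names made up for the example.

```python
from typing import Callable, Iterator, List


def paginate(fetch_page: Callable[[int], List[dict]], max_items: int) -> Iterator[dict]:
    """Yield results page by page until a page is empty or max_items is reached."""
    yielded = 0
    page = 0
    while yielded < max_items:
        results = fetch_page(page)
        if not results:  # no more results on this page: stop paginating
            break
        for item in results:
            if yielded >= max_items:  # limit reached in the middle of a page
                return
            yield item
            yielded += 1
        page += 1


# Fake search backend (an assumption for the demo): 3 pages of 2 posts each.
FAKE_PAGES = [[{"id": 1}, {"id": 2}], [{"id": 3}, {"id": 4}], [{"id": 5}, {"id": 6}]]


def fake_fetch(page: int) -> List[dict]:
    return FAKE_PAGES[page] if page < len(FAKE_PAGES) else []


limited = list(paginate(fake_fetch, max_items=3))      # stops at the limit
unlimited = list(paginate(fake_fetch, max_items=100))  # stops when results run out
```

With a low limit the loop stops early; with a high limit it keeps going until the search has no more pages, which is why a broad search term can produce (and bill for) many results.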

User avatar

Also, you can abort a run using Apify's user interface, which will stop the current run.
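
Besides the user interface mentioned above, a run can likely also be aborted programmatically. The sketch below only builds the HTTP request and does not send it; it assumes the public Apify API abort endpoint (`POST /v2/actor-runs/{runId}/abort`) and bearer-token authentication. The function name and the placeholder run ID and token are hypothetical.

```python
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_abort_request(run_id: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST request that asks Apify to abort a run."""
    return urllib.request.Request(
        f"{API_BASE}/actor-runs/{run_id}/abort",
        method="POST",
        headers={"Authorization": f"Bearer {token}"},
    )


# Placeholders: substitute a real run ID and API token before sending.
request = build_abort_request("RUN_ID_PLACEHOLDER", "APIFY_TOKEN_PLACEHOLDER")
# urllib.request.urlopen(request)  # uncomment to actually send the request
```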

chinolex

6 months ago

I entered specific subreddit URLs with a search, specific subreddits, and a site-wide search for "solar". Can you take a look at my URLs?

User avatar

One of your search terms is just solar, so it is returning any post or comment with that term in it.

chinolex

6 months ago

Got it. What about the subreddit + solar search? Does it scrape all the posts in that subreddit that match? I'm not sure if that works.

User avatar

If you are using the subreddit + solar search as a startUrl, it will work, since it paginates the results of that URL.
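
As a sketch of the difference between the two kinds of search discussed in this thread, the snippet below builds a subreddit-scoped search URL and a site-wide one. It assumes Reddit's `restrict_sr=1` query parameter for limiting a search to one subreddit; the subreddit name, the helper names, and the `start_urls` input shape are hypothetical, and whether the Actor honors `restrict_sr` is not confirmed in this thread.

```python
from urllib.parse import urlencode


def subreddit_search_url(subreddit: str, query: str) -> str:
    """Search URL restricted to a single subreddit."""
    params = urlencode({"q": query, "restrict_sr": 1})
    return f"https://www.reddit.com/r/{subreddit}/search/?{params}"


def sitewide_search_url(query: str) -> str:
    """Search URL covering all of Reddit (matches any post with the term)."""
    return f"https://www.reddit.com/search/?{urlencode({'q': query})}"


# Hypothetical input: one scoped start URL instead of a bare "solar" search.
start_urls = [{"url": subreddit_search_url("example_subreddit", "solar")}]
```

A bare site-wide search for "solar" corresponds to the second helper, which would explain results from outside the intended subreddits.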

chinolex

6 months ago

It will work… great!

Developer
Maintained by Community
Actor metrics
  • 133 monthly users
  • 99.8% runs succeeded
  • 0.51 days response time
  • Created in Jun 2020
  • Modified about 6 hours ago