Reddit Scraper Lite avatar
Reddit Scraper Lite
Try for free

Pay $4.00 for 1,000 results

View all Actors
Reddit Scraper Lite

Reddit Scraper Lite

trudax/reddit-scraper-lite
Try for free

Pay $4.00 for 1,000 results

Pay Per Result, unlimited Reddit web scraper to crawl posts, comments, communities, and users without login. Limit web scraping by number of posts or items and extract all data in a dataset in multiple formats.

User avatar

Reddit scraper isn't working

Closed

researcher_1999 opened this issue
a year ago

The paid version of the Reddit scraper isn't working. The free version works, though.

User avatar

I don't see any issues with the paid version. Can you give me more details about the error you are getting?

User avatar

researcher_1999

a year ago

Sure! Here are the errors, I can copy the whole log if you need it: (This happens with every sub, every URL including comments and users, so I'm not sure why)

2023-05-25T02:48:05.119Z WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. SyntaxError: Unexpected token o in JSON at position 1 2023-05-25T02:48:05.121Z at JSON.parse (

User avatar

researcher_1999

a year ago

Now the free version isn't working, either, and when I just checked it says it's under maintenance. Reddit did recently revoke API access from pushshift, so I don't know if you are using their API and they may have revoked access.

User avatar

researcher_1999

a year ago

This is the error that comes up now on the paid version: 2023-05-25T05:50:51.626Z ERROR This route is under maintenance, please use the previous actor version for now

User avatar

I have fixed the issue. You should be able to scrape communities again now.

User avatar

researcher_1999

a year ago

Thank you, you rock! I will test it out shortly :)

User avatar

researcher_1999

a year ago

I'm still getting this error on the paid Reddit Scraper: 2023-05-25T20:29:21.366Z ERROR This route is under maintenance, please use the previous actor version for now

I got the above error when scraping a user account for comments and this error just now for a community: 2023-05-25T20:32:47.425Z WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. Navigation timeout of 30000 ms exceeded

User avatar

Can you share the run ID with me so I can take a closer look? Just open the run and copy the ID from it like in the image:

User avatar

researcher_1999

a year ago

Sure! Here is one run (the sub) HAXeBbcUPmKUn3SA0

And here is the second one (the comments) 9XlBVZi6b45JMpUe0

User avatar

Had a small bug in there. I tested both runs with the new fix and now is working with no problems. Give it a try and let me know.

User avatar

researcher_1999

a year ago

Right on, it works to scrape communities! Now the only issue is that it doesn't work for users, for this URL structure: https://www.reddit.com/user/username/comments/ https://www.reddit.com/user/username/submitted/

I'm not sure if I'm just not setting limits correctly, but for subs, I can't seem to get results past 1 year. I set all the limits to 5000000 in case that was the issue and still, it cuts off at 1 year for subs that have posts dating back 4-7 years. But it is working to scrape content from the last year at least.

User avatar

I will probably need to add those url structures to the scraper. Let me check that.

User avatar

researcher_1999

a year ago

Sweet! This tool is really amazing, and I have a feeling you'll be getting a lot more people using it soon. :)

User avatar

researcher_1999

a year ago

I don't know if you thought about adding this as a feature, but it would be really useful if we could search for keywords in specific communities. Currently, the search can only be applied to Reddit as a whole. Just a thought!

User avatar

You can go to the Reddit community page, search for what you want, and then copy the URL and use it as starting point on the Reddit scrapper.

User avatar

researcher_1999

a year ago

Unfortunately, that isn't the same, even with search parameters, Reddit doesn't return proper results. I was using Camas, but they revoked the pushshift API so a lot of us are trying to scrape full subs and search for specific posts and when you search on Reddit you don't get all the results, compared to when you use a tool, unfortunately. :(

User avatar

Just an updated, I am testing the user URLs so they should be added to the scraper soon.

Developer
Maintained by Community
Actor metrics
  • 134 monthly users
  • 99.8% runs succeeded
  • 0.51 days response time
  • Created in Jun 2020
  • Modified about 8 hours ago