Yelp Business Info Scraper avatar

Yelp Business Info Scraper

Try for free

1 day trial then $25.00/month - No credit card required now

Go to Store
Yelp Business Info Scraper

Yelp Business Info Scraper

delicious_zebu/yelp-business-info-scraper
Try for free

1 day trial then $25.00/month - No credit card required now

Quickly gather rich, detailed data from Yelp business pages—perfect for insights and analysis! 🚀

Developer
Maintained by Community

Actor Metrics

  • 19 Monthly users

  • No reviews yet

  • 18 bookmarks

  • >99% runs succeeded

  • 15 hours response time

  • Created in Nov 2024

  • Modified 8 days ago

competent_path avatar

Multiple urls as input stopped working

Closed
Competent Path (competent_path) opened this issue
12 days ago

In the previous version 0.0.8 I was able to pass multiple urls at an input, and it would return output for all of them. It broke in the latest version 0.0.9

12025-02-28T21:02:18.114Z ACTOR: Pulling Docker image of build 5iO3ZMpkagGdJaCAe from repository.
22025-02-28T21:02:18.889Z ACTOR: Creating Docker container.
32025-02-28T21:02:18.972Z ACTOR: Starting Docker container.
42025-02-28T21:02:20.489Z [apify] INFO  Initializing Actor...
52025-02-28T21:02:20.493Z [apify] INFO  System info ({"apify_sdk_version": "2.3.1", "apify_client_version": "1.9.2", "crawlee_version": "0.5.4", "python_version": "3.12.9", "os": "linux"})
62025-02-28T21:02:20.524Z [apify] INFO  Hello from the Actor!
72025-02-28T21:02:36.709Z [apify] ERROR Error occurred while requesting parameters: https://www.yelp.com/biz/total-auto-protect-wilmington, skipping. Error: RetryError[<Future at 0x74edd09a0530 state=finished raised Exception>]
82025-02-28T21:02:36.711Z [apify] INFO  Exiting Actor ({"exit_code": 0})

Example input that causes failure

1{
2  "Urls": [
3    "https://www.yelp.com/biz/total-auto-protect-wilmington-8",
4    "https://www.yelp.com/biz/total-auto-protect-wilmington"
5  ]
6}

however if I run the following input, it works just fine

1{
2  "Urls": [
3    "https://www.yelp.com/biz/total-auto-protect-wilmington"
4  ]
5}

[update] Seems like it was running for really long time and timedout

The Actor timed out. You can resurrect it with more timeout to continue where you left off.

competent_path avatar

Please also see my other ticket https://console.apify.com/actors/OrbxHdajgAHeoPvlv/issues/4JwqkifMIyLX1bzJy

It would be really great if you retained versions history.

delicious_zebu avatar

Hi, thank you so much for your feedback. I’ve fixed and optimized the issue. You can now re-run the Actor to check the results, and feel free to contact me if you encounter any further issues.

Also, regarding your suggestion not to delete historical versions, that’s a great idea. You can first try the current version "0.0.11" to see if it works fine. If there are no issues, I will make sure to keep the historical versions when updating in the future.

competent_path avatar

I still have an issue with 0.0.11 Not sure if it is the same or something else.

When I run a single url: https://www.yelp.com/biz/harrys-tobacco-shop-burbank-2

I get

2025-03-01T13:26:28.988Z ACTOR: The Actor run has reached the timeout of 60 seconds, aborting it. You can increase the timeout in Input > Run options.

or sometimes I get

2025-03-01T13:45:28.395Z [apify] ERROR Error occurred while requesting parameters: https://www.yelp.com/biz/harrys-tobacco-shop-burbank-2, skipping. Error: RetryError[<Future at 0x74a6ebcc74d0 state=finished raised Exception>]

Note that I still get the output, so it's both the output and timeout received. See screenshot attached.

delicious_zebu avatar

Hi,

Regarding the first timeout issue: It might occasionally be affected by server network conditions (which doesn't happen often). If the Actor fails to load the webpage within a short time, this issue may occur. You can increase the timeout in Input > Run options, for example, changing it from 60 seconds to 3600 seconds.

Regarding the second issue: This happens because a CAPTCHA appears when requesting the Yelp website. In this case, the Actor will attempt to retry the request. If a CAPTCHA is encountered five times in a row, the Actor will skip that parameter. As a solution, I have increased the number of retry attempts from 5 to 10. If the error still occurs, you will need to rerun the Actor with that parameter.

You can try the latest version "0.0.12".

competent_path avatar

Thank you for this. I also see that you released new version and retained the old one which is great. I will keep testing and let you know how it goes.