Zillow Detail Scraper

Pay $3.00 for 1,000 results

maxcopell/zillow-detail-scraper
Get details of Zillow properties from URLs. This Actor can be easily integrated with other Zillow Scrapers.

Getting Warning errors

Closed

thunderdagger opened this issue
3 months ago

I see this in the log when the Actor runs:

2024-07-30T16:31:52.866Z ACTOR: Creating Docker container.
2024-07-30T16:31:53.006Z ACTOR: Starting Docker container.
2024-07-30T16:31:54.835Z INFO System info {"apifyVersion":"3.1.15","apifyClientVersion":"2.8.4","crawleeVersion":"3.7.2","osType":"Linux","nodeVersion":"v20.14.0"}
2024-07-30T16:32:02.050Z WARN Failed to populate cache for FOR_RENT, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:02.173Z INFO Verified query RECENTLY_SOLD with hash 3b123ac2c33226bce39c810f136361c1c7cb88993477a528019bb2a82d66bd1e
2024-07-30T16:32:02.556Z WARN Failed to populate cache for Building, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:02.930Z WARN Failed to populate cache for PaidBuilderHomeDetail, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:03.954Z WARN Failed to populate cache for FOR_RENT, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:04.691Z WARN Failed to populate cache for PaidBuilderHomeDetail, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:06.280Z WARN Failed to populate cache for FOR_RENT, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:07.560Z INFO Verified query PaidBuilderHomeDetail with hash 590ca13d69e757b441445359a26d1b6b6d5aed52b3ef5fe9d605a1d96611e695
2024-07-30T16:32:08.770Z WARN Failed to populate cache for FOR_RENT, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:11.441Z WARN Failed to populate cache for FOR_RENT, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:22.918Z INFO Verified query CommunityDetails with hash 0e71082cfc846035f0f06bb1dd0eb1a8837813039218e4dbc912726924651d2c
2024-07-30T16:32:28.342Z INFO Verified query FOR_SALE with hash e48c1a391b46684464cc695885db5e86bceb34e0ed9666aa24dcc25cd3c51ed7
2024-07-30T16:32:33.031Z WARN Failed to populate cache for FOR_RENT, retrying. Error: RequestError: The HTTP/2 stream has been early terminated
2024-07-30T16:32:34.280Z WARN Failed to populate cache for FOR_RENT, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:36.171Z INFO Verified query FOR_RENT with hash 529ee8546635bce1a1f2fb014fb4cf8a75354a29d03b44894113ab8c6822b618
2024-07-30T16:32:52.018Z WARN Failed to populate cache for Building, retrying. Error: RequestError: The HTTP/2 stream has been early terminated
2024-07-30T16:32:54.218Z WARN Failed to populate cache for Building, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:56.543Z WARN Failed to populate cache for Building, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:32:58.380Z WARN Failed to populate cache for Building, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:33:00.529Z WARN Failed to populate cache for Building, retrying. Error: SyntaxError: Unexpected token '<', "<!DOCTYPE "... is not valid JSON
2024-07-30T16:33:03.561Z INFO Verified query Building with hash a0a71bb05d85154b246b5480760c26d1986bc17ae720658c417034f405215fe0
2024-07-30T16:33:03.745Z INFO HttpCrawler: Starting the crawler.

I don't think it's my code. Rather, I think something in the Actor is causing it, because it's expecting JSON.

mvolfik

Hi, this is known behavior. Sometimes Zillow blocks our request and serves an HTML error page instead of JSON, and parsing that page as JSON produces this error. We keep it in the log so that we can investigate when the scraper starts getting blocked more often or when users report hard failures instead of just warnings.
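For anyone curious why the message mentions `<!DOCTYPE`, here is a minimal sketch of the failure mode. It is not the Actor's actual code: `parseJsonBody` and the sample bodies are hypothetical, made up for illustration. It shows that a response body starting with `<` can never be valid JSON, so a caller can detect the block page up front and decide to retry, rather than letting `JSON.parse` throw a `SyntaxError` like the one in the log.

```typescript
// Hypothetical helper for illustration only; not taken from the Actor's source.
type ParseResult =
    | { ok: true; data: unknown }
    | { ok: false; reason: string };

function parseJsonBody(body: string): ParseResult {
    const trimmed = body.trim();
    // A JSON value never starts with '<', so a leading '<' means we most
    // likely received an HTML page (for example a block or error page).
    if (trimmed.charAt(0) === '<') {
        return { ok: false, reason: 'received HTML instead of JSON' };
    }
    try {
        return { ok: true, data: JSON.parse(trimmed) };
    } catch (err) {
        // Any other malformed body still ends up here.
        const message = err instanceof Error ? err.message : String(err);
        return { ok: false, reason: 'invalid JSON: ' + message };
    }
}

// Made-up sample bodies: one block page, one valid JSON payload.
const blocked = parseJsonBody('<!DOCTYPE html><html><body>Access denied</body></html>');
const fine = parseJsonBody('{"ok": true}');

console.log(blocked);
console.log(fine);
```

In this sketch a caller would treat `ok: false` as a signal to retry the request, which matches the "retrying" wording in the warnings above. How the Actor itself handles retries is not shown here.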

However, I understand that this can be confusing to end users. Thanks for raising this. We will reconsider it and possibly remove it from the log if we find that the information we get is not worth the confusion, or if we can solve this in some other way.

I hope this helps, thanks for your understanding.
Best regards
Matěj
