Deep Website Content Crawler avatar

Deep Website Content Crawler

Try for free

Pay $2.00 for 1,000 results

Go to Store
Deep Website Content Crawler

Deep Website Content Crawler

6sigmag/deep-website-content-crawler
Try for free

Pay $2.00 for 1,000 results

Scrape Failed Killer! A high-performance web scraper that rapidly extracts and analyzes content from multiple websites simultaneously. Perfect for competitive research, content aggregation, and website structure analysis.

nikita-sviridenko avatar

If one of websites hangs, the whole task fails

Open

Nikita Sviridenko (nikita-sviridenko) opened this issue
a month ago

Could you please add a per-website timeout? And an error column so that we know which of them will fail?

This way other websites will be fetched successfully.

nikita-sviridenko avatar

F.e. from this list 7 websites were imported and then it timed out after 5 min:

1{
2  "startUrls": [
3    "pearl-executivesearch.com",
4    "e-epitech.com",
5    "taylormaderecrutement.com",
6    "carrhure.com",
7    "focus-recrutement.fr",
8    "vauban-recrutement.fr",
9    "beager.com",
10    "job-tourisme.fr",
11    "eurojob-consulting.com",
12    "headhuntingfactory.com",
13    "pentabell.com",
14    "la-releve.com",
15    "haussmann-es.com",
16    "prismemploi.eu",
17    "briks.fr",
18    "88jobs.com",
19    "archibat.com",
20    "solantis.fr",
21    "selescope.com",
22    "ltd-international.com",
23    "sintel-recrutement.fr",
24    "macanders.eu",
25    "jobmania.fr",
26    "quadra-consultants.com",
27    "recruter.tn",
28    "grantalexander.com",
29    "fedlegalimports.com",
30    "asap.work",
31    "synapse-executive.com",
32    "dlsi-emploi.com",
33    "easypartner.fr",
34    "vendome-associes.com",
35    "now-consulting.fr",
36    "recrutop.com",
37    "myrecrutement.eu",
38    "skillwise.fr",
39    "ccld.com",
40    "alerys.fr",
41    "talysio.fr",
42    "gif-emploi.fr",
43    "gif-emploi.fr",
44    "tasterh.fr",
45    "axxis-interimetrecrutement.com",
46    "abg.asso.fr",
47    "lynkus.fr",
48    "morganmallet.agency",
49    "highdev.com",
50    "achil.io",
51    "fedafrica.com",
52    "aperlead.com"
53  ]
54}
nikita-sviridenko avatar

It'll be also helpful to see in the logs which websites are already scraped.

Developer
Maintained by Community

Actor Metrics

  • 52 monthly users

  • 6 stars

  • >99% runs succeeded

  • 33 days response time

  • Created in Oct 2024

  • Modified 2 months ago