Contact Details Scraper
vdrmota/contact-info-scraper
The following list specifies the actor's INPUT fields as defined by its input schema. These fields can be entered manually in the app or provided in a JSON object when running the actor through the API. Read more in the docs.
Start URLs
List of web pages where the actor will start crawling.
Required
Type: array
Proxy
Proxy servers let you bypass website protections, avoid IP address blocking, and view content as it is served in other countries. Try using a proxy if you experience timeout errors.
Optional
Type: object
Maximum link depth
The maximum number of links away from the Start URLs that the actor will crawl. If 0, the actor will not follow any links. If empty or null, the actor will follow links to arbitrary depth.
Optional
Type: integer
minimum: 0
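The depth rule above can be sketched as a small predicate. This is only an illustration of the described semantics, not the actor's source code: the function name `may_follow_links` and the convention that Start URLs sit at depth 0 are assumptions.

```python
from typing import Optional


def may_follow_links(current_depth: int, max_depth: Optional[int]) -> bool:
    """Decide whether links found on a page may be enqueued.

    Hypothetical helper: assumes Start URLs are at depth 0 and every
    followed link adds 1 to the depth of the page it leads to.
    """
    if max_depth is None:
        # Empty or null limit: follow links to arbitrary depth.
        return True
    # A page at depth d links to pages at depth d + 1, which must not
    # exceed the limit. With max_depth == 0 this is never true.
    return current_depth < max_depth
```

With a limit of 0 only the Start URLs themselves are loaded; with a limit of 1 the actor would also load pages linked directly from them, and so on.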
Total maximum pages
The maximum number of pages the crawler will load. It is always a good idea to limit the number of pages; otherwise, the actor might run indefinitely or consume too many resources.
Optional
Type: integer
minimum: 1
Maximum pages per start URL
The maximum number of pages that will be enqueued from each start URL you provide.
Optional
Type: integer
minimum: 1
Stay within domain
If set, the actor will only follow links within the same domain as the referring page.
Optional
Type: boolean
Live view
If set, the actor will display a live view on the container URL, where you can monitor its progress. Note that the live view has a small performance overhead.
Optional
Type: boolean
Probe frames
If set, the actor will also extract contact information from IFRAMEs. Sometimes you might not want that (e.g., the results will then include data from online ads).
Optional
Type: boolean
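As a rough illustration of how the fields above might be assembled into a JSON object for an API run, here is a minimal Python sketch. The JSON key names (`startUrls`, `maxDepth`, `maxRequests`, `maxRequestsPerStartUrl`, `sameDomain`, `liveView`, `considerChildFrames`, `proxyConfig`) and the `{"url": ...}` shape of each Start URL entry are assumptions made for this example, since this page lists only the display names; check the actor's input schema for the actual keys. The helper `build_input` is hypothetical. It enforces only the constraints stated above: Start URLs is a required array, and the integer fields have minimums of 0, 1 and 1.

```python
import json
from typing import Any, Dict, List, Optional


def _check_min(name: str, value: Optional[int], minimum: int) -> None:
    """Validate an optional integer field against its documented minimum."""
    if value is None:
        return
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, int):
        raise ValueError(f"{name} must be an integer")
    if value < minimum:
        raise ValueError(f"{name} must be >= {minimum}")


def build_input(
    start_urls: List[str],
    max_depth: Optional[int] = None,
    max_pages: Optional[int] = None,
    max_pages_per_start_url: Optional[int] = None,
    stay_within_domain: Optional[bool] = None,
    live_view: Optional[bool] = None,
    probe_frames: Optional[bool] = None,
    proxy: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Build an input object; all key names are assumed, not confirmed."""
    # Start URLs is the only required field. Rejecting an empty list is a
    # practical choice of this sketch, not a documented constraint.
    if not isinstance(start_urls, list) or not start_urls:
        raise ValueError("start_urls must be a non-empty list")
    _check_min("max_depth", max_depth, 0)
    _check_min("max_pages", max_pages, 1)
    _check_min("max_pages_per_start_url", max_pages_per_start_url, 1)

    candidate = {
        "startUrls": [{"url": url} for url in start_urls],
        "maxDepth": max_depth,
        "maxRequests": max_pages,
        "maxRequestsPerStartUrl": max_pages_per_start_url,
        "sameDomain": stay_within_domain,
        "liveView": live_view,
        "considerChildFrames": probe_frames,
        "proxyConfig": proxy,
    }
    # Leave unset optional fields out of the object entirely.
    return {key: value for key, value in candidate.items() if value is not None}


if __name__ == "__main__":
    run_input = build_input(["https://example.com"], max_depth=2, max_pages=100)
    print(json.dumps(run_input, indent=2))
```

Omitting an optional field rather than sending null keeps the object small; for Maximum link depth, both forms mean unlimited depth according to the description above.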