Extract information from leboncoin.fr : simply provide your search URL (+ your filters and location) and the number of times you want to click on "Page Suivante".
- Modified
- Users23
- Runs387
This actor aims to extract information from leboncoin.fr. Data can be exported to various formats such as JSON or CSV. You have to option to customise the information you want to extract by modifying the javascript function which is executed on each page.
How to Start
1 - Setup
- On your laptop, simply go to leboncoin and make a search, with all the filters you want, and location you want. Dont' forget to launch the search
and watch for :
- Then copy the URL of first page of search results, and paste it in the start-URL of this Actor. Please add the root url first, so that it mimics the user behaviour, as shown in the image bellow :
- Update the field "Page suivante: how many times ?" to paginate deeper in the results.
- If you encounter issues, set up a french residential proxy. It helps because leboncoin bans non-french request from its website. Go to "Advanced Configuration" :
2 - Run & Download
- Click Run
Then wait for the result to appear in the "dataset" section and download them.
Advanced Configuration
- Max pages : it's a limit, to prevent the actor from crawling too many pages.
- Function : It's the function extracting the result on each leboncoin.fr pages. If the default is not enough for you, just update it using jQuery selectors to fetch other piece of information from leboncoin.fr !
- Respect URL pattern : you can ask the actor to follow only links with a specific Regex pattern. By default, it follows only
https://www.leboncoin.fr[.*]
to avoid going wild in the web ! - Avoid anti-bot detection with Proxy : the Proxy option sets proxies to prevent detection by leboncoin. Best to have french proxies since leboncoin bans foreign IPs. see Apify Proxy for more info