Leboncoin extractor

anchor/leboncoin

Extract information from leboncoin.fr: provide your search URL (with your filters and location) and the number of times you want to click on "Page Suivante".

Free trial for 30 days

Then $9/month

No credit card required now

Author: guillim
  • Modified
  • Users: 5
  • Runs: 105

This actor extracts information from leboncoin.fr using a Chromium browser and French proxies. Data can be exported to various formats such as JSON or CSV. You have the option to customise the information you want to extract by modifying the JavaScript function that is executed on each page.

How to Start

1 - Setup

  • On your laptop, go to leboncoin and run a search with all the filters and the location you want. Don't forget to launch the search.
  • Then copy the URL of the first page of search results and paste it into the start URL of this actor.
  • Update the field "Page suivante: how many times ?" to paginate deeper in the results.
  • Get a custom French proxy (see below for more details about proxies), because leboncoin bans non-French requests from its website.

2 - Run & Download

  • Click Run

Then wait for the results to appear in the "dataset" section and download them.
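
As an alternative to downloading from the "dataset" section by hand, the results can also be fetched through the Apify API's dataset items endpoint. The sketch below only builds the download URL; the function name buildDatasetItemsUrl and the dataset ID "abc123" are placeholders made up for illustration, and the format list is limited to the two formats mentioned on this page.

```javascript
// Sketch only: builds the Apify API URL that returns a dataset's items.
// "datasetId" stands for the ID shown for your run's dataset (placeholder here).
function buildDatasetItemsUrl(datasetId, format = 'json') {
  // Restricted to the formats mentioned in this page; the API offers more.
  const supportedFormats = ['json', 'csv'];
  if (!supportedFormats.includes(format)) {
    throw new Error(`Unsupported format: ${format}`);
  }
  const url = new URL(
    `https://api.apify.com/v2/datasets/${encodeURIComponent(datasetId)}/items`
  );
  url.searchParams.set('format', format);
  return url.toString();
}

// Example with a made-up dataset ID.
console.log(buildDatasetItemsUrl('abc123', 'csv'));
// prints "https://api.apify.com/v2/datasets/abc123/items?format=csv"
```

Opening the resulting URL in a browser, or requesting it with any HTTP client, should return the extracted ads in the chosen format.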

Proxy

The Proxy option sets the proxies used to prevent detection by leboncoin. French proxies are compulsory, since leboncoin bans foreign IPs. That is why you must choose "Custom Proxies": none of the other options provides French proxies. See Apify Proxy.

The proxy configuration setting has the following options:

Custom proxies

The scraper will use a custom list of proxy servers. Proxies must be specified in the scheme://user:password@host:port format. Note: multiple proxies should be separated by a space or a new line. The scheme can be http or socks5. The user and password are optional.

Example:

http://groups-RESIDENTIAL,country-FR:balbalbalbaGNyTidoCZCfqg@proxy.apify.com:8000

None (banned by leboncoin)

The scraper will not use any proxies and will run from AWS IPs in the United States.

Apify Proxy, automatic (banned by leboncoin)

The proxy uses all proxy groups that are available to the user, but none of them is French.

Apify Proxy, selected groups (banned by leboncoin)

The proxy uses specific groups, but none of them is French.
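
Before pasting a custom proxy list into the actor, it can help to check that every entry follows the scheme://user:password@host:port format described above. The following is a minimal sketch of such a check; parseProxyList and PROXY_PATTERN are hypothetical helpers, not part of the actor, and the credentials in the usage example are made up.

```javascript
// Hypothetical helper, not part of the actor: splits a custom proxy list on
// whitespace (spaces or new lines) and validates each entry.
// Accepted form: scheme://user:password@host:port, with scheme http or socks5
// and an optional user and password.
const PROXY_PATTERN =
  /^(http|socks5):\/\/(?:([^:@\s]+)(?::([^@\s]*))?@)?([^:@\/\s]+):(\d+)$/;

function parseProxyList(text) {
  return text
    .split(/\s+/)
    .filter((entry) => entry.length > 0)
    .map((entry) => {
      const match = PROXY_PATTERN.exec(entry);
      if (!match) {
        throw new Error(`Invalid proxy entry: ${entry}`);
      }
      const [, scheme, user, password, host, port] = match;
      return {
        scheme,
        user: user ?? null,         // null when no user is given
        password: password ?? null, // null when no password is given
        host,
        port: Number(port),
      };
    });
}

// Usage with made-up credentials: one entry per line.
const proxies = parseProxyList(
  'http://groups-RESIDENTIAL,country-FR:secret@proxy.apify.com:8000\n' +
  'socks5://127.0.0.1:1080'
);
console.log(proxies.length); // prints 2
```

The check is purely syntactic: it does not verify that a proxy is reachable or that its exit IP is actually French.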

Advanced Configuration

  • Respect URL pattern: you can ask the actor to follow only links matching a specific regex pattern. By default, it follows only https://www.leboncoin.fr[.*] so that the crawl stays on leboncoin.
  • Max pages: a limit that prevents the actor from crawling too many pages.
  • Function: the function that extracts the results on each leboncoin.fr page. If the default is not enough for you, update it using jQuery selectors to fetch other pieces of information from leboncoin.fr.
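
To give an idea of what a customised extraction function might look like, here is a minimal sketch built on jQuery selectors. It is not the actor's default function: the name extractAds and the three selectors are hypothetical and would have to be replaced by selectors matching the current leboncoin.fr markup. How the actor hands the jQuery object to your function is also an assumption here, so the sketch simply takes it as a parameter.

```javascript
// Illustrative sketch only, not the actor's default function.
// "$" is assumed to be the page's jQuery object.
// All selectors below are HYPOTHETICAL: inspect leboncoin.fr and adapt them.
function extractAds($) {
  const ads = [];
  // One iteration per ad card found on the search results page.
  $('a[data-qa-id="aditem_container"]').each((index, element) => {
    const ad = $(element);
    ads.push({
      title: ad.find('[data-qa-id="aditem_title"]').text().trim(),
      price: ad.find('[data-qa-id="aditem_price"]').text().trim(),
      url: ad.attr('href'),
    });
  });
  return ads;
}
```

Each returned object becomes one record of the result, so adding a field is a matter of adding one more selector lookup inside the loop.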