This scraper searches Naver for the given keywords and crawls the content of the blogs in the search results. See below for details.
- Keywords. A list of keywords to search for.
- Start Date. Blogs before this date will not be scraped.
- End Date. Blogs after this date will not be scraped.
- Max Pages. The maximum number of search result pages to scrape. At the time this scraper was developed, 7 blogs were displayed per page.
The following fields are included in the output:
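As a rough illustration of how these inputs interact, the sketch below estimates the upper bound on results and applies the date window. The field names (`keywords`, `start_date`, `end_date`, `max_pages`) and both helper functions are hypothetical and are not taken from the scraper's actual input schema. The sketch also assumes that the page limit applies to each keyword search separately and that both dates are inclusive.

```python
from datetime import date

# Figure quoted above; Naver may display a different number per page today.
BLOGS_PER_PAGE = 7

# Hypothetical input; the real field names may differ.
scraper_input = {
    "keywords": ["jeju travel", "seoul cafe"],
    "start_date": date(2023, 1, 1),
    "end_date": date(2023, 3, 31),
    "max_pages": 5,
}


def max_blogs_per_keyword(max_pages, blogs_per_page=BLOGS_PER_PAGE):
    """Upper bound on blogs scraped for one keyword (assumes a per-keyword limit)."""
    return max_pages * blogs_per_page


def in_date_range(published, start_date, end_date):
    """True if a blog's date falls inside the window, both ends inclusive."""
    return start_date <= published <= end_date
```

With `max_pages` set to 5, at most 5 x 7 = 35 blogs would be scraped per keyword, and fewer if some fall outside the date window.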
- Short Description
- Full Text
- Number of likes
- Number of comments
Note: Because blog content on Naver is enclosed in an iframe, this scraper can only return all the text from the blog page. That includes headers, the menu bar, the sidebar and even advertisements. Data cleansing after scraping is needed to extract the actual blog content.
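One possible starting point for that cleansing step is sketched below. It is a simple heuristic, not part of the scraper: `clean_full_text` is a hypothetical helper, and the boilerplate phrases are examples that you would need to collect yourself by inspecting real output. It assumes the Full Text field is a plain string with one page element per line.

```python
def clean_full_text(full_text, boilerplate=(), min_chars=1):
    """Drop blank lines, very short lines, and known boilerplate lines.

    boilerplate: exact line texts (menu labels, ad captions) to discard.
    min_chars:   lines shorter than this after stripping are discarded.
    """
    unwanted = set(boilerplate)
    kept = []
    for raw_line in full_text.splitlines():
        line = raw_line.strip()
        if len(line) < min_chars:
            continue  # blank or too short to be content
        if line in unwanted:
            continue  # header, menu, sidebar or ad text
        kept.append(line)
    return "\n".join(kept)


# Example with made-up page text.
sample = "Home\nMenu\n\nMy trip to Jeju\nIt was sunny.\nAdvertisement\n"
cleaned = clean_full_text(sample, boilerplate={"Home", "Menu", "Advertisement"})
```

Exact-match filtering only removes lines you already know about, so expect to extend the phrase list as you review more blogs.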