Articles Extractor avatar
Articles Extractor
Try for free

3 days trial then $15.00/month - No credit card required now

View all Actors
Articles Extractor

Articles Extractor

web.harvester/articles-extractor
Try for free

3 days trial then $15.00/month - No credit card required now

A fast and powerful tool for extracting article data from URLs. It uses advanced scraping techniques to parse HTML and extract relevant information such as the article title, author, publication date, and content. Download your data in any format (JSON, CSV, XML, RSS, HTML Table).

RE

Some valid article content not being scraped

Open

releasd opened this issue
5 months ago

In this article the last two paragraphs of text are not extracted: https://www.sussexexpress.co.uk/news/people/father-and-daughter-team-up-at-bellway-development-in-crawley-4379088

The extracted text ends with "£455,000 and £605,000 respectively."

The following two paragraphs are missed:

"To find out more about the development, visit https://www.bellway.co.uk/new-homes/south-london/riverbrook-place or call the sales team on 01293 306782.

To find out more about careers with Bellway, visit https://www.bellwaycareers.co.uk/."

Why would this text not be scraped?

Thanks

Developer
Maintained by Community
Actor metrics
  • 34 monthly users
  • 4 stars
  • 100.0% runs succeeded
  • 8.1 hours response time
  • Created in Jun 2023
  • Modified about 1 month ago
Categories