Construction Dive Scraper avatar

Construction Dive Scraper

Pricing

from $2.90 / 1,000 articles

Go to Apify Store
Construction Dive Scraper

Construction Dive Scraper

Construction Dive scraper for USA construction news and press releases: extract articles, contacts, images, and metadata from ConstructionDive.com Deep Dive section for construction tech, PR, and market research workflows.

Pricing

from $2.90 / 1,000 articles

Rating

0.0

(0)

Developer

Lexis Solutions

Lexis Solutions

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

Construction Dive Scraper

What does the Construction Dive Scraper do?

This actor crawls Construction Dive (constructiondive.com) to collect article and press-release content. It supports both article/search pages (topic pages, press-release index) and article/detail pages (news and press releases). The scraper normalizes and produces structured dataset items for downstream analysis.

What data can I extract from ConstructionDive.com with this scraper?

With this actor you can extract:

  • Article-level metadata: url, title, publishDate, subHeading, description (cleaned paragraphs and headings)
  • Authors: author, authorTitle
  • Lead image: articleImage, imageCaption, sourceText
  • Inline and gallery images: images array with imageUrl, captionText, sourceText
  • Press release fields (when page is a press release): contactName, contactEmail, contactPhone, aboutSection

How to use this Scraper

This actor is runnable on Apify or locally. Provide startUrls (search/listing pages or direct article pages) and maxItems to limit how many detail pages are enqueued per start URL.

Steps:

  1. Provide start URLs (search/listing or detail pages) in the input or via the startUrls console field.
  2. Set maxItems to limit how many detail pages to collect per start URL.
  3. Run the actor and download the dataset when complete.

Input

The actor accepts the following input parameters:

  • startUrls (array of objects) - Required. URLs to start with. Can be search/listing pages or individual article/press-release pages.
  • maxItems (integer) - The maximum number of detail pages to enqueue per start URL. Example: 5
  • proxyConfiguration (object) - Proxy configuration settings.

Supported URL Examples

Example input:

{
"startUrls": [
{ "url": "https://www.constructiondive.com/press-release/" },
{ "url": "https://www.constructiondive.com/topic/commercial-building/" }
],
"maxItems": 5,
"proxyConfiguration": { "useApifyProxy": true }
}

Output

The scraped data is saved to the default dataset. Each item represents either a news article or a press release depending on the page type. You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

Common Fields

  • url, title, publishDate, subHeading, description
  • author, authorTitle (news pages)
  • articleImage, imageCaption, sourceText (lead image)
  • images (array of image objects: imageUrl, captionText, sourceText)
  • contactName, contactEmail, contactPhone, aboutSection (press releases)

All absent values are explicitly set to null.

Example Output - Press Release

{
"url": "https://www.constructiondive.com/press-release/20260330-southern-impression-homes-jlm-living-advance-the-eleanor-bloomingdale-b/",
"title": "Southern Impression Homes & JLM Living Advance The Eleanor - Bloomingdale BTR Community | Construction Dive",
"publishDate": "March 30, 2026",
"contactName": "Kara Pound",
"contactEmail": "kara@oldcitypr.com",
"contactPhone": "386-237-4500",
"description": "JACKSONVILLE, Fla. — Southern Impression Homes (SIH), a leading full-service property development group specializing in Build-to-Rent (BTR) communities, announces its joint project with JLM Living on The Eleanor - Bloomingdale, a 253-unit single-family rental community currently under construction on Little Neck Road in Bloomingdale, Georgia, just outside Savannah. Vertical construction commenced in October 2025, with the first units expected in April 2026 and full project completion slated for early 2027. \"The Eleanor - Bloomingdale demonstrates our ability to seamlessly integrate design, development, and construction into a single execution platform,\" said Chris Funk, President and CEO of Southern Impression Homes...",
"aboutSection": "Search Home Topics Commercial Corporate News Economy Infrastructure Labor Safety Tech Sustainability Legal/Regs Deep Dive Opinion Library Events Press Releases Get Construction Dive in your inbox...",
"subHeading": "253-unit Bloomingdale development highlights strength of vertically-integrated partnership",
"author": null,
"authorTitle": null,
"articleImage": null,
"imageCaption": null,
"sourceText": null,
"images": []
}

Example Output - News Article

{
"url": "https://www.constructiondive.com/news/data-centers-community-benefits-spec-new-york-build-panel/816335/",
"title": "Data centers must prove their community worth, panel says",
"publishDate": "April 2, 2026",
"author": "Kate Serpico",
"authorTitle": "Senior Editor",
"subHeading": "As AI drives demand for data centers, developers face growing pressure to deliver tangible local benefits",
"description": "NEW YORK — Data center developers rushing to build facilities to support artificial intelligence applications must prove their worth to local communities, industry experts said at a panel discussion here...",
"articleImage": "https://www.constructiondive.com/imgproxy/...",
"imageCaption": "Aerial view of data center construction site",
"sourceText": "Permission granted by Construction Photography",
"images": [
{
"imageUrl": "https://www.constructiondive.com/imgproxy/...",
"captionText": "Interior view of data center server room",
"sourceText": "Construction Photography"
}
],
"contactName": null,
"contactEmail": null,
"contactPhone": null,
"aboutSection": null
}

Notes and Limitations

  • The actor depends on the current Construction Dive HTML structure; update selectors if the site changes.
  • Respect site Terms of Service and robots.txt. Use proxies and throttling to reduce blocking risk.
  • Pagination links that are relative (e.g., ?page=2) are resolved against the current page URL to produce absolute next-page links.

🔍 Looking to Scrape more News Websites?

In addition to this actor, you can explore our suite of dedicated scrapers tailored for other popular news websites. Each scraper is optimized for its target site to ensure accurate, efficient, and high-performance data extraction.

ScraperCountryDescription
Ynet.co.il ScraperIsrealScrape news content from ynet.co.il to gather headlines, summaries, and metadata. Ideal for news aggregation, market analysis, and tracking real-time trends. Fast, structured, and customizable extraction from an Israel-based source.
ElEspanol.com ScraperSpainScrape news content from El Español - including headlines, summaries, article bodies, authors, and publish dates. Ideal for news aggregation, market analysis, and trend tracking. Fast, structured, and customizable extraction from Spain’s leading news source.
Reddit Answers ScraperGlobalUnlock structured AI-powered Q&A from Reddit Answers—extract organized answers, source subreddits, related posts, and suggested topics. Perfect for market research, content creation, SEO strategy, and knowledge base building. Fast, reliable, and fully customizable.

Explore these solutions to expand your data collection capabilities across events data extraction websites.


👀 p.s.

Need changes or a custom export format (CSV/JSONL)? I can add dataset schema views or additional fields.

Contact the maintainer or open an issue in the repo for improvements.

Image Credit: https://www.constructiondive.com/