Legal News Aggregator - National Law Review Articles
Pricing
Pay per event
Legal News Aggregator - National Law Review Articles
Extract attorney-authored legal news and analysis articles from the National Law Review. Returns title, author, law firm, publication date, practice areas, jurisdictions, summary, and full text. First legal news aggregator on Apify.
Pricing
Pay per event
Rating
0.0
(0)
Developer
BowTiedRaccoon
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
16 hours ago
Last modified
Categories
Share
National Law Review Legal News Scraper
Scrape attorney-authored legal news and analysis articles from the National Law Review. Returns article title, author, law firm, publication date, practice areas, jurisdictions, summary, full text, and lead image URL for 33,000+ articles across every major US practice area.
Legal News Scraper Features
- Extracts 11 fields per article — title, author, firm, date, practice areas, jurisdictions, summary, full body text, image, source, and scrape timestamp
- Pulls from the full sitemap index — 33,000+ articles spanning a decade of legal commentary
- Sorts newest-first by lastmod, so
maxItems: 100returns the most recent 100 articles, not a random slice - Accepts a direct URL list too, for targeted scrapes of specific articles
- No proxies, no browser, no CAPTCHA — just clean HTML from a server-rendered site
- Parses JSON-LD schema.org
NewsArticlemetadata, which is about as stable as web data gets
Who Uses National Law Review Data?
- Law firm marketing teams — Track which firms and attorneys publish on which practice areas, benchmark thought-leadership output
- Compliance and regulatory teams — Monitor new analysis on regulatory changes across jurisdictions you care about
- Legal tech startups — Build datasets of attorney-authored content for search, summarization, or LLM training
- Market intelligence analysts — Track sentiment, topic frequency, and firm activity across the legal industry
- Dataset builders — Collect a deep corpus of structured legal writing without scraping paywalled publications
How the Legal News Scraper Works
- Walk the sitemap — Fetches the National Law Review sitemap index and each child sitemap, collecting article URLs with their last-modified timestamps
- Sort and slice — Orders articles newest-first, then caps the list to
maxItems - Fetch each article — CheerioCrawler pulls each page at moderate concurrency, respecting rate limits
- Parse and save — Pulls JSON-LD metadata for the authoritative fields and CSS selectors for the body, practice areas, and jurisdictions
Skip steps 1 and 2 by passing a list of article URLs directly. The scraper handles that mode too, since sometimes you already know which articles you want.
Input
{"maxItems": 100,"sp_intended_usage": "Compliance monitoring across tax and employment practice areas","sp_improvement_suggestions": "None"}
Or target specific articles by URL:
{"articleUrls": [{ "url": "https://natlawreview.com/article/major-h-1b-changes-announced-including-new-100000-fee" },{ "url": "https://natlawreview.com/article/whats-domain-name-explainer-domain-investing" }],"maxItems": 2,"sp_intended_usage": "Targeted research","sp_improvement_suggestions": "None"}
| Field | Type | Default | Description |
|---|---|---|---|
| maxItems | integer | 100 | Maximum number of articles to scrape. Articles are sorted newest-first when walking the sitemap. Set to 0 for unlimited. |
| articleUrls | array | [] | Optional list of specific article URLs. When provided, the sitemap walk is skipped and only these URLs are crawled. |
| proxyConfiguration | object | none | Proxy settings. Not required — National Law Review is a public site with no anti-bot protection. |
Legal News Scraper Output Fields
{"article_url": "https://natlawreview.com/article/major-h-1b-changes-announced-including-new-100000-fee","title": "Major H-1B Changes Announced, Including New $100,000 Fee","source_site": "natlawreview","author_name": "Norris McLaughlin P.A.","author_firm": "Norris McLaughlin P.A.","publication_date": "2025-09-22","summary": "In a series of startling and conflicting announcements that caused a great deal of panic over the weekend for H-1B holders and their employers, President Trump ","full_text": "In a series of startling and conflicting announcements ... particularly small business and nonprofits.","practice_areas": ["Immigration","Labor Employment","Administrative Regulatory"],"jurisdictions": ["All Federal"],"image_url": "https://natlawreview.com/sites/default/files/2025-09/H1B%20Visa%20Lottery%20Employment%20Immigration_2.jpg","scraped_at": "2026-04-18T01:19:26.230Z"}
| Field | Type | Description |
|---|---|---|
| article_url | string | Canonical article URL |
| title | string | Article headline |
| source_site | string | Source publication — currently always natlawreview |
| author_name | string | Attorney or author page name |
| author_firm | string | Law firm the author works for |
| publication_date | string | Publication date in ISO 8601 format (YYYY-MM-DD) |
| summary | string | Short article summary from JSON-LD description |
| full_text | string | Full article body as plain text, HTML tags stripped and entities decoded |
| practice_areas | array | Practice areas tagged on the article (e.g. Construction Law, Real Estate) |
| jurisdictions | array | Jurisdictions tagged on the article (e.g. Florida, All Federal) |
| image_url | string | Lead image URL, if present |
| scraped_at | string | ISO 8601 timestamp of when the article was scraped |
FAQ
How do I scrape the latest articles from the National Law Review?
Run the scraper with default input. It walks the sitemap, sorts by last-modified date, and returns the most recent 100 articles. Change maxItems to scrape more or fewer.
How do I scrape specific National Law Review articles by URL?
Pass a list of articleUrls in the input. The scraper skips the sitemap walk and fetches only the URLs you provide. Useful for re-scraping specific articles or building custom pipelines.
How much does the National Law Review Scraper cost to run?
The scraper uses the standard $0.10 per actor start + $0.001 per article record pricing. A 100-article run costs about $0.20 and finishes in under a minute. A full 33,000-article sitemap walk runs in under 10 minutes.
Does the scraper need proxies?
No. The National Law Review is a public Drupal site served through Varnish cache. No Cloudflare, no CAPTCHAs, no rate limiting in practice — the scraper ships with proxy settings disabled by default.
What practice areas does the National Law Review cover?
Every major US practice area, roughly. Construction law, immigration, tax, labor and employment, IP, real estate, financial services, environmental, health care, and dozens more. Each article is tagged with its practice areas and jurisdictions in the output.
Need More Features?
Need custom fields, filters, or coverage of additional legal news sites (JD Supra, Above the Law, Mondaq)? File an issue or get in touch.
Why Use the Legal News Aggregator Scraper?
- First of its kind — No other Apify actor targets legal news and analysis. This is the only one.
- Clean structured output — JSON-LD-backed fields mean consistent author, firm, and date attribution across tens of thousands of articles, which saves you the cleanup pass you were going to run anyway
- Affordable — ~$0.001 per article, no proxy costs, no browser costs