🔥Czech News Scraper

Pricing

from $2.00 / 1,000 articles

Try for free

Go to Apify Store

🔥Czech News Scraper

Try for free

Extracts articles from Czech news sites (Novinky.cz, Seznam Zprávy, Super.cz, Proženy.cz) in JSON, real-time & historical, 1M+ articles.

Pricing

from $2.00 / 1,000 articles

Rating

5.0

(1)

Developer

P Brother

Maintained by Community

Actor stats

Bookmarked

Total users

Monthly active users

3 days ago

Last modified

Czech News Scraper

Czech News Scraper extracts article texts from selected news focusing on both speed and data quality.

In addition to the article text, Czech News Scraper also retrieves various metadata for each article. The full output is detailed below.

Features

⚡ High speed — scrape thousands of articles within seconds
📦 JSON output — structured data ready for further processing
🏷️ Extracted metadata — author, title, dates, tags, categories, and more
📝 Content in Markdown — clean, analysis-ready format
🔄 Unified content format — consistent schema across all supported websites
⏱️ Realtime & historical articles — access both the latest and historical articles

Filtering

🔍 Full-text query
📅 Created date range
🕒 Updated date range

Sorting

📆 By created date
♻️ By updated date
⭐ By rank (full-text relevance)

Pagination

📑 Up to 100 articles per page

🔗 Try all filters and features right away in the Start Console section — you only pay for the results you actually want and receive. Plus, you get free credits from Apify to get started.

Supported Websites

Czech News Scraper currently supports scraping articles from the following websites:

Novinky.cz — over 724,000 articles
Seznam Zprávy — over 171,000 articles
Super.cz — over 134,000 articles
ProŽeny.cz — over 46,000 articles

Total: over 1,076,000 articles

Additional websites will be added over time.
If you’d like to see a new website supported, go to the Issues tab and create a request.

Why Scrape News Articles?

There are many reasons why scraping news articles is useful:

Media monitoring: Track mentions of your company, competitors, or industry-related keywords to stay on top of reputation and trends.
Research and analysis: Collect and analyze articles to identify patterns, trends, and insights in politics, economics, or social issues.
Sentiment analysis: Determine sentiment around topics, companies, or individuals to understand public opinion.
Event detection: Detect and track events (natural disasters, protests, product launches) for fast response.
Topic modeling: Identify underlying topics and themes to understand broader context.
Entity extraction: Extract people, organizations, and locations to build databases or track relationships.
News recommendation: Build personalized recommendation systems for users.
Fake news detection: Identify potential misinformation and promote fact-based journalism.
Historical research: Archive articles for long-term study of past events and trends.
Business intelligence: Gather competitive intelligence, track markets, and discover opportunities.
Content generation: Use articles as input for summaries, abstracts, or generated content.
Academic research: Support studies in journalism, communication, sociology, and political science.
Data journalism: Create interactive dashboards and visualizations for storytelling.
AI training: Provide large, high-quality datasets for training AI models.

Output example

The scraped articles will be shown as a dataset which you can find in the Output tab.
For easier inspection, the results are first displayed as a table.

Below is a sample dataset in JSON format:

{
    "articleId": 40528950,
    "created": 1751640522,
    "updated": 1751652939,
    "recommendedUntilDate": 1751816446,
    "domicile": null,
    "url": "https://www.novinky.cz/clanek/ekonomika-jako-kdyby-spadl-most-na-plne-dalnici-komentuje-analytik-blackout-40528950",
    "section": "ekonomika",
    "tags": ["Elektřina", "Blackout", "Blackout v Česku"],
    "relatedArticles": [],
    "authors": ["Martin Procházka"],
    "title": "Jako kdyby spadl most na plné dálnici, komentuje analytik blackout",
    "perex": "Příčiny výpadku elektřiny v částech Česka nejsou zatím jasné...",
    "captionTitle": "Analytik společnosti Capitalinked Radim Dohnal",
    "captionImageUrl": "//d15-a.sdn.cz/d_15/c_img_ob_A/nPvP7Dwxi2txZrV7DpuCck/1c9a/radim-dohnal.jpeg",
    "content": "# `Jako kdyby spadl most na plné dálnici, komentuje analytik blackout`\n\nVytvořeno: 04.07.25 14:48\n\nAktualizováno: 04.07.25 18:15\n\n..."
}

NOTE: On this page you can also see a larger sample output with full JSON data and rendered Markdown content:
https://apify-czech-news.vercel.app/

NOTE: All textual content is converted into Markdown (titles, text, images, even tables, external links such as Facebook posts, Tweets, and many more).
Sometimes, however, embedded HTML widgets remain unprocessed. These are inserted inside html blocks in the output.
You can safely delete them locally, or further process them with tools like BeautifulSoup (Python) or Cheerio (Node.js).

Is It Legal to Extract Articles?

Yes, extracting articles is legal, since you are scraping publicly available content. However, most articles are protected by copyright laws.

Before publishing extracted content anywhere, always check the terms of use of the source website.

Your feedback

If you find a bug or have feedback, please create an issue in the Issues tab.

📧 Contact: pbrother@seznam.cz — don’t hesitate to reach out, I’ll look into it quickly.

Seznam Scraper

conduit/seznam-scraper

Extract search results from Seznam.cz, the leading Czech search engine. Get titles, URLs, and content snippets from any search query with automatic pagination support.

Conduit

Jobs.cz Scraper

lexis-solutions/jobs-cz-scraper

Scrape job listings from Jobs.cz - including titles, companies, locations, salaries, and requirements. Ideal for building job boards, market analysis, and trend tracking. Fast, structured, and customizable extraction from the Czech Republic’s leading job portal.

Lexis Solutions

5.0

Jobs.cz Scraper

shahidirfan/Jobs-cz-Scraper

Extract job listings effortlessly with the Jobs.cz Scraper. This lightweight actor is optimized for speed and reliability on the Jobs.cz platform. For seamless performance and to avoid blocks, the use of residential proxies is highly recommended. Perfect for recruitment data!

Shahid Irfan

Obchodni Rejstrik Downloader

valek.josef/obchodni-rejstrik-downloader

Downloads data from Czech company registry https://or.justice.cz/

Josef Válek

5.0

Google News Realtime Scraper

devisty/google-news

Provide real-time news and articles sourced from Google News

Devisty

237

5.0

Google News PPR

devisty/google-news-ppr

Provide real-time news and articles sourced from Google News (Pay per result)

Devisty

Super Fast Google News Scraper

aymorato/super-fast-google-news-scraper

Efficiently extract direct links to the latest Google News articles from the past 24 hours.

Alwin Morato

Google News Scraper

easyapi/google-news-scraper

Powerful Google News scraper, collect up to 5000 news articles with flexible search options, language support. Perfect for news aggregation, market research, and sentiment analysis. 📰🔍

EasyApi

750

4.8

Macys News Articles

pintostudio/macys-news-articles

The Macy's News Articles Actor is a powerful Apify web scraping tool designed to extract press releases and news articles from Macy's official newsroom.

Pinto Studio

Ultimate News API

glitch_404/Ultimate-News-Scraper

Scrape up to 10000 news articles from over 4500 news sources in less than 20 minutes, news from over 20 categories, e.g., Crypto news, World News, Latest News, Celebrities, and a lot more. You can find news on websites such as Fox News, BBC News, CNN, and Cryptocurrency-Related News Sources.