Pricing

Pay per event

Biketo China Cycling News & Product Scraper

Scrapes Biketo (美骑网) — China's largest cycling portal — for news, product reviews, and race coverage since 2008. Enumerates articles by sequential ID across three channels. Returns title, author, publish date, channel, body text, lead image, and engagement metrics.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

Actor stats

Bookmarked

Total users

Monthly active users

13 days ago

Last modified

What you get

Each scraped record contains:

Field	Description
`articleId`	Numeric article ID (e.g. 56323)
`articleUrl`	Full canonical URL
`channel`	Biketo's channel label in Chinese (e.g. 美骑快讯, 产品快讯, 赛事新闻)
`title`	Article headline in Chinese
`tags`	Comma-separated category tags from the article header
`author`	Author or source attribution
`publishDate`	Publish date-time (YYYY-MM-DD HH:MM:SS)
`leadImage`	URL of the first image in the article body
`bodyText`	Full article body text, whitespace-collapsed
`viewCount`	Page view count (integer)
`commentCount`	Comment count (integer)
`scrapedAt`	ISO-8601 scrape timestamp

Input parameters

Parameter	Type	Default	Description
`startId`	integer	1	Article ID to start enumeration from
`endId`	integer	56500	Article ID to stop at (inclusive)
`channels`	array	`["news","product","racing"]`	Content channels to include
`maxItems`	integer	—	Cap on total articles to return

Content channels

news — Cycling news, industry coverage, product announcements (/news/<id>.html)
product — Gear reviews and product features (/product/<id>.html)
racing — Race coverage and results (/racing/<id>.html)

All three channels share the same sequential ID space. IDs are enumerated in parallel across selected channels; invalid IDs for a given channel are silently skipped.

Usage examples

Full back-catalog (all channels, ~56k articles):

{
  "startId": 1,
  "endId": 56500,
  "channels": ["news", "product", "racing"]
}

Recent articles only (incremental update):

{
  "startId": 56200,
  "endId": 56500,
  "channels": ["news", "product", "racing"],
  "maxItems": 100
}

Product reviews only:

{
  "startId": 1,
  "endId": 56500,
  "channels": ["product"]
}

Notes

Charset: Biketo serves pages in GB2312. The actor transparently decodes to UTF-8 via Crawlee's built-in charset handling — all output fields are clean UTF-8 Chinese text.
Rate limiting: The actor uses moderate concurrency (5–15) with polite crawling. No proxy is required; the site is fully accessible to datacenter IPs.
Invalid IDs: Not every ID exists in every channel. The actor skips URLs that return 404 or lack an article heading — no error is logged for these, keeping run logs clean.
Resumability: For large runs, set startId and endId to narrow ranges. Re-run with updated startId for incremental updates.

Cyclingnews Races & News Scraper

jungle_synthesizer/cyclingnews-races-news-scraper

Scrapes pro-cycling news articles and race reports from Cyclingnews.com. Extracts headline, author, dates, body text, summary, and LATAM-cycling relevance flags (riders and races). For sports-analytics, LLM training, and cycling intelligence dashboards.

BowTiedRaccoon

CAST China Space Technology News Scraper

jungle_synthesizer/cast-cn-china-academy-space-technology-news-scraper

Scrapes news articles from CAST (中国空间技术研究院), China's primary satellite manufacturer. Extracts articles from channels including 本院动态, 媒体聚焦, 科技动态, and more. Each article includes title, full body text and HTML, publish date, source attribution, and channel name.

BowTiedRaccoon

USA Cycling Sport80 Events Scraper

jungle_synthesizer/usacycling-sport80-events-finder-scraper

Scrapes USA Cycling's Sport80 event locator for cycling race and event listings. Extracts organizer contact details (name, email, phone), event dates, location coordinates, pricing, entry windows, and capacity — ideal for B2B lead generation targeting race-service suppliers.

BowTiedRaccoon

China Tech & Startup News — 科技创投 API

nexgendata/china-tech-startup-news

Track China tech and startup news coverage. Clean JSON for analysts, quants and AI agents.

NexGenData

News Source Crawler

crawlerbros/news-source-crawler

Given a news website URL, discover and extract articles with full metadata with title, authors, publish date, body text, top image, keywords, and summary. Works with any news site via sitemap or HTML discovery.

Crawler Bros

Professional Cycling Results & Classifications

trovevault/professional-cycling-results-classifications

Returns cycling race winners, stage results, GC, points, mountains, youth, and team classifications. Export data, run via API, schedule and monitor runs, or integrate with other tools.

Trove Vault

Made in China Product Scraper 🔎🇨🇳🛒

scrapestorm/made-in-china-product-scraper

🟠 Easily collect Products from Made-in-China Provide one or multiple Made-in-China search or category URLs and extract product data including 🆔 Product ID 🏷️ Title 💲 Price Range 📍 Location 🌐 Source URL 🔗 Product URL & more… Perfect for B2B sourcing research and competitor monitoring 🚀📊🏭

Storm_Scraper

5.0

Toutiao News Tracker - China News Hot Board & Search

nexgendata/toutiao-news-tracker

Scrape Toutiao (今日头条), ByteDance's #1 China news app: the live hot board (热榜) plus keyword news search. Get rank, headline, hot value, source, abstract and article URL as JSON. For China news monitoring, media intelligence and sentiment research. No CN account.

NexGenData

36Kr China Tech Startup News Feed Scraper

jungle_synthesizer/36kr-china-tech-startup-news-feed-scraper

Scrape 36Kr (36氪), China's premier tech and startup news outlet. Extract articles, funding-round announcements, author metadata, section tags, and full body text across all verticals.

BowTiedRaccoon

Made-in-China.com Product Scraper

agenscrape/made-in-china-com-product-scraper

Scrape Made-in-China.com products by keyword, category, or URL. Extract product names, prices, MOQ, supplier details, certifications, OEM/sample support, and more. Perfect for B2B sourcing and supplier discovery from China's largest wholesale platform.