Rakuten Ichiba Scraper

Scrape Rakuten Ichiba product listings for prices, shops, ratings, reviews, images, badges, shipping snippets, and product URLs.

Developer

Stas Persiianenko

Maintained by Community

Scrape public Rakuten Ichiba Japan search and category pages for structured product listing data.

Rakuten Ichiba is one of Japan's largest ecommerce marketplaces. This actor turns public listing pages into a clean Apify dataset with product titles, prices, shop names, ratings, review counts, image URLs, promotional badges, Rakuten points snippets, shipping text, ranking positions, and product URLs.

Use it when you need repeatable Rakuten product data without manually copying Japanese search result pages.

What this actor does

Rakuten Ichiba Scraper requests public Rakuten search, mall, ranking, and category listing pages, extracts visible product cards, follows pagination, and saves each product listing as one dataset item.

It supports two input styles:

  • 🔎 Search keywords, such as カメラ, ノートパソコン, or スマートウォッチ
  • 🔗 Direct Rakuten URLs, such as search pages, category pages, or already-filtered result pages

The actor is HTTP-based, so it is lighter and faster than browser-based scrapers while still capturing the core product listing data visible on Rakuten pages.

Why use it

Rakuten pages are optimized for shoppers, not for repeatable analysis. Product cards contain mixed Japanese text, prices, points, shipping badges, seller snippets, and ranking positions that are hard to copy consistently.

This actor gives you:

  • ✅ Clean product rows instead of manual copy/paste
  • ✅ Ranking positions for keyword and category monitoring
  • ✅ Numeric prices plus original Japanese price text
  • ✅ Shop and review signals for marketplace research
  • ✅ Image and product URLs for catalog matching
  • ✅ Timestamps so recurring runs can be compared over time
  • ✅ Apify dataset export to JSON, CSV, Excel, XML, or API pipelines

Who is it for

This actor is useful for teams that monitor Japanese ecommerce data:

  • Ecommerce analysts tracking competitive pricing on Rakuten Ichiba
  • Marketplace sellers checking category visibility and assortment
  • Brands monitoring authorized and gray-market shop listings
  • Retail research teams comparing Rakuten with Amazon Japan or Yahoo Shopping
  • Agencies preparing category reports for clients entering Japan
  • Catalog enrichment teams matching products by title, image, and URL
  • Lead generation teams identifying shops active in a product niche

Typical use cases

  • Track competitor prices on Rakuten Ichiba every day.
  • Monitor product availability and shipping snippets.
  • Collect marketplace assortment for a keyword or category.
  • Watch review counts and ratings for popular products.
  • Build lead lists of shops selling products in a category.
  • Compare Rakuten pricing with Amazon, Yahoo Shopping, or your own store.
  • Export search result snapshots for BI dashboards.
  • Detect new promotional badges, points campaigns, or free-shipping offers.

Data extraction table

| Field | Type | Description |
| --- | --- | --- |
| sourceType | string | Whether the row came from a keyword search or direct URL. |
| searchQuery | string | Search keyword used for keyword runs. |
| sourceUrl | string | Rakuten result page URL that produced the row. |
| page | number | Rakuten result page number. |
| position | number | Overall position in the run. |
| pagePosition | number | Position on the current page. |
| title | string | Product listing title. |
| url | string | Product detail URL on Rakuten. |
| price | number | Parsed numeric JPY price when available. |
| priceText | string | Original visible price text, such as 24,800円. |
| currency | string | Currency code, currently JPY. |
| shopName | string | Shop or seller name when visible. |
| rating | number | Product rating when visible on the card. |
| reviewCount | number | Review count when visible on the card. |
| imageUrl | string | Main product image URL. |
| badges | array | Promotional labels, badges, coupon text, shipping labels, or points snippets. |
| pointsText | string | Rakuten points text when visible. |
| shippingText | string | Shipping or availability snippet. |
| rawText | string | Compact visible card text for QA/debugging. |
| scrapedAt | string | ISO timestamp for the scrape. |

Example output

{
    "sourceType": "search",
    "searchQuery": "カメラ",
    "sourceUrl": "https://search.rakuten.co.jp/search/mall/%E3%82%AB%E3%83%A1%E3%83%A9/",
    "page": 1,
    "position": 1,
    "pagePosition": 1,
    "title": "Example digital camera listing",
    "url": "https://item.rakuten.co.jp/example-shop/example-item/",
    "price": 24800,
    "priceText": "24,800円",
    "currency": "JPY",
    "shopName": "Example Shop",
    "rating": 4.5,
    "reviewCount": 128,
    "imageUrl": "https://thumbnail.image.rakuten.co.jp/example.jpg",
    "badges": ["送料無料", "ポイント"],
    "pointsText": "ポイント5倍",
    "shippingText": "送料無料",
    "rawText": "Example compact card text",
    "scrapedAt": "2026-05-19T08:00:00.000Z"
}
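
The relationship between priceText and price in the example can be illustrated with a small parser. This is only a sketch of the idea, not the actor's actual implementation: the function name parse_jpy_price is hypothetical, and it assumes the visible text contains digits with optional thousands separators followed by 円.

```python
import re
from typing import Optional


def parse_jpy_price(price_text: str) -> Optional[int]:
    """Return the first yen amount found in the text, or None.

    Hypothetical helper for illustration; the actor's own parser may differ.
    """
    # Digits with optional comma separators, followed by the yen character.
    match = re.search(r"(\d[\d,]*)\s*円", price_text)
    if match is None:
        return None
    return int(match.group(1).replace(",", ""))


print(parse_jpy_price("24,800円"))  # prints 24800
```

Returning None rather than 0 for unparseable text keeps a missing price distinguishable from a real one, which matches the "when available" wording in the field table.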

How much does it cost to scrape Rakuten Ichiba?

This actor uses pay-per-event pricing. Each run has a small start event and then a result event for every product listing saved to the dataset.

The pricing is designed for both small tests and larger recurring monitoring jobs. You can control your spend by setting maxResults before the run starts.

Cost tips:

  • Start with 20-50 products to confirm the keyword or URL is correct.
  • Use one keyword per scheduled monitoring run if you need stable ranking comparisons.
  • Increase maxResults only after checking that the first dataset rows contain the product category you expect.
  • Keep proxy enabled for reliability; Rakuten may return short reference pages to some direct traffic.
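
To budget a run before starting it, the pay-per-event model described above can be written as one start event plus one result event per saved listing. The sketch below assumes that simple two-event structure. The fee values passed in are placeholders, not the actor's real prices, which should be taken from its pricing page.

```python
def estimate_run_cost(max_results: int, start_fee: float, result_fee: float) -> float:
    """Upper-bound cost of one run: one start event plus one event per result.

    start_fee and result_fee are hypothetical inputs; use the real event
    prices from the actor's pricing page.
    """
    if max_results < 0:
        raise ValueError("max_results must not be negative")
    return start_fee + max_results * result_fee


# Placeholder fees purely for illustration.
estimate = estimate_run_cost(max_results=50, start_fee=0.005, result_fee=0.001)
print(f"Estimated upper bound: ${estimate:.3f}")
```

Because a run can stop early when Rakuten returns fewer listings, the figure is an upper bound for the event charges rather than an exact bill.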

How to scrape Rakuten Ichiba

  1. Open the actor in Apify Console.
  2. Enter one or more Rakuten search keywords in searchQueries, or paste direct Rakuten result URLs in startUrls.
  3. Set maxResults to the number of product listings you want.
  4. Keep useApifyProxy enabled for Rakuten reliability.
  5. Leave proxyGroups as RESIDENTIAL and proxyCountryCode as JP unless you know another route works.
  6. Click Start.
  7. Open the default dataset after the run finishes.
  8. Export the data as JSON, CSV, Excel, XML, RSS, or consume it through the dataset API.

Input parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| searchQueries | array | No* | Rakuten keywords to search. Japanese keywords usually work best. |
| startUrls | array | No* | Direct Rakuten search/category URLs. |
| maxResults | integer | No | Maximum product listings to save across all inputs. |
| useApifyProxy | boolean | No | Enables Apify Proxy. Recommended for Rakuten. |
| proxyGroups | array | No | Proxy groups to use. Prefill is RESIDENTIAL. |
| proxyCountryCode | string | No | Proxy country code. Prefill is JP. |
| maxRequestRetries | integer | No | Retry attempts for failed or blocked requests. |

*Provide at least one searchQueries item or one startUrls item.
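
If you generate inputs programmatically, a small pre-flight check can catch the "at least one of searchQueries or startUrls" rule before a run is started. The function below is a hypothetical client-side helper, not part of the actor. The requirement that maxResults be a positive integer is an assumption made for this sketch.

```python
from typing import Any, Dict, List


def validate_input(run_input: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the input looks usable."""
    problems: List[str] = []

    queries = run_input.get("searchQueries") or []
    start_urls = run_input.get("startUrls") or []
    if not queries and not start_urls:
        problems.append("Provide at least one searchQueries item or one startUrls item.")

    # Assumption for this sketch: maxResults, when given, is a positive integer.
    max_results = run_input.get("maxResults")
    if max_results is not None and (not isinstance(max_results, int) or max_results < 1):
        problems.append("maxResults must be a positive integer.")

    return problems


print(validate_input({"searchQueries": ["カメラ"], "maxResults": 50}))  # prints []
```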

Search queries

Use searchQueries for normal keyword searches. Japanese keywords usually produce the best results because Rakuten Ichiba is a Japanese marketplace. English terms can also work when Rakuten has matching product titles.

Example:

{
    "searchQueries": ["ノートパソコン", "カメラ"],
    "maxResults": 50
}
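
The example output pairs the keyword カメラ with a sourceUrl in which the keyword is percent-encoded as UTF-8 in the path. The sketch below reproduces that mapping so you can predict which page a keyword run will request. It assumes the same /search/mall/ pattern holds for other keywords, which this README does not state explicitly. build_search_url is a hypothetical name.

```python
from urllib.parse import quote

# Base path taken from the sourceUrl shown in the example output.
SEARCH_BASE = "https://search.rakuten.co.jp/search/mall/"


def build_search_url(keyword: str) -> str:
    """Percent-encode the keyword as UTF-8 and place it in the search path."""
    return f"{SEARCH_BASE}{quote(keyword, safe='')}/"


print(build_search_url("カメラ"))
# prints https://search.rakuten.co.jp/search/mall/%E3%82%AB%E3%83%A1%E3%83%A9/
```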

Direct URLs

Use startUrls when you already have a Rakuten search, mall, ranking, or category URL. The actor will request the URL and paginate with Rakuten's p parameter.

Example:

{
    "startUrls": [
        { "url": "https://search.rakuten.co.jp/search/mall/%E3%82%AB%E3%83%A1%E3%83%A9/" }
    ],
    "maxResults": 100
}

For a first run, keep maxResults between 20 and 50. For recurring monitoring, set one keyword or one URL per run when you want stable ranking comparisons. For broad research, provide multiple keywords and increase maxResults after validating the first sample.

Proxy settings

Rakuten can return short reference pages or sparse HTML to some datacenter traffic. The actor therefore enables Apify Proxy by default. Residential proxies are prefilled because they are usually more reliable for marketplace pages.

If you already know your traffic source works without a proxy, you can disable useApifyProxy. For production use, leave it enabled.

Pagination

The actor paginates by adding or updating Rakuten's p query parameter. It stops when it reaches maxResults, when a page no longer contains product cards, or when Rakuten returns no additional listings.
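
"Adding or updating" the p parameter can be sketched with the standard library. This is an illustration of the technique rather than the actor's code. with_page is a hypothetical helper, and the s parameter in the usage comment is only a stand-in for whatever filters your URL already carries.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_page(url: str, page: int) -> str:
    """Return the URL with its p query parameter set to the given page."""
    parts = urlsplit(url)
    # Keep every existing parameter except a previous p value.
    params = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != "p"
    ]
    params.append(("p", str(page)))
    return urlunsplit(parts._replace(query=urlencode(params)))


print(with_page("https://search.rakuten.co.jp/search/mall/x/?s=2&p=1", 3))
# prints https://search.rakuten.co.jp/search/mall/x/?s=2&p=3
```

Existing filter parameters are preserved, so an already-filtered result URL keeps its filters across pages.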

Data freshness

The actor scrapes live public pages. Prices, shipping snippets, point campaigns, and availability can change frequently. Use scrapedAt to identify when each row was collected.
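
When rows from several runs are mixed, scrapedAt can be used to discard stale ones. The sketch below assumes the timestamp format shown in the example output (ISO 8601 with milliseconds and a trailing Z). The helper names are hypothetical.

```python
from datetime import datetime, timezone


def parse_scraped_at(value: str) -> datetime:
    """Parse a scrapedAt string such as 2026-05-19T08:00:00.000Z."""
    # Older Python versions do not accept a trailing "Z" in fromisoformat.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def age_hours(row: dict, now: datetime) -> float:
    """Hours elapsed between the row's scrape time and the given moment."""
    return (now - parse_scraped_at(row["scrapedAt"])).total_seconds() / 3600


row = {"scrapedAt": "2026-05-19T08:00:00.000Z"}
print(age_hours(row, datetime(2026, 5, 19, 14, 0, tzinfo=timezone.utc)))  # prints 6.0
```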

Ranking analysis

Use position for the overall run order. Use page and pagePosition when you need to compare Rakuten page layout or detect ranking movement over time.
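
Ranking movement between two runs of the same keyword can be detected by joining rows on a stable key. The sketch below assumes the product url is stable enough to serve as that key, which may not hold if Rakuten adds tracking parameters. ranking_changes is a hypothetical helper that works on rows already loaded from two datasets.

```python
from typing import Dict, Iterable, Tuple


def ranking_changes(previous: Iterable[dict], current: Iterable[dict]) -> Dict[str, Tuple[int, int]]:
    """Map product URL to (old position, new position) where the position changed."""
    old_positions = {row["url"]: row["position"] for row in previous if row.get("url")}
    changes: Dict[str, Tuple[int, int]] = {}
    for row in current:
        url = row.get("url")
        # Products that are new in this run have no old position and are skipped.
        if url in old_positions and old_positions[url] != row["position"]:
            changes[url] = (old_positions[url], row["position"])
    return changes


yesterday = [{"url": "a", "position": 1}, {"url": "b", "position": 2}]
today = [{"url": "b", "position": 1}, {"url": "a", "position": 2}, {"url": "c", "position": 3}]
print(ranking_changes(yesterday, today))  # prints {'b': (2, 1), 'a': (1, 2)}
```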

Shop monitoring

The shopName field helps identify sellers active in a category. Not every Rakuten card exposes the shop name in the same markup, so this field can be empty for some rows.
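
A per-shop listing count is a quick way to see which sellers dominate a keyword. The sketch below skips rows where shopName is missing or empty, in line with the note above. listings_per_shop is a hypothetical helper operating on rows you have already downloaded.

```python
from collections import Counter
from typing import Iterable


def listings_per_shop(rows: Iterable[dict]) -> Counter:
    """Count listings per shop, ignoring rows without a shop name."""
    return Counter(row["shopName"] for row in rows if row.get("shopName"))


rows = [
    {"shopName": "Example Shop"},
    {"shopName": "Another Shop"},
    {"shopName": "Example Shop"},
    {"shopName": ""},   # shop name not exposed on this card
    {},                 # field omitted entirely
]
print(listings_per_shop(rows).most_common(1))  # prints [('Example Shop', 2)]
```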

Review monitoring

When visible, rating and reviewCount are parsed from the listing card. If Rakuten hides review data for a layout, the fields are omitted instead of guessed.

Images

The actor extracts the first product image URL from each card. Use imageUrl for visual QA, catalog matching, or downstream enrichment.

Badges and promotions

Rakuten cards often contain many promotional snippets. The actor collects short visible labels that look like coupons, points, sales, ranking badges, used/new labels, or free-shipping badges.
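
Because badges is an array of short labels, filtering for a promotion is a substring check over that array. The sketch below is a hypothetical helper that treats a missing badges field as an empty list. The label 送料無料 (free shipping) is taken from the example output.

```python
from typing import Iterable, List


def rows_with_badge(rows: Iterable[dict], needle: str) -> List[dict]:
    """Return rows where any badge contains the given text."""
    return [
        row
        for row in rows
        if any(needle in badge for badge in (row.get("badges") or []))
    ]


rows = [
    {"title": "A", "badges": ["送料無料", "ポイント"]},
    {"title": "B", "badges": ["ポイント"]},
    {"title": "C"},  # no badges collected for this card
]
print([row["title"] for row in rows_with_badge(rows, "送料無料")])  # prints ['A']
```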

Tips

  • Use Japanese search terms for better coverage.
  • Paste a filtered Rakuten URL when you need a specific category, price range, or sort order.
  • Keep scheduled monitoring inputs stable so ranking changes are comparable.
  • Treat optional fields as optional; Rakuten does not show every signal on every card.
  • Inspect rawText when troubleshooting a layout or missing field.
  • Use sourceUrl and page to trace a row back to the exact result page.

Limitations

This actor focuses on listing cards, not full product detail pages. It does not log in, add products to carts, bypass paywalls, or access private seller dashboards. It extracts only information visible on public Rakuten pages.

Anti-bot behavior

If Rakuten returns a short reference page, the run log will include a warning. In that case, keep Apify Proxy enabled, use residential proxies, reduce the number of keywords in one run, and retry later.

Performance

The actor is HTTP-based and does not run a browser. This keeps compute use low compared with Playwright actors. Network and proxy cost are the main cost drivers for large jobs.

Dataset export

After a run, export the dataset as JSON, CSV, Excel, XML, or RSS from Apify. CSV is convenient for spreadsheets. JSON is recommended for API pipelines.
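
The same exports are available over HTTP through the dataset items endpoint, where the format query parameter selects the output type. The sketch below only builds the URL. dataset_items_url is a hypothetical helper, abc123 is a placeholder dataset ID, and the request itself may also need your API token, depending on how the dataset is shared.

```python
from urllib.parse import urlencode

# Format values for the export types named above (Excel is requested as xlsx).
ALLOWED_FORMATS = {"json", "csv", "xlsx", "xml", "rss"}


def dataset_items_url(dataset_id: str, fmt: str = "json") -> str:
    """Build the dataset items URL for the chosen export format."""
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"Unsupported format: {fmt}")
    query = urlencode({"format": fmt})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"


print(dataset_items_url("abc123", "csv"))
# prints https://api.apify.com/v2/datasets/abc123/items?format=csv
```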

Integrations

You can connect this actor to repeatable workflows:

  • Scheduled price monitoring: run the same keyword every morning, export CSV, and compare price, position, and shopName against yesterday's dataset.
  • BI dashboard pipeline: trigger the actor from an orchestrator, read the dataset API, and load product rows into BigQuery, Snowflake, Airtable, or Google Sheets.
  • Competitive alerting: use an Apify webhook after each successful run, then send low-price or new-shop matches to Slack, email, or your internal alert system.
  • Catalog enrichment: join title, imageUrl, and url with internal SKU matching logic.
  • Agency reporting: run category URLs for multiple clients and export separate datasets for each recurring report.
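
The competitive alerting workflow above comes down to comparing today's rows with yesterday's. The sketch below flags products whose price fell by at least a chosen ratio. price_drops, the 5% default threshold, and the alert dictionary keys are all hypothetical choices for illustration. Rows are joined on url, which is assumed to be stable between runs.

```python
from typing import Iterable, List


def price_drops(previous: Iterable[dict], current: Iterable[dict], min_drop_ratio: float = 0.05) -> List[dict]:
    """Return alerts for products whose price dropped by at least min_drop_ratio."""
    old_prices = {
        row["url"]: row["price"]
        for row in previous
        if row.get("url") and row.get("price") is not None
    }
    alerts = []
    for row in current:
        url, price = row.get("url"), row.get("price")
        if url in old_prices and price is not None:
            if price <= old_prices[url] * (1 - min_drop_ratio):
                alerts.append({
                    "url": url,
                    "oldPrice": old_prices[url],
                    "newPrice": price,
                    "shopName": row.get("shopName"),
                })
    return alerts


yesterday = [{"url": "a", "price": 24800}, {"url": "b", "price": 9800}]
today = [{"url": "a", "price": 22000, "shopName": "Example Shop"}, {"url": "b", "price": 9700}]
print(price_drops(yesterday, today))
```

The resulting list can be posted to Slack or email from whatever service receives the Apify webhook.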

API usage

You can run the actor from the Apify API, schedule it, or integrate it with webhooks. A common workflow is to schedule daily keyword runs and export the dataset to your analytics warehouse.

Node.js example

import { ApifyClient } from 'apify-client';

// Read the API token from the environment rather than hard-coding it.
const client = new ApifyClient({ token: process.env.APIFY_TOKEN });

// Start the actor and wait for the run to finish.
const run = await client.actor('automation-lab/rakuten-ichiba-scraper').call({
    searchQueries: ['カメラ'],
    maxResults: 20,
    useApifyProxy: true,
    proxyGroups: ['RESIDENTIAL'],
    proxyCountryCode: 'JP'
});

// Print the URL of the dataset items produced by this run.
console.log(`Dataset: https://api.apify.com/v2/datasets/${run.defaultDatasetId}/items`);

Python example

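import os

from apify_client import ApifyClient

# Read the API token from the environment rather than hard-coding it.
client = ApifyClient(os.environ['APIFY_TOKEN'])

# Start the actor and wait for the run to finish.
run = client.actor('automation-lab/rakuten-ichiba-scraper').call(run_input={
    'searchQueries': ['カメラ'],
    'maxResults': 20,
    'useApifyProxy': True,
    'proxyGroups': ['RESIDENTIAL'],
    'proxyCountryCode': 'JP',
})

# The run's default dataset holds the scraped product rows.
print(run['defaultDatasetId'])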

cURL example

curl -X POST "https://api.apify.com/v2/acts/automation-lab~rakuten-ichiba-scraper/runs?token=$APIFY_TOKEN" \
    -H 'Content-Type: application/json' \
    -d '{"searchQueries":["カメラ"],"maxResults":20,"useApifyProxy":true,"proxyGroups":["RESIDENTIAL"],"proxyCountryCode":"JP"}'

Example API input

{
    "searchQueries": ["スマートウォッチ"],
    "maxResults": 200,
    "useApifyProxy": true,
    "proxyGroups": ["RESIDENTIAL"],
    "maxRequestRetries": 3
}

MCP

Use Apify's hosted MCP endpoint when connecting Claude Code, Claude Desktop, Cursor, VS Code, or another MCP-compatible client. The tool URL should include this actor slug so the client exposes the Rakuten Ichiba tool directly.

Claude Code MCP setup

claude mcp add --transport http apify-rakuten-ichiba "https://mcp.apify.com?tools=automation-lab/rakuten-ichiba-scraper"

Claude Desktop, Cursor, and VS Code MCP JSON

{
    "mcpServers": {
        "apify-rakuten-ichiba": {
            "type": "http",
            "url": "https://mcp.apify.com?tools=automation-lab/rakuten-ichiba-scraper"
        }
    }
}

After connecting, authenticate with your Apify account/token as required by your MCP client.

Example MCP prompts

Try these example prompts after connecting the Apify MCP server:

  • "Run Rakuten Ichiba Scraper for カメラ with 30 results and summarize the cheapest products."
  • "Scrape this Rakuten category URL and return the top shops by listing count."
  • "Monitor ノートパソコン on Rakuten Ichiba and compare prices with the previous dataset."

Legality

The actor collects publicly visible Rakuten Ichiba listing information. Use it responsibly, respect Rakuten's terms, avoid excessive request volumes, and do not use the output for spam, deceptive activity, or privacy-invasive profiling. You are responsible for ensuring your use case complies with applicable laws and marketplace policies.

Other Automation Lab actors that can support ecommerce and marketplace workflows:

Use Rakuten Ichiba Scraper when your target is public product listing data from Rakuten Japan.

Troubleshooting

If a run returns no data, check the log for block-page warnings and verify that the input URL opens in a normal browser. Try a smaller maxResults, keep useApifyProxy enabled, and use Japanese keywords when possible.

If only some fields are empty, Rakuten probably did not show those values on that particular listing card. The actor does not invent missing ratings, review counts, or shop names.

Quality notes

Rakuten changes markup over time. The extractor uses multiple selector and text-pattern fallbacks instead of depending on one CSS class. The rawText field is included to make troubleshooting easier when Rakuten tests a new layout.

Best practices

Start with a small run. Inspect the output. Then scale to more keywords or pages. For tracking, keep the same query spelling and sorting URL between runs.

Support

If you need help, include the Apify run URL, your input JSON, and a short explanation of what you expected to see. This makes it possible to reproduce the issue quickly.

Changelog

  • 0.1.0: Initial release for Rakuten Ichiba search/category product listing cards.

FAQ

Does it scrape product detail pages?

No. Version 0.1 extracts listing-card data from public search/category pages.

Can I scrape by category?

Yes. Paste a Rakuten category or search URL into startUrls.

Is Japanese input required?

No, but Japanese keywords usually provide better Rakuten coverage.

Why are some fields empty?

Rakuten does not show every detail on every card. Empty optional fields mean the value was not visible or could not be parsed safely.

Can I use this for price monitoring?

Yes. Schedule recurring runs and compare price, position, shopName, and scrapedAt over time.