Rakuten Ichiba Scraper
Pricing
Pay per event
Scrape Rakuten Ichiba product listings for prices, shops, ratings, reviews, images, badges, shipping snippets, and product URLs.
Developer: Stas Persiianenko
Maintained by Community
Scrape public Rakuten Ichiba Japan search and category pages for structured product listing data.
Rakuten Ichiba is one of Japan's largest ecommerce marketplaces. This actor turns public listing pages into a clean Apify dataset with product titles, prices, shop names, ratings, review counts, image URLs, promotional badges, Rakuten points snippets, shipping text, ranking positions, and product URLs.
Use it when you need repeatable Rakuten product data without manually copying Japanese search result pages.
What this actor does
Rakuten Ichiba Scraper requests public Rakuten search, mall, ranking, and category listing pages, extracts visible product cards, follows pagination, and saves each product listing as one dataset item.
It supports two input styles:
- Search keywords, such as カメラ, ノートパソコン, or スマートウォッチ
- Direct Rakuten URLs, such as search pages, category pages, or already-filtered result pages
The actor is HTTP-based, so it is lighter and faster than browser-based scrapers while still capturing the core product listing data visible on Rakuten pages.
Why use it
Rakuten pages are optimized for shoppers, not for repeatable analysis. Product cards contain mixed Japanese text, prices, points, shipping badges, seller snippets, and ranking positions that are hard to copy consistently.
This actor gives you:
- Clean product rows instead of manual copy/paste
- Ranking positions for keyword and category monitoring
- Numeric prices plus original Japanese price text
- Shop and review signals for marketplace research
- Image and product URLs for catalog matching
- Timestamps so recurring runs can be compared over time
- Apify dataset export to JSON, CSV, Excel, XML, or API pipelines
Who is it for
This actor is useful for teams that monitor Japanese ecommerce data:
- Ecommerce analysts tracking competitive pricing on Rakuten Ichiba
- Marketplace sellers checking category visibility and assortment
- Brands monitoring authorized and gray-market shop listings
- Retail research teams comparing Rakuten with Amazon Japan or Yahoo Shopping
- Agencies preparing category reports for clients entering Japan
- Catalog enrichment teams matching products by title, image, and URL
- Lead generation teams identifying shops active in a product niche
Typical use cases
- Track competitor prices on Rakuten Ichiba every day.
- Monitor product availability and shipping snippets.
- Collect marketplace assortment for a keyword or category.
- Watch review counts and ratings for popular products.
- Build lead lists of shops selling products in a category.
- Compare Rakuten pricing with Amazon, Yahoo Shopping, or your own store.
- Export search result snapshots for BI dashboards.
- Detect new promotional badges, points campaigns, or free-shipping offers.
Data extraction table
| Field | Type | Description |
|---|---|---|
| sourceType | string | Whether the row came from a keyword search or direct URL. |
| searchQuery | string | Search keyword used for keyword runs. |
| sourceUrl | string | Rakuten result page URL that produced the row. |
| page | number | Rakuten result page number. |
| position | number | Overall position in the run. |
| pagePosition | number | Position on the current page. |
| title | string | Product listing title. |
| url | string | Product detail URL on Rakuten. |
| price | number | Parsed numeric JPY price when available. |
| priceText | string | Original visible price text, such as 24,800円. |
| currency | string | Currency code, currently JPY. |
| shopName | string | Shop or seller name when visible. |
| rating | number | Product rating when visible on the card. |
| reviewCount | number | Review count when visible on the card. |
| imageUrl | string | Main product image URL. |
| badges | array | Promotional labels, badges, coupon text, shipping labels, or points snippets. |
| pointsText | string | Rakuten points text when visible. |
| shippingText | string | Shipping or availability snippet. |
| rawText | string | Compact visible card text for QA/debugging. |
| scrapedAt | string | ISO timestamp for the scrape. |
Example output
    {
      "sourceType": "search",
      "searchQuery": "カメラ",
      "sourceUrl": "https://search.rakuten.co.jp/search/mall/%E3%82%AB%E3%83%A1%E3%83%A9/",
      "page": 1,
      "position": 1,
      "pagePosition": 1,
      "title": "Example digital camera listing",
      "url": "https://item.rakuten.co.jp/example-shop/example-item/",
      "price": 24800,
      "priceText": "24,800円",
      "currency": "JPY",
      "shopName": "Example Shop",
      "rating": 4.5,
      "reviewCount": 128,
      "imageUrl": "https://thumbnail.image.rakuten.co.jp/example.jpg",
      "badges": ["送料無料", "ポイント"],
      "pointsText": "ポイント5倍",
      "shippingText": "送料無料",
      "rawText": "Example compact card text",
      "scrapedAt": "2026-05-19T08:00:00.000Z"
    }
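When consuming rows downstream, it can help to load them into a typed structure that tolerates missing optional fields. The following is a minimal sketch, not part of the actor: the `Listing` class and `listing_from_row` helper are hypothetical names, and it assumes `title`, `url`, and `position` are present on every row while the other fields may be omitted.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Listing:
    """One dataset row; optional fields stay None when the row omits them."""
    title: str
    url: str
    position: int
    price: Optional[int] = None
    shop_name: Optional[str] = None
    rating: Optional[float] = None
    review_count: Optional[int] = None
    badges: list = field(default_factory=list)


def listing_from_row(row: dict) -> Listing:
    # .get() returns None for keys that are missing from the row.
    return Listing(
        title=row["title"],
        url=row["url"],
        position=row["position"],
        price=row.get("price"),
        shop_name=row.get("shopName"),
        rating=row.get("rating"),
        review_count=row.get("reviewCount"),
        badges=row.get("badges") or [],
    )


row = {
    "title": "Example digital camera listing",
    "url": "https://item.rakuten.co.jp/example-shop/example-item/",
    "position": 1,
    "price": 24800,
    "shopName": "Example Shop",
}
listing = listing_from_row(row)
print(listing.price, listing.rating)  # 24800 None
```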
How much does it cost to scrape Rakuten Ichiba?
This actor uses pay-per-event pricing. Each run has a small start event and then a result event for every product listing saved to the dataset.
The pricing is designed for both small tests and larger recurring monitoring jobs. You can control your spend by setting maxResults before the run starts.
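As a rough illustration of how the two event types add up, the sketch below computes one start event plus one result event per saved listing. The fee values are placeholders chosen for the example, not the actor's real prices; check the pricing shown on the actor page for actual numbers.

```python
def estimate_run_cost(results: int, start_fee: float, per_result_fee: float) -> float:
    """One start event plus one result event for every saved listing."""
    return start_fee + results * per_result_fee


# Placeholder fees for illustration only (NOT the actor's real prices).
cost = estimate_run_cost(results=50, start_fee=0.01, per_result_fee=0.001)
print(f"Estimated cost for 50 results: ${cost:.3f}")
```

Because the result count is capped by maxResults, the same formula with `results=maxResults` gives an upper bound for a run under these assumptions.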
Cost tips:
- Start with 20-50 products to confirm the keyword or URL is correct.
- Use one keyword per scheduled monitoring run if you need stable ranking comparisons.
- Increase maxResults only after checking that the first dataset rows contain the product category you expect.
- Keep proxy enabled for reliability; Rakuten may return short reference pages to some direct traffic.
How to scrape Rakuten Ichiba
- Open the actor in Apify Console.
- Enter one or more Rakuten search keywords in searchQueries, or paste direct Rakuten result URLs in startUrls.
- Set maxResults to the number of product listings you want.
- Keep useApifyProxy enabled for Rakuten reliability.
- Leave proxyGroups as RESIDENTIAL and proxyCountryCode as JP unless you know another route works.
- Click Start.
- Open the default dataset after the run finishes.
- Export the data as JSON, CSV, Excel, XML, RSS, or consume it through the dataset API.
Input parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| searchQueries | array | No* | Rakuten keywords to search. Japanese keywords usually work best. |
| startUrls | array | No* | Direct Rakuten search/category URLs. |
| maxResults | integer | No | Maximum product listings to save across all inputs. |
| useApifyProxy | boolean | No | Enables Apify Proxy. Recommended for Rakuten. |
| proxyGroups | array | No | Proxy groups to use. Prefill is RESIDENTIAL. |
| proxyCountryCode | string | No | Proxy country code. Prefill is JP. |
| maxRequestRetries | integer | No | Retry attempts for failed or blocked requests. |
*Provide at least one searchQueries item or one startUrls item.
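If you build the input programmatically, a small pre-flight check can catch an empty input before a run is started. This is a sketch of the rule in the footnote above; the `validate_input` helper is hypothetical and is not the actor's own validation code.

```python
def validate_input(actor_input: dict) -> dict:
    """Check the 'at least one query or URL' rule before starting a run."""
    queries = actor_input.get("searchQueries") or []
    start_urls = actor_input.get("startUrls") or []
    if not queries and not start_urls:
        raise ValueError("Provide at least one searchQueries item or one startUrls item.")

    max_results = actor_input.get("maxResults")
    if max_results is not None and (not isinstance(max_results, int) or max_results < 1):
        raise ValueError("maxResults must be a positive integer.")
    return actor_input


validate_input({"searchQueries": ["カメラ"], "maxResults": 20})  # passes
```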
Search queries
Use searchQueries for normal keyword searches. Japanese keywords usually produce the best results because Rakuten Ichiba is a Japanese marketplace. English terms can also work when Rakuten has matching product titles.
Example:
    {
      "searchQueries": ["ノートパソコン", "カメラ"],
      "maxResults": 50
    }
Direct URLs
Use startUrls when you already have a Rakuten search, mall, ranking, or category URL. The actor will request the URL and paginate with Rakuten's p parameter.
Example:
    {
      "startUrls": [
        { "url": "https://search.rakuten.co.jp/search/mall/%E3%82%AB%E3%83%A1%E3%83%A9/" }
      ],
      "maxResults": 100
    }
Recommended settings
For a first run, keep maxResults between 20 and 50. For recurring monitoring, set one keyword or one URL per run when you want stable ranking comparisons. For broad research, provide multiple keywords and increase maxResults after validating the first sample.
Proxy settings
Rakuten can return short reference pages or sparse HTML to some datacenter traffic. The actor therefore enables Apify Proxy by default. Residential proxies are prefilled because they are usually more reliable for marketplace pages.
If you already know your traffic source works without proxy, you can disable useApifyProxy. For production use, leave it enabled.
Pagination
The actor paginates by adding or updating Rakuten's p query parameter. It stops when it reaches maxResults, when a page no longer contains product cards, or when Rakuten returns no additional listings.
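The "add or update" step can be pictured with the standard library alone. The helper below is an illustrative sketch, not the actor's source; `with_page` is a hypothetical name, and the `min` filter parameter in the second call is made up for the example.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_page(url: str, page: int) -> str:
    """Return the URL with the p query parameter added or replaced."""
    parts = urlsplit(url)
    # Keep every existing parameter except a previous p value.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True) if k != "p"]
    query.append(("p", str(page)))
    return urlunsplit(parts._replace(query=urlencode(query)))


base = "https://search.rakuten.co.jp/search/mall/%E3%82%AB%E3%83%A1%E3%83%A9/"
print(with_page(base, 2))
# https://search.rakuten.co.jp/search/mall/%E3%82%AB%E3%83%A1%E3%83%A9/?p=2
print(with_page(base + "?min=1000&p=1", 3))
# https://search.rakuten.co.jp/search/mall/%E3%82%AB%E3%83%A1%E3%83%A9/?min=1000&p=3
```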
Data freshness
The actor scrapes live public pages. Prices, shipping snippets, point campaigns, and availability can change frequently. Use scrapedAt to identify when each row was collected.
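One way to use scrapedAt is to flag rows that are older than your freshness window. The sketch below assumes the timestamp format shown in the example output (ISO 8601 with a trailing `Z`); the 24-hour threshold and the helper names are arbitrary choices for illustration.

```python
from datetime import datetime, timedelta, timezone


def scraped_at(row: dict) -> datetime:
    # Older Python versions reject a trailing "Z", so map it to an explicit UTC offset.
    return datetime.fromisoformat(row["scrapedAt"].replace("Z", "+00:00"))


def is_stale(row: dict, now: datetime, max_age: timedelta = timedelta(hours=24)) -> bool:
    """True when the row was collected more than max_age before now."""
    return now - scraped_at(row) > max_age


row = {"scrapedAt": "2026-05-19T08:00:00.000Z"}
now = datetime(2026, 5, 20, 9, 0, tzinfo=timezone.utc)
print(is_stale(row, now))  # True: the row is 25 hours old
```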
Ranking analysis
Use position for the overall run order. Use page and pagePosition when you need to compare Rakuten page layout or detect ranking movement over time.
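To detect ranking movement between two runs of the same keyword, you can match rows by product URL and compare their position values. This is a minimal sketch that assumes `url` is a stable key across runs, which may not hold if Rakuten appends tracking parameters; `rank_changes` is a hypothetical helper.

```python
def rank_changes(previous: list, current: list) -> dict:
    """Map product URL -> (old position, new position) for listings that moved."""
    old = {row["url"]: row["position"] for row in previous}
    changes = {}
    for row in current:
        before = old.get(row["url"])
        if before is not None and before != row["position"]:
            changes[row["url"]] = (before, row["position"])
    return changes


yesterday = [{"url": "item-a", "position": 1}, {"url": "item-b", "position": 2}]
today = [{"url": "item-b", "position": 1}, {"url": "item-a", "position": 2},
         {"url": "item-c", "position": 3}]
print(rank_changes(yesterday, today))
# {'item-b': (2, 1), 'item-a': (1, 2)}
```

Listings that appear in only one of the two runs (such as `item-c` here) are ignored by this sketch; you may want to report them separately as new or dropped entries.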
Shop monitoring
The shopName field helps identify sellers active in a category. Not every Rakuten card exposes the shop name in the same markup, so this field can be empty for some rows.
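For example, counting listings per shop gives a quick view of which sellers dominate a result set. The sketch below skips rows where shopName is missing or empty; `listings_per_shop` is a hypothetical helper name.

```python
from collections import Counter


def listings_per_shop(rows: list) -> Counter:
    """Count listings per shop, ignoring rows without a shop name."""
    return Counter(row["shopName"] for row in rows if row.get("shopName"))


rows = [
    {"title": "Camera A", "shopName": "Shop A"},
    {"title": "Camera B", "shopName": "Shop B"},
    {"title": "Camera C", "shopName": "Shop A"},
    {"title": "Camera D"},  # shop name not visible on this card
]
print(listings_per_shop(rows).most_common(1))  # [('Shop A', 2)]
```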
Review monitoring
When visible, rating and reviewCount are parsed from the listing card. If Rakuten hides review data for a layout, the fields are omitted instead of guessed.
Images
The actor extracts the first product image URL from each card. Use imageUrl for visual QA, catalog matching, or downstream enrichment.
Badges and promotions
Rakuten cards often contain many promotional snippets. The actor collects short visible labels that look like coupons, points, sales, ranking badges, used/new labels, or free-shipping badges.
Tips
- Use Japanese search terms for better coverage.
- Paste a filtered Rakuten URL when you need a specific category, price range, or sort order.
- Keep scheduled monitoring inputs stable so ranking changes are comparable.
- Treat optional fields as optional; Rakuten does not show every signal on every card.
- Inspect rawText when troubleshooting a layout or missing field.
- Use sourceUrl and page to trace a row back to the exact result page.
Limitations
This actor focuses on listing cards, not full product detail pages. It does not log in, add products to carts, bypass paywalls, or access private seller dashboards. It extracts only information visible on public Rakuten pages.
Anti-bot behavior
If Rakuten returns a short reference page, the run log will include a warning. In that case, keep Apify Proxy enabled, use residential proxies, reduce the number of keywords in one run, and retry later.
Performance
The actor is HTTP-based and does not run a browser. This keeps compute use low compared with Playwright actors. Network and proxy cost are the main cost drivers for large jobs.
Dataset export
After a run, export the dataset as JSON, CSV, Excel, XML, or RSS from Apify. CSV is convenient for spreadsheets. JSON is recommended for API pipelines.
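The same exports are available over HTTP through the dataset items endpoint, using its `format` query parameter. The helper below only builds the URL and is a sketch under assumptions: `dataset_items_url` is a hypothetical name, the dataset ID is a placeholder, and private datasets additionally need your API token.

```python
from urllib.parse import urlencode

# Formats named in this README, written as the endpoint's format values.
EXPORT_FORMATS = {"json", "csv", "xlsx", "xml", "rss"}


def dataset_items_url(dataset_id: str, fmt: str = "json", fields=None) -> str:
    """Build a dataset items URL for the chosen export format."""
    if fmt not in EXPORT_FORMATS:
        raise ValueError(f"Unsupported format: {fmt}")
    params = {"format": fmt}
    if fields:
        # Restrict the export to selected columns, e.g. for a compact CSV.
        params["fields"] = ",".join(fields)
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{urlencode(params)}"


# "abc123" is a placeholder dataset ID.
print(dataset_items_url("abc123", "csv", fields=["title", "price"]))
# https://api.apify.com/v2/datasets/abc123/items?format=csv&fields=title%2Cprice
```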
Integrations
You can connect this actor to repeatable workflows:
- Scheduled price monitoring: run the same keyword every morning, export CSV, and compare price, position, and shopName against yesterday's dataset.
- BI dashboard pipeline: trigger the actor from an orchestrator, read the dataset API, and load product rows into BigQuery, Snowflake, Airtable, or Google Sheets.
- Competitive alerting: use an Apify webhook after each successful run, then send low-price or new-shop matches to Slack, email, or your internal alert system.
- Catalog enrichment: join title, imageUrl, and url with internal SKU matching logic.
- Agency reporting: run category URLs for multiple clients and export separate datasets for each recurring report.
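The price-monitoring and alerting workflows above boil down to comparing two datasets. A minimal sketch of that comparison follows; `price_drops` and the `min_drop` threshold are hypothetical, rows are matched by `url`, and rows without a parsed price are skipped.

```python
def price_drops(previous: list, current: list, min_drop: int = 1) -> list:
    """Return listings whose JPY price fell by at least min_drop since the previous run."""
    old = {r["url"]: r["price"] for r in previous if r.get("price") is not None}
    drops = []
    for r in current:
        before, now = old.get(r["url"]), r.get("price")
        if before is not None and now is not None and before - now >= min_drop:
            drops.append({"url": r["url"], "title": r.get("title"), "was": before, "now": now})
    return drops


yesterday = [{"url": "item-a", "title": "Camera A", "price": 24800},
             {"url": "item-b", "title": "Camera B", "price": 19800}]
today = [{"url": "item-a", "title": "Camera A", "price": 22800},
         {"url": "item-b", "title": "Camera B", "price": 19800},
         {"url": "item-c", "title": "Camera C"}]
print(price_drops(yesterday, today, min_drop=1000))
# [{'url': 'item-a', 'title': 'Camera A', 'was': 24800, 'now': 22800}]
```

The returned list can then be forwarded to Slack, email, or another alert channel by whatever client your pipeline already uses.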
API usage
You can run the actor from the Apify API, schedule it, or integrate it with webhooks. A common workflow is to schedule daily keyword runs and export the dataset to your analytics warehouse.
Node.js example
    import { ApifyClient } from 'apify-client';

    const client = new ApifyClient({ token: process.env.APIFY_TOKEN });

    // Start the actor and wait for the run to finish.
    const run = await client.actor('automation-lab/rakuten-ichiba-scraper').call({
        searchQueries: ['カメラ'],
        maxResults: 20,
        useApifyProxy: true,
        proxyGroups: ['RESIDENTIAL'],
        proxyCountryCode: 'JP',
    });

    console.log(`Dataset: https://api.apify.com/v2/datasets/${run.defaultDatasetId}/items`);
Python example
    import os

    from apify_client import ApifyClient

    client = ApifyClient(os.environ['APIFY_TOKEN'])

    # Start the actor and wait for the run to finish.
    run = client.actor('automation-lab/rakuten-ichiba-scraper').call(run_input={
        'searchQueries': ['カメラ'],
        'maxResults': 20,
        'useApifyProxy': True,
        'proxyGroups': ['RESIDENTIAL'],
        'proxyCountryCode': 'JP',
    })

    print(run['defaultDatasetId'])
cURL example
    curl -X POST "https://api.apify.com/v2/acts/automation-lab~rakuten-ichiba-scraper/runs?token=$APIFY_TOKEN" \
      -H 'Content-Type: application/json' \
      -d '{"searchQueries":["カメラ"],"maxResults":20,"useApifyProxy":true,"proxyGroups":["RESIDENTIAL"],"proxyCountryCode":"JP"}'
Example API input
    {
      "searchQueries": ["スマートウォッチ"],
      "maxResults": 200,
      "useApifyProxy": true,
      "proxyGroups": ["RESIDENTIAL"],
      "maxRequestRetries": 3
    }
MCP
Use Apify's hosted MCP endpoint when connecting Claude Code, Claude Desktop, Cursor, VS Code, or another MCP-compatible client. The tool URL should include this actor slug so the client exposes the Rakuten Ichiba tool directly.
Claude Code MCP setup
    claude mcp add --transport http apify-rakuten-ichiba "https://mcp.apify.com?tools=automation-lab/rakuten-ichiba-scraper"
Claude Desktop, Cursor, and VS Code MCP JSON
    {
      "mcpServers": {
        "apify-rakuten-ichiba": {
          "type": "http",
          "url": "https://mcp.apify.com?tools=automation-lab/rakuten-ichiba-scraper"
        }
      }
    }
After connecting, authenticate with your Apify account/token as required by your MCP client.
Example MCP prompts
Try these example prompts after connecting the Apify MCP server:
- "Run Rakuten Ichiba Scraper for カメラ with 30 results and summarize the cheapest products."
- "Scrape this Rakuten category URL and return the top shops by listing count."
- "Monitor ノートパソコン on Rakuten Ichiba and compare prices with the previous dataset."
Legality
The actor collects publicly visible Rakuten Ichiba listing information. Use it responsibly, respect Rakuten's terms, avoid excessive request volumes, and do not use the output for spam, deceptive activity, or privacy-invasive profiling. You are responsible for ensuring your use case complies with applicable laws and marketplace policies.
Related scrapers
Other Automation Lab actors that can support ecommerce and marketplace workflows:
- Amazon Scraper: collect Amazon product search and listing data.
- Shopee Reviews Scraper: gather Shopee review data for product research.
- Made-in-China Scraper: extract supplier and product listings from Made-in-China.
- Trustpilot Scraper: monitor company reviews and ratings.
Use Rakuten Ichiba Scraper when your target is public product listing data from Rakuten Japan.
Troubleshooting
If a run returns no data, check the log for block-page warnings and verify that the input URL opens in a normal browser. Try a smaller maxResults, keep useApifyProxy enabled, and use Japanese keywords when possible.
If only some fields are empty, Rakuten probably did not show those values on that particular listing card. The actor does not invent missing ratings, review counts, or shop names.
Quality notes
Rakuten changes markup over time. The extractor uses multiple selector and text-pattern fallbacks instead of depending on one CSS class. The rawText field is included to make troubleshooting easier when Rakuten tests a new layout.
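To illustrate the idea of text-pattern fallbacks, the sketch below tries several price patterns in order and returns the first match. The patterns and the `parse_price` name are assumptions made for this example; they are not the actor's actual extraction rules.

```python
import re
from typing import Optional

# Tried in order; the first pattern that yields digits wins.
PRICE_PATTERNS = [
    re.compile(r"([\d,]+)\s*円"),     # e.g. 24,800円
    re.compile(r"[¥￥]\s*([\d,]+)"),  # e.g. ¥24,800
]


def parse_price(text: str) -> Optional[int]:
    """Return the first JPY amount found in the text, or None if no pattern matches."""
    for pattern in PRICE_PATTERNS:
        match = pattern.search(text)
        if match:
            digits = match.group(1).replace(",", "")
            if digits:
                return int(digits)
    return None  # leave the field empty rather than guess


print(parse_price("24,800円 送料無料"))  # 24800
print(parse_price("¥1,980"))             # 1980
print(parse_price("売り切れ"))            # None
```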
Best practices
Start with a small run. Inspect the output. Then scale to more keywords or pages. For tracking, keep the same query spelling and sorting URL between runs.
Support
If you need help, include the Apify run URL, your input JSON, and a short explanation of what you expected to see. This makes it possible to reproduce the issue quickly.
Changelog
0.1.0: Initial release for Rakuten Ichiba search/category product listing cards.
FAQ
Does it scrape product detail pages?
No. Version 0.1 extracts listing-card data from public search/category pages.
Can I scrape by category?
Yes. Paste a Rakuten category or search URL into startUrls.
Is Japanese input required?
No, but Japanese keywords usually provide better Rakuten coverage.
Why are some fields empty?
Rakuten does not show every detail on every card. Empty optional fields mean the value was not visible or could not be parsed safely.
Can I use this for price monitoring?
Yes. Schedule recurring runs and compare price, position, shopName, and scrapedAt over time.