Datamuse Word Finder Scraper
Pricing
from $3.75 / 1,000 result items
Datamuse Word Finder Scraper
Export English word lists from the Datamuse word-finder: synonyms, antonyms, rhymes, sound-alikes, hypernyms, holonyms, frequency scores, syllable counts, definitions, and part-of-speech tags. Pick a relation type, seed word, optional topic, and download the full result set.
Pricing
from $3.75 / 1,000 result items
Rating
0.0
(0)
Developer
ParseForge
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Categories
Share

📝 Datamuse Word Finder Scraper
🚀 Export English word lists in seconds. Pull synonyms, antonyms, rhymes, sound-alikes, hypernyms, holonyms, definitions, frequency scores, and pronunciations across 370,000+ words. No registration, no API key, no manual JSON wrangling.
🕒 Last updated: 2026-05-20 · 📊 11 fields per word · 🔤 370k+ words queryable · 🔗 17 relation types
The Datamuse Word Finder Scraper exports English word lists from the Datamuse word-finder and returns up to 11 fields per word, including the word itself, relation score, syllable count, part-of-speech tags, frequency per million tokens, ARPABET pronunciation, and one or more dictionary definitions. The underlying dataset is one of the largest open word-relation indexes for English, covering 370,000+ unique forms.
Coverage spans 17 relation types: means-like (semantic), sounds-like (phonetic), spelled-like (wildcards), rhymes, near-rhymes, synonyms, antonyms, hypernyms (kind of), hyponyms (more specific), holonyms (part of), meronyms (has parts), triggers (co-occurrence), follows / precedes (collocations), adjectives-for-noun, nouns-for-adjective, and frequency-sorted lists.
| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| NLP developers, crossword and word-game builders, content writers, SEO keyword strategists, ESL and EdTech apps, copywriters, lyric and poetry tools | Vocabulary expansion, rhyme finding, sound-alike search, keyword research, autocomplete dictionaries, thesaurus apps, language-learning drills |
📋 What the Datamuse Word Finder Scraper does
Seventeen relation-typed lookups in a single Actor:
- 🧠 Means like. Semantic neighbors (synonyms, related concepts).
- 🔊 Sounds like / Rhymes / Near rhymes. Phonetic similarity for songwriting, poetry, mnemonics.
- 🔡 Spelled like. Wildcard lookups (
?,*) for crossword and word-puzzle apps. - 📚 Synonyms / Antonyms. Classic thesaurus relations.
- 🌳 Hypernyms / Hyponyms / Holonyms / Meronyms. Taxonomy relations (
is a kind of,is part of). - 🔁 Triggers / Follows / Precedes. Co-occurrence and collocation lookups for n-gram suggestions.
- 🏷️ Adjectives for noun / Nouns for adjective. Grammatical companions for autocomplete.
- 📊 Frequent. Frequency-ranked word lists with wildcards.
Optional metadata add-ons return definitions, parts of speech, syllable counts, frequency per million tokens, and ARPABET pronunciations.
💡 Why it matters: building a word-aware app means juggling thesauri, rhyming dictionaries, frequency tables, and pronunciation lexicons. Datamuse exposes all of that through a single uniform interface; this Actor wraps it for repeatable, schedulable runs that you can pipe into a spreadsheet, notebook, or downstream service.
🎬 Full Demo
🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.
⚙️ Input
| Input | Type | Default | Behavior |
|---|---|---|---|
maxItems | integer | 10 | Words to return. Free plan caps at 10, paid plan at 1,000,000. |
queryType | string | "means_like" | One of 17 relation types. |
seedWord | string | "happy" | Seed word or pattern. Wildcards (?, *) supported for spelled-like. |
topicWord | string | "" | Optional topic bias. |
leftContext / rightContext | string | "" | Words immediately before/after the target. |
partOfSpeech | string | "" | Restrict to noun, verb, adjective, or adverb. |
includeDefs / includeTags | boolean | true | Include definitions and POS/frequency/pronunciation tags. |
Example: synonyms of "happy" with definitions.
{"maxItems": 25,"queryType": "means_like","seedWord": "happy","includeDefs": true,"includeTags": true}
Example: words that rhyme with "moon" biased toward astronomy.
{"maxItems": 50,"queryType": "rhymes","seedWord": "moon","topicWord": "astronomy"}
⚠️ Good to Know: Datamuse caps each query at 1,000 results, so for very large word lists, break your request into multiple seeded queries (alphabetical wildcards, multiple topics) and concatenate datasets. The default returns are ranked by relevance, not alphabetical order.
📊 Output
Each word record contains up to 11 fields. Download as CSV, Excel, JSON, or XML.
🧾 Schema
| Field | Type | Example |
|---|---|---|
🔤 word | string | "pleased" |
🔁 queryType | string | "means_like" |
🌱 seedWord | string | null | "happy" |
📊 score | number | null | 40004395 |
🔢 numSyllables | number | null | 1 |
🏷️ partsOfSpeech | string[] | null | ["adjective", "verb"] |
📈 frequencyPerMillion | number | null | 19.64 |
🔊 pronunciation | string | null | "P L IY1 Z D" (ARPABET) |
📚 definitions | string[] | null | ["adj Happy, content."] |
🏷️ tags | string[] | null | Raw Datamuse tag list |
🕒 scrapedAt | ISO 8601 | Collection timestamp |
📦 Sample records
✨ Why choose this Actor
| Capability | |
|---|---|
| 🔤 | Massive coverage. 370,000+ unique English forms across 17 relation types. |
| 🎯 | Flexible queries. Combine relation type with topic bias, left/right context, and part-of-speech filter. |
| 📚 | Rich metadata. Optional definitions, ARPABET pronunciations, syllable counts, and frequency per million tokens. |
| ⚡ | Fast. Hundreds of words in under 5 seconds. |
| 🔁 | Always fresh. Live word-finder pulls reflect the current Datamuse index. |
| 🧩 | Wildcards. Spelled-like supports ? (one char) and * (any) for crossword and pattern matching. |
| 🚫 | No authentication. Public word-finder. No login or token required. |
📊 Datamuse is one of the most-used open word APIs in the indie and educational developer community.
📈 How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| ⭐ Datamuse Word Finder Scraper (this Actor) | $5 free credit, then pay-per-use | 370k+ words | Live per run | 17 relation types, topic, context, POS | ⚡ 2 min |
| Direct Datamuse queries | Free, rate-limited | Full | Live | Manual | 🛠️ Coding |
| Commercial NLP word APIs | $$$+/month | Variable | Variable | Many | ⏳ Long signup |
| Local thesaurus / WordNet dumps | Free | Subset, stale | Rarely | Manual | 🕒 Setup time |
Pick this Actor when you want repeatable, schedulable word lists in CSV/JSON without writing query loops or rate-limiting handlers.
🚀 How to use
- 📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
- 🌐 Open the Actor. Go to the Datamuse Word Finder Scraper page on the Apify Store.
- 🎯 Set input. Pick a relation type, set a seed word, and add optional topic or context.
- 🚀 Run it. Click Start and let the Actor collect your data.
- 📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.
⏱️ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.
💼 Business use cases
🔌 Automating Datamuse Word Finder Scraper
Control the scraper programmatically for scheduled runs and pipeline integrations:
- 🟢 Node.js. Install the
apify-clientNPM package. - 🐍 Python. Use the
apify-clientPyPI package. - 📚 See the Apify documentation for full details.
The Apify Schedules feature lets you trigger this Actor on any cron interval. Build keyword-expansion pipelines that refresh weekly without manual prompting.
🌟 Beyond business use cases
Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.
🤖 Ask an AI assistant about this scraper
Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:
- 💬 ChatGPT
- 🧠 Claude
- 🔍 Perplexity
- 🅒 Copilot
❓ Frequently Asked Questions
🧩 How does it work?
Pick a relation type and seed word, click Start, and the Actor queries the open word-finder and returns one structured record per matching word, ranked by relevance.
📏 How accurate is the data?
Datamuse derives its relations from corpora including WordNet, Wiktionary, and the Google Books n-gram dataset. Scoring is empirical and tuned for usefulness, not strict linguistic correctness. Frequency scores reflect Google Books usage.
🔁 How often is the dataset refreshed?
Datamuse maintains a continuously updated word index. Every run pulls live, so your results reflect the current version of the underlying lexicons.
🔊 What format is the pronunciation field?
ARPABET, with stress markers. The numbers after vowels indicate primary (1), secondary (2), or no (0) stress.
⏰ Can I schedule regular runs?
Yes. Use Apify Schedules to run this Actor on any cron interval (daily, weekly) to refresh keyword sets or vocabulary lists.
⚖️ Is this data legal to use?
Datamuse's open word index is intended for free use. Some underlying source corpora (WordNet, Wiktionary) have their own licenses; review them for redistribution at scale.
💼 Can I use this data commercially?
Yes for non-substantial extracts. Datamuse's terms ask large-volume users to be considerate; if you need very high volumes, consider their commercial license.
💳 Do I need a paid Apify plan to use this Actor?
No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit.
🔁 What happens if a run fails or gets interrupted?
Apify automatically retries transient errors. Partial datasets are preserved.
🧩 Can I do wildcard searches like crossword apps?
Yes. Use queryType: spelled_like with ? (one character) and * (any sequence). Example: h?ll? matches hallo, hello, hills, and more.
🆘 What if I need help?
Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.
🔌 Integrate with any app
Datamuse Word Finder Scraper connects to any cloud service via Apify integrations:
- Make - Automate multi-step workflows
- Zapier - Connect with 5,000+ apps
- Slack - Get run notifications in your channels
- Airbyte - Pipe word data into your warehouse
- GitHub - Trigger runs from commits and releases
- Google Drive - Export datasets straight to Sheets
You can also use webhooks to trigger downstream actions when a run finishes. Build keyword pipelines that refresh content briefs or app vocabularies automatically.
🔗 Recommended Actors
- 📐 arXiv Preprint Scraper - Open-access research papers
- 🐉 Open5e SRD 5.1 Scraper - SRD 5.1 monsters, spells, and items
- 🖼️ Art Institute of Chicago Scraper - AIC catalog metadata
- 📊 OurAirports Global Airport Database Scraper - Worldwide airport reference data
💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.
🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.