Datamuse Word Finder Scraper avatar

Datamuse Word Finder Scraper

Pricing

from $3.75 / 1,000 result items

Go to Apify Store
Datamuse Word Finder Scraper

Datamuse Word Finder Scraper

Export English word lists from the Datamuse word-finder: synonyms, antonyms, rhymes, sound-alikes, hypernyms, holonyms, frequency scores, syllable counts, definitions, and part-of-speech tags. Pick a relation type, seed word, optional topic, and download the full result set.

Pricing

from $3.75 / 1,000 result items

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

ParseForge Banner

📝 Datamuse Word Finder Scraper

🚀 Export English word lists in seconds. Pull synonyms, antonyms, rhymes, sound-alikes, hypernyms, holonyms, definitions, frequency scores, and pronunciations across 370,000+ words. No registration, no API key, no manual JSON wrangling.

🕒 Last updated: 2026-05-20 · 📊 11 fields per word · 🔤 370k+ words queryable · 🔗 17 relation types

The Datamuse Word Finder Scraper exports English word lists from the Datamuse word-finder and returns up to 11 fields per word, including the word itself, relation score, syllable count, part-of-speech tags, frequency per million tokens, ARPABET pronunciation, and one or more dictionary definitions. The underlying dataset is one of the largest open word-relation indexes for English, covering 370,000+ unique forms.

Coverage spans 17 relation types: means-like (semantic), sounds-like (phonetic), spelled-like (wildcards), rhymes, near-rhymes, synonyms, antonyms, hypernyms (kind of), hyponyms (more specific), holonyms (part of), meronyms (has parts), triggers (co-occurrence), follows / precedes (collocations), adjectives-for-noun, nouns-for-adjective, and frequency-sorted lists.

🎯 Target Audience💡 Primary Use Cases
NLP developers, crossword and word-game builders, content writers, SEO keyword strategists, ESL and EdTech apps, copywriters, lyric and poetry toolsVocabulary expansion, rhyme finding, sound-alike search, keyword research, autocomplete dictionaries, thesaurus apps, language-learning drills

📋 What the Datamuse Word Finder Scraper does

Seventeen relation-typed lookups in a single Actor:

  • 🧠 Means like. Semantic neighbors (synonyms, related concepts).
  • 🔊 Sounds like / Rhymes / Near rhymes. Phonetic similarity for songwriting, poetry, mnemonics.
  • 🔡 Spelled like. Wildcard lookups (?, *) for crossword and word-puzzle apps.
  • 📚 Synonyms / Antonyms. Classic thesaurus relations.
  • 🌳 Hypernyms / Hyponyms / Holonyms / Meronyms. Taxonomy relations (is a kind of, is part of).
  • 🔁 Triggers / Follows / Precedes. Co-occurrence and collocation lookups for n-gram suggestions.
  • 🏷️ Adjectives for noun / Nouns for adjective. Grammatical companions for autocomplete.
  • 📊 Frequent. Frequency-ranked word lists with wildcards.

Optional metadata add-ons return definitions, parts of speech, syllable counts, frequency per million tokens, and ARPABET pronunciations.

💡 Why it matters: building a word-aware app means juggling thesauri, rhyming dictionaries, frequency tables, and pronunciation lexicons. Datamuse exposes all of that through a single uniform interface; this Actor wraps it for repeatable, schedulable runs that you can pipe into a spreadsheet, notebook, or downstream service.


🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.


⚙️ Input

InputTypeDefaultBehavior
maxItemsinteger10Words to return. Free plan caps at 10, paid plan at 1,000,000.
queryTypestring"means_like"One of 17 relation types.
seedWordstring"happy"Seed word or pattern. Wildcards (?, *) supported for spelled-like.
topicWordstring""Optional topic bias.
leftContext / rightContextstring""Words immediately before/after the target.
partOfSpeechstring""Restrict to noun, verb, adjective, or adverb.
includeDefs / includeTagsbooleantrueInclude definitions and POS/frequency/pronunciation tags.

Example: synonyms of "happy" with definitions.

{
"maxItems": 25,
"queryType": "means_like",
"seedWord": "happy",
"includeDefs": true,
"includeTags": true
}

Example: words that rhyme with "moon" biased toward astronomy.

{
"maxItems": 50,
"queryType": "rhymes",
"seedWord": "moon",
"topicWord": "astronomy"
}

⚠️ Good to Know: Datamuse caps each query at 1,000 results, so for very large word lists, break your request into multiple seeded queries (alphabetical wildcards, multiple topics) and concatenate datasets. The default returns are ranked by relevance, not alphabetical order.


📊 Output

Each word record contains up to 11 fields. Download as CSV, Excel, JSON, or XML.

🧾 Schema

FieldTypeExample
🔤 wordstring"pleased"
🔁 queryTypestring"means_like"
🌱 seedWordstring | null"happy"
📊 scorenumber | null40004395
🔢 numSyllablesnumber | null1
🏷️ partsOfSpeechstring[] | null["adjective", "verb"]
📈 frequencyPerMillionnumber | null19.64
🔊 pronunciationstring | null"P L IY1 Z D" (ARPABET)
📚 definitionsstring[] | null["adj Happy, content."]
🏷️ tagsstring[] | nullRaw Datamuse tag list
🕒 scrapedAtISO 8601Collection timestamp

📦 Sample records


✨ Why choose this Actor

Capability
🔤Massive coverage. 370,000+ unique English forms across 17 relation types.
🎯Flexible queries. Combine relation type with topic bias, left/right context, and part-of-speech filter.
📚Rich metadata. Optional definitions, ARPABET pronunciations, syllable counts, and frequency per million tokens.
Fast. Hundreds of words in under 5 seconds.
🔁Always fresh. Live word-finder pulls reflect the current Datamuse index.
🧩Wildcards. Spelled-like supports ? (one char) and * (any) for crossword and pattern matching.
🚫No authentication. Public word-finder. No login or token required.

📊 Datamuse is one of the most-used open word APIs in the indie and educational developer community.


📈 How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
⭐ Datamuse Word Finder Scraper (this Actor)$5 free credit, then pay-per-use370k+ wordsLive per run17 relation types, topic, context, POS⚡ 2 min
Direct Datamuse queriesFree, rate-limitedFullLiveManual🛠️ Coding
Commercial NLP word APIs$$$+/monthVariableVariableMany⏳ Long signup
Local thesaurus / WordNet dumpsFreeSubset, staleRarelyManual🕒 Setup time

Pick this Actor when you want repeatable, schedulable word lists in CSV/JSON without writing query loops or rate-limiting handlers.


🚀 How to use

  1. 📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. 🌐 Open the Actor. Go to the Datamuse Word Finder Scraper page on the Apify Store.
  3. 🎯 Set input. Pick a relation type, set a seed word, and add optional topic or context.
  4. 🚀 Run it. Click Start and let the Actor collect your data.
  5. 📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


💼 Business use cases

✍️ Content and SEO

  • Expand keyword sets with means-like and triggers queries
  • Find low-frequency long-tail variants for blog posts
  • Build content brief libraries with topic-biased thesaurus pulls
  • Generate alt-text and meta-description candidates

🎮 Word games and puzzles

  • Crossword builders with spelled_like wildcard searches
  • Rhyme-game seed lists for songwriting and battle-rap apps
  • Hangman/Wordle dictionaries filtered by length and frequency
  • Anagram and pangram word pools

📚 EdTech and language learning

  • Vocabulary lists keyed to CEFR level via frequency tiers
  • Pronunciation drills using ARPABET output
  • Synonym/antonym flashcards for SAT/GRE prep apps
  • ESL exercises with co-occurring word pairs

🤖 NLP and product features

  • Power autocomplete with relevance-ranked candidates
  • Build domain-specific thesauri (medical, legal, finance)
  • Generate paraphrase candidates for content rewriting
  • Seed embeddings benchmarks with curated word pairs

🔌 Automating Datamuse Word Finder Scraper

Control the scraper programmatically for scheduled runs and pipeline integrations:

  • 🟢 Node.js. Install the apify-client NPM package.
  • 🐍 Python. Use the apify-client PyPI package.
  • 📚 See the Apify documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Build keyword-expansion pipelines that refresh weekly without manual prompting.


🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

  • Corpus linguistics studies on word relations
  • Reproducible datasets for psycholinguistic experiments
  • NLP coursework with hands-on word-finder exercises
  • Frequency-based reading-difficulty research

🎨 Personal and creative

  • Songwriters mining rhymes and near-rhymes
  • Poets building thematic word palettes
  • Bloggers stretching headline vocabulary
  • Hobby word-list collectors and dictionary nerds

🤝 Non-profit and civic

  • Accessibility tools for dyslexia and language disorders
  • Literacy-program word lists for community libraries
  • Translation/localization aid kits for non-profit content
  • Open educational resources for ESL classrooms

🧪 Experimentation

  • Train domain-specific word2vec or fastText models
  • Prompt engineering for LLM paraphrase and rewriting
  • Agent pipelines that propose alternative phrasing
  • Benchmark embeddings on synonym and analogy tasks

🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:


❓ Frequently Asked Questions

🧩 How does it work?

Pick a relation type and seed word, click Start, and the Actor queries the open word-finder and returns one structured record per matching word, ranked by relevance.

📏 How accurate is the data?

Datamuse derives its relations from corpora including WordNet, Wiktionary, and the Google Books n-gram dataset. Scoring is empirical and tuned for usefulness, not strict linguistic correctness. Frequency scores reflect Google Books usage.

🔁 How often is the dataset refreshed?

Datamuse maintains a continuously updated word index. Every run pulls live, so your results reflect the current version of the underlying lexicons.

🔊 What format is the pronunciation field?

ARPABET, with stress markers. The numbers after vowels indicate primary (1), secondary (2), or no (0) stress.

⏰ Can I schedule regular runs?

Yes. Use Apify Schedules to run this Actor on any cron interval (daily, weekly) to refresh keyword sets or vocabulary lists.

Datamuse's open word index is intended for free use. Some underlying source corpora (WordNet, Wiktionary) have their own licenses; review them for redistribution at scale.

💼 Can I use this data commercially?

Yes for non-substantial extracts. Datamuse's terms ask large-volume users to be considerate; if you need very high volumes, consider their commercial license.

💳 Do I need a paid Apify plan to use this Actor?

No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit.

🔁 What happens if a run fails or gets interrupted?

Apify automatically retries transient errors. Partial datasets are preserved.

🧩 Can I do wildcard searches like crossword apps?

Yes. Use queryType: spelled_like with ? (one character) and * (any sequence). Example: h?ll? matches hallo, hello, hills, and more.

🆘 What if I need help?

Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.


🔌 Integrate with any app

Datamuse Word Finder Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe word data into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Build keyword pipelines that refresh content briefs or app vocabularies automatically.


💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.