Dictionary Word Definitions Scraper

Pricing

from $12.00 / 1,000 result items

Pull English word definitions, phonetics, audio pronunciations, parts of speech, examples, synonyms, and antonyms. Look up a word list, or paste a paragraph and the Actor breaks it into per-word records. Useful for language apps, NLP, vocabulary builders, and content tools.

Rating

0.0 (0)

Developer

ParseForge

Maintained by Community

Actor stats

Bookmarked: 0

Total users: 2

Monthly active users: 1

Last modified: 2 days ago

📖 Dictionary Word Definitions Scraper

🚀 Pull English word definitions, phonetics, audio, parts of speech, examples, synonyms, antonyms.

🕒 Last updated: 2026-05-07 · 📊 14 fields per record · English dictionary · definitions, phonetics, audio pronunciations, POS, examples, synonyms, antonyms

The Dictionary Word Definitions Scraper pulls structured English word definitions from a free public dictionary API. Output includes the headword, phonetic transcription, audio pronunciation URL, part of speech, definition, usage example, synonyms, and antonyms. Multiple entries per word are returned (one per part of speech / definition).

The Actor accepts a word list directly or a paragraph that it tokenizes. Stop-words ("the", "a", "of") are filtered out automatically. Useful for vocabulary apps, language-learning supplements, NLP corpora, and content tools.

| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| Language-learning apps, NLP/ML pipelines, content marketers, writers, vocabulary builders, accessibility tools | Vocabulary builder apps, language-learning supplements, NLP training datasets, blog post enrichment, content writing |

📋 What the Dictionary Word Definitions Scraper does

Five capabilities in a single run:

  • 📋 Word list lookup. Pass a list and pull each word's definition.
  • 📝 Paragraph tokenizer. Paste a paragraph; the Actor splits it and looks up each unique non-stop word.
  • 🔉 Audio pronunciation. Direct audio URL when available.
  • 📚 Multi-entry per word. One row per (POS, definition) pair, capped at 4 per word.
  • 🔗 Synonyms + antonyms. Both arrays included per definition.

💡 Why it matters: stop-word filtering happens inside the Actor, and every run queries the dictionary live, so the data is current.
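
The "multi-entry per word" behavior in the list above can be sketched as a flattening step. This is a minimal illustration, not the Actor's actual code: the payload keys (`phonetics`, `meanings`, `definitions`, `sourceUrls`) are an assumption about the shape of the underlying dictionary response, `flatten_entries` is a hypothetical helper, and `SAMPLE_RESPONSE` is hand-written for the example rather than captured API output.

```python
from typing import Any, Optional

MAX_ROWS_PER_WORD = 4  # cap stated in this README

# Hand-written sample shaped like the ASSUMED dictionary response; not real API output.
SAMPLE_RESPONSE: list[dict[str, Any]] = [
    {
        "word": "hello",
        "phonetic": "/həˈləʊ/",
        "phonetics": [{"text": "/həˈləʊ/", "audio": ""}],
        "meanings": [
            {
                "partOfSpeech": "noun",
                "definitions": [
                    {"definition": '"Hello!" or an equivalent greeting.', "synonyms": [], "antonyms": []}
                ],
            },
            {
                "partOfSpeech": "verb",
                "definitions": [
                    {"definition": 'To greet with "hello".', "synonyms": [], "antonyms": []}
                ],
            },
        ],
        "sourceUrls": ["https://en.wiktionary.org/wiki/hello"],
    }
]


def flatten_entries(entries: list[dict[str, Any]], cap: int = MAX_ROWS_PER_WORD) -> list[dict[str, Any]]:
    """Return one row per (part of speech, definition) pair, at most `cap` rows."""
    rows: list[dict[str, Any]] = []
    # The FAQ says rows come from the first entry, so later entries are ignored here.
    for entry in entries[:1]:
        phonetics = entry.get("phonetics") or []
        # Pick the first non-empty audio URL, or None when no recording exists.
        audio_url: Optional[str] = next((p["audio"] for p in phonetics if p.get("audio")), None)
        source_urls = entry.get("sourceUrls") or []
        for meaning in entry.get("meanings") or []:
            for sense in meaning.get("definitions") or []:
                if len(rows) >= cap:
                    return rows
                rows.append(
                    {
                        "word": entry.get("word"),
                        "phonetic": entry.get("phonetic"),
                        "audioUrl": audio_url,
                        "partOfSpeech": meaning.get("partOfSpeech"),
                        "definition": sense.get("definition"),
                        "example": sense.get("example"),
                        # Empty lists are reported as None, mirroring the nulls in the schema examples.
                        "synonyms": sense.get("synonyms") or None,
                        "antonyms": sense.get("antonyms") or None,
                        "sourceUrl": source_urls[0] if source_urls else None,
                    }
                )
    return rows


rows = flatten_entries(SAMPLE_RESPONSE)
```

With the sample above, `rows` holds two records for "hello": one for the noun sense and one for the verb sense.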


🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.


⚙️ Input

| Input | Type | Default | Behavior |
|---|---|---|---|
| maxItems | integer | 10 | Records to return. Free plan caps at 10, paid plan up to 1,000,000. |
| words | string | newline list | Words to look up. |
| paragraph | string | "" | Paragraph mode: tokenizes input into unique non-stop words. |

Example: look up specific words.

{
  "maxItems": 50,
  "words": "hello\nlanguage\nserendipity\netymology\nalgorithm"
}

Example: extract vocabulary from a paragraph.

{
  "maxItems": 100,
  "paragraph": "The serendipitous discovery led to a revolutionary breakthrough in modern computing science."
}
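
To preview which words a paragraph is likely to produce, paragraph mode can be approximated locally. This is a rough sketch under stated assumptions: the Actor's real stop-word list and minimum word length are not documented here, so `STOP_WORDS` and `min_length=3` are illustrative guesses, and `extract_vocabulary` is a hypothetical helper.

```python
import re

# Small illustrative stop-word list; the Actor's actual list is not documented.
STOP_WORDS = {"the", "a", "an", "of", "to", "in", "and", "or", "is", "for", "on", "with"}


def extract_vocabulary(paragraph: str, min_length: int = 3) -> list[str]:
    """Return unique, lowercased, non-stop words in order of first appearance."""
    # Lowercase, then keep runs of letters (allowing one internal apostrophe).
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", paragraph.lower())
    seen: set[str] = set()
    vocabulary: list[str] = []
    for token in tokens:
        if token in STOP_WORDS or len(token) < min_length or token in seen:
            continue
        seen.add(token)
        vocabulary.append(token)
    return vocabulary


words = extract_vocabulary(
    "The serendipitous discovery led to a revolutionary breakthrough in modern computing science."
)
```

Under these assumptions the example paragraph yields eight candidate words, starting with "serendipitous" and ending with "science". The Actor may keep or drop different words.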

📊 Output

Each record contains 14 fields; the schema below lists nine of them. Download as CSV, Excel, JSON, or XML.

🧾 Schema

| Field | Type | Example |
|---|---|---|
| 🔤 word | string | "hello" |
| 🔉 phonetic | string | "/həˈləʊ/" |
| 🔉 audioUrl | string | "https://api.dictionaryapi.dev/media/.../hello.mp3" |
| 🏷️ partOfSpeech | string | "noun" |
| 📜 definition | string | "\"Hello!\" or an equivalent greeting." |
| 📋 example | string | null |
| 🔗 synonyms | array | null |
| 🔗 antonyms | array | null |
| 🔗 sourceUrl | string | "https://en.wiktionary.org/wiki/hello" |
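
Because `example`, `synonyms`, and `antonyms` can be null, downstream code is easier to write against a typed record with safe defaults. The sketch below covers only the fields listed in the schema above; `DefinitionRecord` and `parse_record` are hypothetical names, and the sample item reuses the schema's example values (with `audioUrl` left as null rather than guessing the elided URL).

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class DefinitionRecord:
    """One dataset row, limited to the fields shown in the schema table."""

    word: str
    part_of_speech: Optional[str] = None
    definition: Optional[str] = None
    phonetic: Optional[str] = None
    audio_url: Optional[str] = None
    example: Optional[str] = None
    synonyms: list[str] = field(default_factory=list)
    antonyms: list[str] = field(default_factory=list)
    source_url: Optional[str] = None


def parse_record(item: dict[str, Any]) -> DefinitionRecord:
    """Map a raw dataset item to a record, turning null arrays into empty lists."""
    return DefinitionRecord(
        word=item["word"],
        part_of_speech=item.get("partOfSpeech"),
        definition=item.get("definition"),
        phonetic=item.get("phonetic"),
        audio_url=item.get("audioUrl"),
        example=item.get("example"),
        synonyms=list(item.get("synonyms") or []),
        antonyms=list(item.get("antonyms") or []),
        source_url=item.get("sourceUrl"),
    )


record = parse_record(
    {
        "word": "hello",
        "phonetic": "/həˈləʊ/",
        "audioUrl": None,
        "partOfSpeech": "noun",
        "definition": '"Hello!" or an equivalent greeting.',
        "example": None,
        "synonyms": None,
        "antonyms": None,
        "sourceUrl": "https://en.wiktionary.org/wiki/hello",
    }
)
```

Normalizing null arrays to empty lists lets callers iterate over `record.synonyms` without a null check.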


✨ Why choose this Actor

| | Capability |
|---|---|
| 📚 | Free dictionary. No API key, no quota for moderate use. |
| 🔉 | Audio pronunciation. Direct MP3 URLs when available. |
| 🏷️ | Multiple entries. One row per part of speech and definition. |
| 📝 | Paragraph tokenizer. Paste text and extract vocabulary automatically. |
| | Fast. 100 lookups in under a minute. |

📈 How it compares to alternatives

| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| ⭐ This Actor | $5 free credit | English dictionary | Live per run | word list or paragraph | ⚡ 2 min |
| Manual dictionary scraping | Free | All | Live | DIY | 🐢 Days |
| OED API | $$ | Larger but paywalled | Live | Yes | 🐢 Subscription |
| Wiktionary scraping | Free | Mixed quality | Live | DIY | 🐢 Days |

🚀 How to use

  1. 📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. 🌐 Open the Actor. Find the Dictionary Word Definitions Scraper on the Apify Store.
  3. 🎯 Set input. Enter a word list or a paragraph, and set maxItems.
  4. 🚀 Run it. Click Start.
  5. 📥 Download. Grab results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to dataset: 3-5 minutes. No coding required.


💼 Business use cases

📱 Language-Learning Apps

  • Vocabulary builder lessons
  • Word-of-the-day content
  • Pronunciation lessons
  • Definition tooltips

🤖 NLP + ML

  • Train word-definition embeddings
  • Build synonym/antonym graphs
  • Vocabulary classification corpora
  • POS-tagging datasets
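
As an illustration of the synonym/antonym graph idea, the `synonyms` arrays can be folded into an undirected adjacency map. This is a minimal sketch: `build_synonym_graph` is a hypothetical helper, and the input records are hand-written examples, not Actor output.

```python
from collections import defaultdict
from typing import Any, Iterable


def build_synonym_graph(records: Iterable[dict[str, Any]]) -> dict[str, list[str]]:
    """Build an undirected synonym graph as word -> sorted list of neighbors."""
    graph: defaultdict[str, set[str]] = defaultdict(set)
    for record in records:
        word = record["word"].lower()
        # `synonyms` may be null in the dataset, so treat it as an empty list.
        for synonym in record.get("synonyms") or []:
            synonym = synonym.lower()
            if synonym == word:
                continue  # skip self-loops
            graph[word].add(synonym)
            graph[synonym].add(word)
    return {word: sorted(neighbors) for word, neighbors in graph.items()}


# Hand-written records for illustration only.
graph = build_synonym_graph(
    [
        {"word": "happy", "synonyms": ["glad", "joyful"]},
        {"word": "glad", "synonyms": ["happy"]},
        {"word": "hello", "synonyms": None},
    ]
)
```

Words with no synonyms, such as "hello" in this sample, do not appear in the graph. The same function works for antonyms if the `antonyms` array is read instead.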

📰 Content + Writing

  • Blog post vocabulary checks
  • Synonym suggestions
  • Reading-level analysis
  • Writing-tool integrations

♿ Accessibility

  • Reader-mode definitions
  • ESL/EFL tooltips
  • Vocabulary support for dyslexia tools
  • Plain-language rewriters

🔌 Automating Dictionary Word Definitions Scraper

Control the scraper programmatically:

  • 🟢 Node.js. Install the apify-client NPM package.
  • 🐍 Python. Use the apify-client PyPI package.
  • 📚 See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval.
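
A run can be started from Python along these lines. Treat this as a sketch under stated assumptions: `ACTOR_ID` is a placeholder to replace with the ID shown on the Actor's Store page, reading the token from an `APIFY_TOKEN` environment variable is a convention chosen for this example, and `build_run_input` and `run_lookup` are hypothetical helpers. The `apify-client` import is deferred so the input builder can be used without the package installed.

```python
import os
from typing import Any, Iterable, Optional

ACTOR_ID = "<username>/<actor-name>"  # placeholder; copy the real ID from the Store page


def build_run_input(words: Iterable[str], max_items: int = 10) -> dict[str, Any]:
    """Build the Actor input: a newline-separated word list plus maxItems."""
    cleaned = [word.strip() for word in words if word.strip()]
    return {"maxItems": max_items, "words": "\n".join(cleaned)}


def run_lookup(words: Iterable[str], max_items: int = 10, token: Optional[str] = None) -> list[dict[str, Any]]:
    """Start a run, wait for it to finish, and return the dataset items."""
    from apify_client import ApifyClient  # requires: pip install apify-client

    client = ApifyClient(token or os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(run_input=build_run_input(words, max_items))
    if run is None:
        raise RuntimeError("The Actor run did not return a result")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


run_input = build_run_input(["hello", " language ", ""], max_items=50)
```

Calling `run_lookup(["hello", "language"], max_items=50)` with a valid token and Actor ID would return the records as a list of dictionaries.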


🌟 Beyond business use cases

Data like this powers more than commercial workflows.

🎓 Research and academia

  • Computational linguistics
  • Reproducible vocabulary snapshots
  • Course materials
  • ESL/EFL research

🎨 Personal and creative

  • Personal vocabulary databases
  • Word-of-the-day newsletters
  • Side projects
  • Hobby word collection

🤝 Non-profit and civic

  • Literacy outreach
  • Free language tools
  • Educational accessibility
  • Heritage-language preservation

🧪 Experimentation

  • Train POS classifiers
  • Prototype vocabulary chat agents
  • Build pronunciation tools
  • Test text-mining pipelines

❓ Frequently Asked Questions

🧩 How does it work?

Provide a word list, or paste a paragraph. For paragraph mode, the Actor tokenizes the text and filters stop-words automatically. Each remaining word is looked up in the dictionary.

📊 How many fields per record?

14, including word, phonetic, audio URL, part of speech, definition, example, synonyms, antonyms, and source URL.

📚 How many entries per word?

Up to 4 (one per definition under the first entry). For comprehensive coverage, look up specific words via word list.

🔉 Is audio always included?

No. Audio URL is included when the dictionary has a recording, which depends on the word.

🌐 Are languages other than English supported?

No. The dictionary covers English. For other languages, contact us via the Tally form.

🔁 Can I schedule runs?

Yes. Schedule weekly to capture new vocabulary additions.

⚖️ Is this data free?

Yes. The underlying dictionary is open and free. The Actor is a thin client over the public dictionary.

💳 Do I need a paid Apify plan?

No. The free plan covers preview runs of up to 10 records each.

🆘 What if a run fails?

Apify retries transient errors. Words not in the dictionary are silently skipped.
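
Since missing words are skipped without an error, one way to detect them is to compare the requested words against the words present in the output. This is a minimal sketch with a hypothetical helper, `find_skipped_words`, and hand-written sample data.

```python
from typing import Any, Iterable


def find_skipped_words(requested: Iterable[str], records: Iterable[dict[str, Any]]) -> list[str]:
    """Return requested words (lowercased, deduplicated) that have no output record."""
    returned = {record["word"].lower() for record in records}
    seen: set[str] = set()
    skipped: list[str] = []
    for word in requested:
        key = word.strip().lower()
        if key and key not in returned and key not in seen:
            seen.add(key)
            skipped.append(key)
    return skipped


# Hand-written data for illustration only.
skipped = find_skipped_words(
    ["hello", "Serendipity", "qwertyzz", "qwertyzz"],
    [{"word": "hello"}, {"word": "serendipity"}],
)
```

Note that a word can also be absent because the run reached maxItems before getting to it, so a non-empty result is a hint to investigate, not proof that the dictionary lacks the word.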

📝 What does the paragraph mode do?

Splits your paragraph into unique words, removes common stop-words and very short words, then looks up each. Returns up to maxItems definitions.


🔌 Integrate with any app

Dictionary Word Definitions Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications
  • Airbyte - Pipe data into your warehouse
  • GitHub - Trigger runs from commits
  • Google Drive - Export datasets to Sheets

💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.


⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by dictionaryapi.dev, Wiktionary, the Wikimedia Foundation, or any cited reference work. All trademarks mentioned are the property of their respective owners. Only publicly available open data is collected.