Pricing

Pay per event + usage

Named Entity Extractor & Name Validator

Extract named entities from text using an NER API. Supports multilingual, English, and German text extraction with confidence scores for each detected name.

Developer

Dominic M. Quaiser

Last modified

24 days ago

💡 Features

Real-Time API: On-demand HTTP server with sub-second response times
Multilingual Support: Extract names from English, German, or multilingual text
Confidence Filtering: Set minimum confidence thresholds (0-100%) to control result quality
Automatic Data Storage: All results are automatically saved to your Apify dataset
Comprehensive Output: Returns names with confidence scores, plus organizations, locations, and other entities
Built for Scale: Automatic scaling handles concurrent requests efficiently

🚀 How to Use

This Actor runs in Standby mode, which means it operates like a standard API. You don't need to start the Actor manually - simply send HTTP requests to the Actor's endpoint and get instant results.

📍 Endpoint

Send a POST request to your Actor's Standby URL with the following format:

https://dominic-quaiser--named-entity-extractor.apify.actor?token=YOUR_API_TOKEN

📥 Request Format

You need an Apify account and API token to use this Actor. Get your token from Settings → Integrations in Apify Console.

Recommended method - Include the token in the Authorization header:

curl -X POST https://dominic-quaiser--named-entity-extractor.apify.actor \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Maria Salomea Skłodowska-Curie; known as Marie Curie was a Polish and naturalised-French physicist and chemist who conducted pioneering research on radioactivity.\n She was the first woman to win a Nobel Prize, the first person to win a Nobel Prize twice, and the only person to win a Nobel Prize in two scientific fields. Her husband, Pierre Curie, was a co-winner of her first Nobel Prize, making them the first married couple to win the Nobel Prize and launching the Curie family legacy of five Nobel Prizes. She was, in 1906, the first woman to become a professor at the University of Paris.",
    "language": "en",
    "minConfidence": 70
  }'

Alternative method - Add token as a query parameter:

curl -X POST "https://dominic-quaiser--named-entity-extractor.apify.actor?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Marie Skłodowska Curie war eine Physikerin und Chemikerin polnischer Herkunft, die in Frankreich lebte und wirkte. Sie untersuchte die 1896 von Henri Becquerel beobachtete Strahlung von Uranverbindungen und prägte für diese das Wort „radioaktiv“. Im Rahmen ihrer Forschungen, für die ihr 1903 ein anteiliger Nobelpreis für Physik und 1911 der Nobelpreis für Chemie zugesprochen wurde, entdeckte sie gemeinsam mit ihrem Ehemann Pierre Curie die chemischen Elemente Polonium und Radium. Marie Curie ist die einzige Frau unter den fünf Personen, denen bisher mehrfach ein Nobelpreis verliehen wurde, und neben Linus Pauling die einzige Person, die Nobelpreise auf zwei unterschiedlichen Fachgebieten erhielt.",
    "language": "de",
    "minConfidence": 95
  }'
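The header-based request can also be built with Python's standard library alone. The following is a minimal sketch, not an official client: `build_request` and `send_request` are hypothetical helper names, and `YOUR_API_TOKEN` is a placeholder.

```python
import json
import urllib.request

ACTOR_URL = "https://dominic-quaiser--named-entity-extractor.apify.actor"


def build_request(text, token, language="multi", min_confidence=70):
    """Build a POST request that carries the token in the Authorization header.

    Hypothetical helper for illustration; not part of any official client.
    """
    body = json.dumps(
        {"text": text, "language": language, "minConfidence": min_confidence}
    ).encode("utf-8")
    return urllib.request.Request(
        ACTOR_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


def send_request(request, timeout=30):
    """Send the request and decode the JSON body.

    Needs network access and a valid token. urlopen raises
    urllib.error.HTTPError for 4xx/5xx responses.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call such as `send_request(build_request("Angela Merkel met with Emmanuel Macron in Berlin.", "YOUR_API_TOKEN"))` would then return the parsed response. Keeping the token in a header rather than the URL keeps it out of places where URLs tend to be recorded, such as shell history and access logs.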

Request Parameters

Parameter	Type	Description	Valid Values	Default	Required
`text`	string	Text to extract names from	10-2000 characters	-	Yes
`language`	string	Language model to use	`"multi"`, `"en"`, `"de"`	`"multi"`	No
`minConfidence`	number	Minimum confidence percentage	0-100	70	No
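Checking a payload locally before sending it can save a round trip. The sketch below mirrors the limits in the table above. `validate_payload` is a hypothetical helper, and the Actor's own validation may differ (the error example later in this document mentions a minimum of 3 characters, so the minimum length is left configurable).

```python
ALLOWED_LANGUAGES = {"multi", "en", "de"}


def validate_payload(payload, min_length=10, max_length=2000):
    """Return a list of problems found in a request payload (empty if valid).

    Limits follow the parameter table; they are assumptions about the
    server-side checks, not a copy of them.
    """
    problems = []

    text = payload.get("text")
    if not isinstance(text, str) or not text.strip():
        problems.append("text is required and must be a non-empty string")
    elif not min_length <= len(text) <= max_length:
        problems.append(
            f"text must be {min_length}-{max_length} characters, got {len(text)}"
        )

    # Defaults match the table: "multi" and 70.
    language = payload.get("language", "multi")
    if not isinstance(language, str) or language not in ALLOWED_LANGUAGES:
        problems.append(f"language must be one of {sorted(ALLOWED_LANGUAGES)}")

    min_confidence = payload.get("minConfidence", 70)
    is_number = isinstance(min_confidence, (int, float)) and not isinstance(
        min_confidence, bool
    )
    if not is_number or not 0 <= min_confidence <= 100:
        problems.append("minConfidence must be a number from 0 to 100")

    return problems
```

An empty list means the payload matches the documented constraints. Any other result lists each violated constraint in plain language.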

Language Options

Choose the appropriate language mode based on your text:

multi (Multilingual): Best for mixed-language text or when language is unknown. Works with most European languages.
en (English): Optimized for English text. Use when you know the text is primarily in English for better accuracy.
de (German): Optimized for German text. Use for German-language documents, particularly helpful with German naming conventions.

Confidence Threshold Guidelines

The minConfidence parameter controls result quality. Here's how to choose:

50-60%: Very permissive - includes many names but may have false positives
70-80%: Balanced - good mix of recall and precision (recommended for most use cases)
85-95%: Conservative - high precision but may miss some valid names
95-100%: Very strict - only highest confidence detections

Start with 70% and adjust based on your needs.
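One way to find a suitable threshold is to request results with a low minConfidence once and then compare stricter cut-offs locally. The sketch below assumes that each entry in `names` carries a `confidence` value on a 0-1 scale, as in the example response in this document, while minConfidence is a percentage. `filter_names` is a hypothetical helper, and whether the Actor itself compares with "at least" or "greater than" is an assumption.

```python
def filter_names(names, min_confidence):
    """Keep entries whose 0-1 confidence meets a 0-100 percentage threshold."""
    threshold = min_confidence / 100
    return [entry for entry in names if entry["confidence"] >= threshold]


# Entries copied from the example response in this document.
sample_names = [
    {"person": "Maria Salomea Skłodowska", "confidence": 0.929},
    {"person": "Curie", "confidence": 0.99},
    {"person": "Marie Curie", "confidence": 0.999},
    {"person": "Pierre Curie", "confidence": 0.999},
]

# Compare how many names survive at increasingly strict thresholds.
for level in (70, 95, 99.5):
    kept = [entry["person"] for entry in filter_names(sample_names, level)]
    print(f"minConfidence={level}: {kept}")
```

With this sample, all four names pass at 70, the 0.929 entry drops out at 95, and only the two 0.999 entries remain at 99.5.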

📤 Response Format

Success Response (200 OK)

{
  "language": "en",
  "total_names_found": 4,
  "processing_time": 4.2364819049835205,
  "names": [
    {
      "person": "Maria Salomea Skłodowska",
      "confidence": 0.929
    },
    {
      "person": "Curie",
      "confidence": 0.99
    },
    {
      "person": "Marie Curie",
      "confidence": 0.999
    },
    {
      "person": "Pierre Curie",
      "confidence": 0.999
    }
  ],
  "persons": [
    "Maria Salomea Skłodowska",
    "Curie",
    "Marie Curie",
    "Pierre Curie"
  ],
  "organizations": [
    "University Of Paris"
  ],
  "locations": [],
  "miscellaneous": [
    "Polish",
    "French",
    "Nobel Prize"
  ],
  "raw_entities": [
    {
      "text": "Maria Salomea Skłodowska",
      "type": "person",
      "score": 0.929,
      "start": 0,
      "end": 24
    },
    {
      "text": "Curie",
      "type": "person",
      "score": 0.99,
      "start": 25,
      "end": 30
    },
    {
      "text": "Marie Curie",
      "type": "person",
      "score": 0.999,
      "start": 41,
      "end": 52
    },
    {
      "text": "Polish",
      "type": "miscellaneous",
      "score": 0.997,
      "start": 59,
      "end": 65
    },
    {
      "text": "French",
      "type": "miscellaneous",
      "score": 0.999,
      "start": 82,
      "end": 88
    },
    {
      "text": "Nobel Prize",
      "type": "miscellaneous",
      "score": 0.996,
      "start": 196,
      "end": 207
    },
    {
      "text": "Nobel Prize",
      "type": "miscellaneous",
      "score": 0.996,
      "start": 235,
      "end": 246
    },
    {
      "text": "Nobel Prize",
      "type": "miscellaneous",
      "score": 0.996,
      "start": 283,
      "end": 294
    },
    {
      "text": "Pierre Curie",
      "type": "person",
      "score": 0.999,
      "start": 334,
      "end": 346
    },
    {
      "text": "Nobel Prize",
      "type": "miscellaneous",
      "score": 0.996,
      "start": 377,
      "end": 388
    },
    {
      "text": "Nobel Prize",
      "type": "miscellaneous",
      "score": 0.996,
      "start": 438,
      "end": 449
    },
    {
      "text": "Curie",
      "type": "person",
      "score": 0.922,
      "start": 468,
      "end": 473
    },
    {
      "text": "Nobel Prize",
      "type": "miscellaneous",
      "score": 0.994,
      "start": 496,
      "end": 507
    },
    {
      "text": "University of Paris",
      "type": "organization",
      "score": 0.969,
      "start": 573,
      "end": 592
    }
  ],
  "confidence_scores": {
    "persons": [
      0.929,
      0.99,
      0.999,
      0.999
    ],
    "organizations": [
      0.5
    ],
    "locations": [],
    "miscellaneous": [
      0.997,
      0.999,
      0.996
    ]
  },
  "model_used": "dbmdz/bert-large-cased-finetuned-conll03-english",
  "text": "Maria Salomea Skłodowska-Curie; known as Marie Curie was a Polish and naturalised-French physicist and chemist who conducted pioneering research on radioactivity.\n She was the first woman to win a Nobel Prize, the first person to win a Nobel Prize twice, and the only person to win a Nobel Prize in two scientific fields. Her husband, Pierre Curie, was a co-winner of her first Nobel Prize, making them the first married couple to win the Nobel Prize and launching the Curie family legacy of five Nobel Prizes. She was, in 1906, the first woman to become a professor at the University of Paris."
}

Response Fields

Field	Type	Description
`language`	string	Language model used for extraction
`total_names_found`	number	Count of names meeting confidence threshold
`processing_time`	number	Processing time in seconds
`names`	array	Person names whose confidence (reported on a 0-1 scale) is at least minConfidence / 100
`persons`	array	All detected person names
`organizations`	array	All detected organizations
`locations`	array	All detected locations
`miscellaneous`	array	Other detected entities
`raw_entities`	array	Detailed entity data with positions and scores
`confidence_scores`	object	Confidence scores by entity type
`model_used`	string	Name of the NER model used
`text`	string	Original input text
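The `raw_entities` array is the most detailed part of the response. The sketch below shows two things a client might do with it: group the distinct entity texts by type, and check that `start`/`end` slice the entity out of `text`. It assumes the offsets are character positions with an exclusive `end`, which matches the example response but is not stated explicitly. `group_entities` and `span_text` are hypothetical helpers.

```python
def group_entities(response):
    """Group raw_entities by type, keeping each distinct text once, in order."""
    grouped = {}
    for entity in response["raw_entities"]:
        texts = grouped.setdefault(entity["type"], [])
        if entity["text"] not in texts:
            texts.append(entity["text"])
    return grouped


def span_text(response, entity):
    """Slice the original text with the entity's start/end offsets."""
    return response["text"][entity["start"]:entity["end"]]


# Truncated excerpt of the example response above (first sentence only).
sample = {
    "text": (
        "Maria Salomea Skłodowska-Curie; known as Marie Curie was a Polish "
        "and naturalised-French physicist"
    ),
    "raw_entities": [
        {"text": "Maria Salomea Skłodowska", "type": "person",
         "score": 0.929, "start": 0, "end": 24},
        {"text": "Curie", "type": "person",
         "score": 0.99, "start": 25, "end": 30},
        {"text": "Marie Curie", "type": "person",
         "score": 0.999, "start": 41, "end": 52},
        {"text": "Polish", "type": "miscellaneous",
         "score": 0.997, "start": 59, "end": 65},
    ],
}

for entity in sample["raw_entities"]:
    # Each slice should reproduce the entity text exactly.
    print(entity["type"], repr(span_text(sample, entity)))
print(group_entities(sample))
```

Grouping on the client side is useful when the same entity appears several times in `raw_entities`, as "Nobel Prize" does in the full example.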

Error Response (400/500)

{
  "error": {
    "status_code": 400,
    "message": "Field 'text' is required and must be a non-empty string."
  }
}

💻 Code Examples

Python

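import requests

# Standby URL of the Actor; replace YOUR_API_TOKEN with your Apify API token.
url = "https://dominic-quaiser--named-entity-extractor.apify.actor?token=YOUR_API_TOKEN"
payload = {
    "text": "Angela Merkel met with Emmanuel Macron in Berlin.",
    "language": "multi",
    "minConfidence": 80
}

# Send the payload as a JSON body; the timeout avoids waiting forever on a stalled connection.
response = requests.post(url, json=payload, timeout=30)
print(response.json())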

JavaScript/Node.js

// Uses the global fetch (Node.js 18+) and top-level await, so run this as an ES module.
const response = await fetch(
  'https://dominic-quaiser--named-entity-extractor.apify.actor?token=YOUR_API_TOKEN',
  {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
      text: 'Angela Merkel met with Emmanuel Macron in Berlin.',
      language: 'multi',
      minConfidence: 80
    })
  }
);

const data = await response.json();
console.log(data);

PHP

<?php
$url = 'https://dominic-quaiser--named-entity-extractor.apify.actor?token=YOUR_API_TOKEN';
$data = [
    'text' => 'Angela Merkel met with Emmanuel Macron in Berlin.',
    'language' => 'multi',
    'minConfidence' => 80
];

$ch = curl_init($url);
curl_setopt($ch, CURLOPT_POST, true);
curl_setopt($ch, CURLOPT_POSTFIELDS, json_encode($data));
curl_setopt($ch, CURLOPT_HTTPHEADER, ['Content-Type: application/json']);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);

$response = curl_exec($ch);
curl_close($ch);

echo $response;
?>

🎯 Use Cases

Lead Generation: Extract contact names from business emails or documents
Content Analysis: Identify people mentioned in articles or social media
Document Processing: Extract names from contracts, resumes, or legal documents
CRM Enrichment: Parse names from unstructured text data
Compliance & KYC: Extract and validate person names for regulatory purposes
News Monitoring: Track mentions of specific individuals in news feeds
Academic Research: Extract author names from papers or citations
Customer Support: Identify customer names from support tickets and feedback

❓ Error Handling

The Actor returns appropriate HTTP status codes:

200: Success - names extracted successfully
400: Bad Request - invalid input (check error message for details)
401: Unauthorized - invalid or missing API token
402: Payment Required - dataset limit reached, upgrade your plan to continue
500: Internal Server Error - unexpected error (contact support if persistent)

Example error response:

{
  "error": "Text length must be between 3 and 2000 characters, got 2",
  "status": 400
}

Example quota limit response:

{
  "error": {
    "status_code": 402,
    "message": "Dataset limit of 1000 items reached. Please upgrade your plan to continue storing results."
  }
}
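The error examples in this document come in two shapes: a nested object with `status_code` and `message`, and a flat object with `error` and `status`. A client can normalize both before deciding what to do. The sketch below is one way to do that. `parse_error` is a hypothetical helper, and falling back to the HTTP status code when the body carries none is an assumption.

```python
def parse_error(payload, http_status=None):
    """Return (status, message) from either error shape shown in this document."""
    error = payload.get("error")
    if isinstance(error, dict):
        # Nested shape: {"error": {"status_code": ..., "message": ...}}
        return error.get("status_code", http_status), error.get("message", "")
    # Flat shape: {"error": "...", "status": ...}
    return payload.get("status", http_status), str(error or "")


nested = {
    "error": {
        "status_code": 402,
        "message": "Dataset limit of 1000 items reached. "
                   "Please upgrade your plan to continue storing results.",
    }
}
flat = {
    "error": "Text length must be between 3 and 2000 characters, got 2",
    "status": 400,
}

for body in (nested, flat):
    status, message = parse_error(body)
    print(status, message)
```

With the status normalized, a caller can treat 400, 401, and 402 as problems to fix before retrying, and reserve retries for 500 responses.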

⚖️ Legal Disclaimer

You are solely responsible for determining the legality of your use of this Actor and the data it generates. The handling of data, particularly personal information, is subject to complex legal frameworks like the General Data Protection Regulation (GDPR/DSGVO) and copyright laws. It is your responsibility to ensure your use case is compliant with all applicable laws. This text does not constitute legal advice.

Please be aware that an external NER API hosted on a private server in the European Union is used for data processing.

What is Processed: The request content you submit, which may contain personal names and other entities.
Why: To perform Named Entity Recognition (NER) and extract person names, organizations, locations, and other entities from your text.
Data Controller: You, the user, are the data controller. The Actor's developer acts as the data processor for this specific task.
Location & Compliance: All NER processing for this feature occurs within the EU (Germany) and is subject to GDPR (DSGVO).
Data Storage: The request is processed in-memory and is not stored or logged on the external NER server.
Important: This NER processing is external to the Apify platform and is not covered by Apify's DPA. By using this Actor, you acknowledge this separate data processing activity.

🛠️ Maintainer

Author: Dominic M. Quaiser
Contact: mail@dominic-quaiser.io
Website: dominic-quaiser.io
