Article Text Extractor

mtrunkat/article-text-extractor

Extracts the article text and other metadata from a given URL. It uses https://github.com/ageitgey/node-unfluff, which is a Node.js implementation of https://github.com/grangier/python-goose.

The code examples below show how to run the Actor and get its results. To run the code, you need to have an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token, which you can find under Settings > Integrations in Apify Console.

import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with API token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "url": "https://www.bbc.com/news/world-asia-china-48659073"
};

(async () => {
    // Run the Actor and wait for it to finish
    const run = await client.actor("mtrunkat/article-text-extractor").call(input);

    // Fetch and print Actor results from the run's dataset (if any)
    console.log('Results from dataset');
    const { items } = await client.dataset(run.defaultDatasetId).listItems();
    items.forEach((item) => {
        console.dir(item);
    });
})();
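
Once the items are fetched, it can help to reduce each one to a few fields before logging or storing it. The sketch below is a minimal, hypothetical helper (summarizeItem is not part of apify-client or of this Actor). It assumes each dataset item carries title, author, and text fields, as node-unfluff's own output does; the Actor's actual output shape is not documented here, so check the field names against a real run first.

```javascript
// Hypothetical helper, not part of apify-client or the Actor itself.
// Assumption: items expose `title` (string), `author` (array or string)
// and `text` (string), mirroring node-unfluff's result object.
function summarizeItem(item, maxChars = 200) {
    // Fall back to a placeholder when the title is missing or blank
    const title = typeof item.title === 'string' && item.title.trim()
        ? item.title.trim()
        : '(untitled)';

    // Accept either a list of authors or a single author string
    const authors = Array.isArray(item.author)
        ? item.author.join(', ')
        : String(item.author ?? '');

    // Treat a missing body as empty text
    const text = typeof item.text === 'string' ? item.text : '';
    const trimmed = text.trim();
    const wordCount = trimmed ? trimmed.split(/\s+/).length : 0;

    // Truncate long bodies so the summary stays readable in logs
    const excerpt = text.length > maxChars ? `${text.slice(0, maxChars)}...` : text;

    return { title, authors, wordCount, excerpt };
}
```

Inside the loop above, console.dir(summarizeItem(item)) would print the condensed form in place of the full item.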
Developer
Maintained by Community
Actor stats
  • 709 users
  • 122.3k runs
  • Modified 2 months ago