PDF to HTML Converter

  • jancurn/pdf-to-html
  • Modified
  • Users 314
  • Runs 210.7k
  • Created by Author's avatarJan Čurn

Converts a PDF document to HTML using the pdf2htmlEX tool.

To run the code examples, you need to have an Apify account. Replace <YOUR_API_TOKEN> in the code with your API token. For a more detailed explanation, please read about running Actors via the API in Apify Docs.

import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with API token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "url": "https://apify.com/ext/ycf_application.pdf"
};

(async () => {
    // Run the Actor and wait for it to finish
    const run = await client.actor("jancurn/pdf-to-html").call(input);

    // Fetch and print Actor results from the run's dataset (if any)
    console.log('Results from dataset');
    const { items } = await client.dataset(run.defaultDatasetId).listItems();
    items.forEach((item) => {
        console.dir(item);
    });
})();