User

Marek Trunkát

mtrunkat

Full Stack Web Developer and Technology Enthusiast

All
Popularity
Actor

Twitter Hashtag Scraper

mtrunkat/twitter

This Twitter Hashtag Scraper will scrape and extracts all tweets for given hashtag and provide output in JSON, XML, CSV or HTML.

avatarmtrunkat
73star
FEATURED
Actor

Article Text Extractor

mtrunkat/article-text-extractor

Simply extracts article text and other meta info from given url. Uses https://github.com/ageitgey/node-unfluff which is a NodeJS implementation of https://github.com/grangier/python-goose.

avatarmtrunkat
88star
Actor

Crawler To Spreadsheet

mtrunkat/crawler-to-spreadsheet

This crawler takes last crawler run result and stores new items in Google Docs Spreadsheet.

avatarmtrunkat
68star
Actor

Example Hacker News

mtrunkat/example-hacker-news

Example crawler for news.ycombinator.com build using Apify SDK

avatarmtrunkat
60star
Actor

Url List Download Html

mtrunkat/url-list-download-html

This act accepts a url list and downloads HTML of each page. It has input parameter - "sources" (see soursec parameter of UrlList https://www.apify.com/docs/sdk/apify-runtime-js/beta#RequestList).

avatarmtrunkat
44star
Crawler

Aliexpress.com - own orders

Get all your orders from aliexpress.com in machine readable format.

avatarmtrunkat
37cloud_download
Actor

Crawl Url List 1by1

mtrunkat/crawl-url-list-1by1

Crawls given list of urls with one crawler execution per url.

avatarmtrunkat
27star
Crawler

Skoda-auto.cz - model variants

Get all model-engine-equipment package variants of Škoda Auto cars.

avatarmtrunkat
13cloud_download
Actor

Puppeteer Promise Pool Example

mtrunkat/puppeteer-promise-pool-example

Example how to use Puppeteer in parallel using 'es6-promise-pool' npm package.

avatarmtrunkat
8star
Actor

Crawler Timeline

mtrunkat/crawler-timeline

This act creates a timeline spreadsheet from crawler results. Main use-case is to create a spreadsheet containing changes of some web page in time.

avatarmtrunkat
8star
Actor

Crawler To Sitemap

mtrunkat/crawler-to-sitemap

This act can be used as crawler's finish webhook. It transforms crawler's result into sitemap XML file and stores it in key-value-store named "sitemaps".

avatarmtrunkat
7star
Crawler

HN Show

Scrapes the links with their rank from HN Show. Created for this blogpost https://medium.com/p/8cccfa25f5cb/edit

avatarmtrunkat
7cloud_download
Actor

Xmls To Dataset

mtrunkat/xmls-to-dataset

This act loads list of urls from INPUT.sources. Each of these links should point to a xml file. It downloads all the files and saves them to it's default dataset. Groups parameter in INPUT allows to choose Apify proxy groups to us...

avatarmtrunkat
3star
Actor

Proxy Test

mtrunkat/proxy-test

This actor simply tests given array of URLs against selected proxy URLs or Apify proxy groups.

avatarmtrunkat
3star
Actor

24 Hour Stats

mtrunkat/24-hour-stats

This act can be used as synchronous API. Returns a JSON containing actor runs finished in the last 24 hours along with information about their default datasets and request queues. Actors might be filtered via input array "actIds".

avatarmtrunkat
2star
Actor

Economist Category Scraper

mtrunkat/economist-category-scraper

Example implementation of economist.com scraper built using apify/web-scraper actor. Crawls latest updates from a given economist category.

avatarmtrunkat
1star
Actor

Delete Untitled Acts

mtrunkat/delete-untitled-actors

Deletes all actors and tasks named untitled-X, my-actor-X, my-task-X from your account. In a minute. For free. With one click!

avatarmtrunkat
1star