Pricing

$30.00/month + usage

Go to Store

Github Profile Scraper

Try for free

Developed by

SASWAVE

GitHub User Profile Scraper. Extracts data from GitHub profiles, including followers, following, LinkedIn, Twitter, achievements and much more. Ideal for developers, researchers, and marketers. From a list of Github profile or a repository stargazers link

0.0 (0)

Pricing

$30.00/month + usage

Total users

Monthly users

Runs succeeded

90%

Last modified

3 months ago

Automation

Lead generation

Back to issues Create new issue

Starts over after resurrect

Closed

Alex (pooky) opened this issue

Please see my run. After the timeout, the restart began from the beginning. I have nearly 4,000 URLs, and any failure will cost a lot of usage and time.

Alex (pooky)

UPD: no, not from the beginning. But for some reason the first run didn't start from the beginning of the list, but after resurrecting the actor it started from the first line.

Alex (pooky)

UPD 2: No, he started over after all. I'm pausing for now, waiting for your response.

SASWAVE (saswave)

Thank you for reporting the issue, we are working on it to add a queue system that will handle container migration (allows to restart from where it stop instead of re starting from 0)

Alex (pooky)

Thank you for the quick response! What are your timelines for the queue system? Meanwhile, what do you recommend? Should I split the processing of 4000 accounts into 8-10 parts?

SASWAVE (saswave)

Not sure since Infra is 100% managed by apify and a migration event can be triggered anytime

Actor should be updated by the end of morning (French time)

Alex (pooky)

Actor should be updated by the end of morning (French time)

Do you mean today?

SASWAVE (saswave)

Actor has been updated, have a try and you can close the issue if the 4000 accounts have been scraped successfully

Alex (pooky)

Tried. I would say it was done; however, I resurrected it once and it ended, but with an error.

kj7oeyVgCDasn2Vvo

SASWAVE (saswave)

running with your input and trying to reproduce the issue

SASWAVE (saswave)

Did you remove the timeout limit from the actor settings ? (or increase the default limit, it's 3600 seconds)

SASWAVE (saswave)

Did you build the actor before start ? maybe you started the run without the code being updated , it's running, 450/4000

Will get back to you if it throw an error not being handled before the last url 4000

SASWAVE (saswave)

I timed out after 1h, with 2000+ profile scraped, i resurrected the run and it start from where it stoped

Alex (pooky)

Did you remove the timeout limit from the actor settings ? (or increase the default limit, it's 3600 seconds)

No, it was default.

Did you build the actor before start ? maybe you started the run without the code being updated , it's running, 450/4000

Not sure here, tbh.

I timed out after 1h, with 2000+ profile scraped, i resurrected the run and it start from where it stoped

Yes, last time after your update it successfully continued after resurrection. And I checked the final result. The list is clear and tidy, without missing lines. Thank you! I think the issue is closed now.

Add comment

Github User Profile Scraper

powerful_bachelor/Github-User-Profile-Scraper

The GitHub User Profile Scraper extracts vital info from GitHub profiles, including followers, following, LinkedIn, Twitter, achievements and much more. Ideal for developers, researchers, and marketers, it supports multiple profiles and exports data in various formats.

Powerful Bachelor

Github Users Scraper

dtrungtin/github-users-scraper

Github Users Scraper is an Apify actor for extracting users or emails from Github. It allows you to extract all watchers, stargazers, and members from a repository page.

Tin

225

4.0

Github Users Scraper

getdataforme/github-users-actor

This actor works well and helps to scrape the users on github repository.

GetDataForMe

Github Search Scraper

saswave/github-search-scraper

Github search scraper. Get all data from search results list

SASWAVE

Github emails from commits

saswave/github-emails-from-commits

From a Github repository url, extract all emails from commits and their occurence number. Allow you to generate a list of emails from targeted github repositories

SASWAVE

Github Repo User Scraper

inquisitive_sarangi/github-repo-scraper

Github Repo User Scraper is simple tool to extract users of a repo(s) like contributors, stargazers & watchers. You can also export listings to JSON/CSV or any other as format.

API Master

GitHub Stars

sauain/github-stars

Input will be the URL of any GitHub repository, and output will be GitHub Stars.

Saurav Jain

Github Trending Repositories / Developers

saswave/github-trending-repositories-developers

From a Github Trending category, extract all related informations about repositories or developers trending date range in Daily / Weekly / Monthly. With filters based on language spoken, code language, sponsorable status and date range

SASWAVE

GitHub Repository Scraper

fresh_cliff/github-scraper

This actor scrapes detailed information from GitHub repositories using reliable HTTP requests and HTML parsing. It extracts repository metadata including star counts, fork counts, topics/tags, license information, primary programming language, and last updated timestamps.