Gitlab Scraper | Fast & Reliable | $2 / 1k avatar
Gitlab Scraper | Fast & Reliable | $2 / 1k

Pricing

$1.99 / 1,000 results

Go to Apify Store
Gitlab Scraper | Fast & Reliable | $2 / 1k

Gitlab Scraper | Fast & Reliable | $2 / 1k

Turn GitLab into a clean dataset. Scrape public projects and user profiles with rich metadata: names, namespaces, descriptions, topics, stars, forks, visibility and activity timestamps. Great for tech research, competitor tracking, lead lists and spotting fast-moving projects.

Pricing

$1.99 / 1,000 results

Rating

5.0

(1)

Developer

Fatih Tahta

Fatih Tahta

Maintained by Community

Actor stats

1

Bookmarked

2

Total users

1

Monthly active users

19 days ago

Last modified

Share

Overview

Gitlab Scraper | Fast & Reliable | $2 / 1k captures structured, public data from gitlab.com including projects and user profiles. GitLab hosts millions of repositories, contributors, and discussions, making it a rich source for tracking technologies, communities, and product ecosystems. This actor automates collection with dependable runs so you can focus on analysis instead of manual browsing.

Why Use This Actor

  • Market and competitive research: Map projects, contributors, and activity around specific technologies or vendors.
  • Lead and outreach lists: Identify active maintainers, organizations, and repositories relevant to your campaigns.
  • Product and trend monitoring: Follow repositories by language, license, or topics to spot momentum and adoption patterns.
  • Directory building: Assemble curated lists of projects or profiles with consistent metadata for enrichment or audits.
  • Time savings and reliability: Schedule recurring runs and keep datasets fresh without manual exports or copy-paste work.

Input Parameters

ParameterTypeDescriptionDefault
searchQueriesarray of stringsKeywords to search on GitLab. Each entry runs separately.
searchUrlsarray of stringsFull GitLab search URLs (e.g., https://gitlab.com/search?scope=projects&search=vscode). The actor extracts the search term and scope automatically.
searchModestring (projects | users | projects_and_users)Choose which GitLab entities to fetch for each query. Select projects only, users only, or let the actor honor scopes from search URLs.
projectLanguagestringOptional language filter for project searches (e.g., typescript).
projectArchivedbooleanInclude archived repositories when enabled.
projectMinStarsintegerMinimum star count to include a project.
projectMinForksintegerMinimum fork count to include a project.
projectLicensestringLicense filter for projects (e.g., apache-2.0).
projectTopicsarray of stringsTopics to require on projects (e.g., security, cli).
projectOwnerstringOwner namespace to target (e.g., gitlab-org).
projectFilterTokensstringAdditional GitLab search tokens to append to project queries (e.g., stars:>50 archived:false).
maxResultsintegerMaximum combined project and user records to save across all queries.50000

Example Input

{
"searchQueries": ["machine learning", "security research"],
"searchMode": "projects_and_users",
"projectLanguage": "typescript",
"projectMinStars": 50,
"projectLicense": "apache-2.0",
"maxResults": 5000
}

Example Output

Each dataset item is structured for straightforward analysis or enrichment.

{
"type": "project",
"name": "heritage_site_HackJaipur_hackathon",
"nameWithNamespace": "Manjot Singh / heritage_site_HackJaipur_hackathon",
"pathWithNamespace": "manjotsinghbagha/heritage_site_HackJaipur_hackathon",
"webUrl": "https://gitlab.com/manjotsinghbagha/heritage_site_HackJaipur_hackathon",
"description": "Hack-Jaipur hackathon project to build and e commerce site with the help Autho, Machine learning, 3D models, Django, Python, Html, CSS, javascript",
"avatarUrl": null,
"topics": [],
"starCount": 0,
"forksCount": 0,
"visibility": "public",
"archived": null,
"createdAt": "2025-12-18T19:18:04.039Z",
"lastActivityAt": "2025-12-18T19:18:03.925Z",
"namespace": {
"name": "Manjot Singh",
"fullPath": "manjotsinghbagha"
}
}
  • type — Indicates whether the item is a project or user entry.
  • name, nameWithNamespace, pathWithNamespace — Project identifiers and paths for linking and indexing.
  • webUrl — Direct link to the project page.
  • description, topics, visibility, archived — High-level project details and status.
  • starCount, forksCount — Popularity and community engagement signals.
  • createdAt, lastActivityAt — Timestamps for project creation and latest activity.
  • namespace — Owner or organization information.

Notes & Limitations

  • Use this actor responsibly and in accordance with GitLab’s terms of service and applicable laws.
  • Public data can include personal information; ensure you have a lawful basis to store or process it.
  • Start with modest result limits when exploring new filters, then scale once you confirm the scope.

Support

Questions or custom needs? Open an issue on the Issues tab of the actor page in Apify Console and it will be resolved around the clock.

Happy Scraping,

  • Fatih