Page Rank Actor
Under maintenance
Pricing
Pay per usage
Page Rank Actor
Under maintenance
Page rank Actor ranks URLs that satisfy the glob patterns.
0.0 (0)
Pricing
Pay per usage
0
Monthly users
1
Runs succeeded
>99%
Last modified
24 days ago
Apify Page rank Actor
Apify Page rank Actor is a crawler designed to calculate and rank web pages based on their URL and inlinks/outlinks data. It crawls the web pages, calculates the PageRank for each page, and filters the pages that meet the specified restrictions. The final output includes the URL, rank, inlinks, and outlinks, sorted by rank in descending order.
Use Cases
- SEO Analysis: Identify the most influential pages on a website based on page rank.
- Website Optimization: Discover important pages with weak internal linking and improve site structure.
- Competitor Research: Analyze competitor websites to understand their link hierarchy and page importance.
- Content Strategy: Prioritize high-ranking pages for content updates or linking strategies.
- Web Archiving: Rank and filter pages efficiently for web scraping and archiving purposes.
- Data Science & Research: Use page rank data for academic studies or data-driven decision-making.
- Social media profile: Use page rank data for measuring social media profile popularity.
Features
- Crawl Web Pages: The actor crawls through a set of web pages.
- PageRank Calculation: The PageRank algorithm is applied to rank the pages based on their incoming (inlinks) and outgoing (outlinks) links.
- Restriction-based Filtering: Only pages that meet specific restrictions (such as URL patterns or conditions) will be processed.
- Sorted Output: The output is a sorted list of web pages, showing the URL, calculated PageRank, number of inlinks, and outlinks.
- Inlinks/Outlinks Tracking: The actor keeps track of the number of inlinks and outlinks for each page.
Output
The output consists of a list of crawled pages, sorted by PageRank, and includes the following details for each page:
Property | Type | Description |
---|---|---|
url | String | The URL of the crawled webpage. |
importance | Integer | A ranking score where a lower value means higher importance. |
pageRank | Float | The calculated PageRank score, representing the authority of the page. |
inlinks | String[] | An array of URLs linking to this page (incoming links). |
outlinks | String[] | An array of URLs that this page links out to (outgoing links). |
Example Output
Datasets
1{ 2 "url": "https://apify.com/", 3 "importance": 1, 4 "pageRank": 0.00011726428097546502, 5 "inlinks": [ 6 "https://apify.com/store", 7 "https://apify.com/actors", 8 ... 9 ], 10 "outlinks": [ 11 "https://console.apify.com/sign-up", 12 "https://console.apify.com/sign-in", 13 ... 14 ] 15}
HTML Graph
The Actor generates an interactive HTML page that visualizes a graph of web pages and their connections.
- Nodes represent individual web pages.
- Edges represent links between pages (inlinks and outlinks).
- Node size is based on its PageRank value—larger nodes indicate more important pages.
- Clicking a node toggles the visibility of all its inlinks and outlinks.
- Hovering over a node displays the URL of that page.
GEXF Graph
The Actor also generates a GEXF file, an XML-based format that represents the graph structure.
This file can be used to create custom visualizations, analyze network data, or import the graph into various tools supporting GEXF.
1<?xml version="1.0" encoding="UTF-8"?> 2<gexf version="1.2" xmlns="http://www.gexf.net/1.2draft" xmlns:viz="http:///www.gexf.net/1.1draft/viz"> 3 <meta/> 4 <graph defaultedgetype="directed"> 5 <attributes class="edge"> 6 <attribute id="source" title="source" type="string"/> 7 <attribute id="target" title="target" type="string"/> 8 <attribute id="type" title="type" type="string"/> 9 <attribute id="hidden" title="hidden" type="boolean"/> 10 </attributes> 11 <nodes> 12 <node id="n0" label="https://apify.com/"> 13 <viz:color r="74" g="201" b="104"/> 14 <viz:size value="30"/> 15 <viz:position x="-121.61664581298828" y="-51.96864318847656"/> 16 </node> 17 ...
Pricing
Pricing model
Pay per usageThis Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage.