COMPARATIVE ANALYSIS OF DISTRIBUTED METHODS FOR PAGERANK COMPUTATION

Date
2018
Journal Title
Journal ISSN
Volume Title
Publisher
Producer
Director
Performer
Choreographer
Costume Designer
Music
Videographer
Lighting Designer
Set Designer
Crew Member
Funder
Rehearsal Director
Concert Coordinator
Moderator
Panelist
Alternative Title
Department
Haverford College. Department of Computer Science
Type
Thesis
Original Format
Running Time
File Format
Place of Publication
Date Span
Copyright Date
Award
Language
eng
Note
Table of Contents
Terms of Use
Rights Holder
Access Restrictions
Open Access
Tripod URL
Identifier
Abstract
The PageRank of a vector is a measure of centrality for nodes in a network. Three ways to calculate the PageRank are using power iteration, Monte Carlo methods, and iterative graph methods using Spark. The graph methods built in to spark and the power iteration method demonstrated a small difference in their performance when increasing the number of machines in a cluster. Running on AWS, the graph methods improved more when increasing the number of machines from 2 to 10 when compared to the power iteration methods. Experiments were carried out using Amazon Web Services and run on a subset of the web graph from Google. Improvements in partitioning could change these results, and could increase the performance of the matrix methods overall.
Description
Subjects
Citation
Collections