Content
- PageRank (lecture 4.1 & 4.2)
- Random walk and restarts (lecture 4.3)
- Matrix factorization and node embeddings (lecture 4.4)
1. PageRank
web page contains in-links and out-links
The importance of page can be defined as the number of in-links, AKA links as votes
The flow model
Definitions:
- Rank
, where is out-defree of node i. - Stochastic adjacency matrix
, is a column stochastic matrix (columns sum to 1). - Rank vector
, where . .
Connect to random walk
Definitions:
probability distribution over pages , is a stationary distribution of a random walk.
solving eigenvector equation
Similarly, the rank vector