The Idea

The idea stemmed from the insight that web pages fall into one of two categories:

  • hubs - web pages that serve as compilations of a broad catalog of information and lead users directly to other, authoritative pages
  • authorities - web pages that contain the actual information

In other words, a good hub is a page that points to many other pages, while a good authority is a page that is linked to by many different hubs.
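
As a rough illustration of that intuition, the sketch below counts outgoing and incoming links in a tiny, made-up link graph. The page names and the `links` dictionary are hypothetical, and raw link counts are only a crude stand-in for hub and authority quality, since they ignore how good the pages at the other end of each link are.

```python
# Hypothetical link graph: each page maps to the pages it links to.
links = {
    "directory": ["paper_a", "paper_b", "paper_c"],
    "blog": ["paper_a", "paper_b"],
    "paper_a": [],
    "paper_b": [],
    "paper_c": [],
}


def link_counts(graph):
    """Return (out_links, in_links) counts for every page in the graph."""
    # Outgoing links: how many pages this page points to (hub-like).
    out_links = {page: len(targets) for page, targets in graph.items()}
    # Incoming links: how many pages point to this page (authority-like).
    in_links = {page: 0 for page in graph}
    for targets in graph.values():
        for target in targets:
            in_links[target] = in_links.get(target, 0) + 1
    return out_links, in_links


out_links, in_links = link_counts(links)
print("out-links:", out_links)
print("in-links:", in_links)
```

In this toy graph, "directory" has the most outgoing links and the "paper" pages receive all the incoming ones, which matches the hub and authority roles described above.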

The scheme therefore assigns two scores to each page:

  • authority value - which estimates the value of the content of the page
  • hub value - which estimates the value of its links to other pages
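
One common way to compute such scores is a mutually reinforcing iteration: a page's authority value is taken as the sum of the hub values of the pages linking to it, and its hub value as the sum of the authority values of the pages it links to, with both sets of scores rescaled after each round. The sketch below assumes that update rule, a fixed number of rounds, and Euclidean normalisation. The names `hub_authority_scores`, `normalize`, `iterations`, and the example graph are illustrative and not taken from this text.

```python
import math


def normalize(scores):
    """Rescale scores so that their squares sum to 1 (no-op if all are zero)."""
    norm = math.sqrt(sum(value * value for value in scores.values()))
    if norm == 0.0:
        return dict(scores)
    return {page: value / norm for page, value in scores.items()}


def hub_authority_scores(graph, iterations=20):
    """Estimate (hub, authority) scores; graph maps a page to the pages it links to."""
    # Include pages that only appear as link targets.
    pages = set(graph) | {t for targets in graph.values() for t in targets}
    hub = {page: 1.0 for page in pages}
    auth = {page: 1.0 for page in pages}

    for _ in range(iterations):
        # Authority value: sum of the hub values of pages linking to this page.
        new_auth = {page: 0.0 for page in pages}
        for source, targets in graph.items():
            for target in targets:
                new_auth[target] += hub[source]

        # Hub value: sum of the authority values of the pages this page links to.
        new_hub = {
            page: sum(new_auth[target] for target in graph.get(page, []))
            for page in pages
        }

        # Rescale so the scores stay bounded from round to round.
        auth = normalize(new_auth)
        hub = normalize(new_hub)

    return hub, auth


# Hypothetical example graph, for illustration only.
example_graph = {
    "directory": ["paper_a", "paper_b", "paper_c"],
    "blog": ["paper_a", "paper_b"],
}
hub, auth = hub_authority_scores(example_graph)
for page in sorted(hub):
    print(f"{page}: hub={hub[page]:.3f} authority={auth[page]:.3f}")
```

With this example graph, "directory" ends up with the highest hub value, and "paper_a" and "paper_b" (linked from both hubs) end up with higher authority values than "paper_c".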

Algorithm

TODO