http://peterwebster.me/2014/01/28/distant-reading-the-webarchive/

On the number of times each site links out: there are two ways into this. One is to look at *for how long* a linkage persists (which one can do from this data, but I haven’t yet). The second is to look at how many links there were from one host to another at any one time. Those counts are given in the data, but they are only meaningful if you also know how many pages the linking site had in total. That is rather more difficult to establish, and involves some triangulation with other data that the BL provides.
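To make the two approaches concrete, here is a rough sketch in Python. It assumes the link data can be reduced to rows of (year, source host, target host, link count), and that page totals per host and year can be obtained from elsewhere. The row layout, the `pages` lookup, the host names and all the numbers are invented for illustration; none of them is taken from the BL data.

```python
from collections import defaultdict

# Hypothetical rows: (year, source_host, target_host, link_count).
# This layout is an assumption for illustration, not the dataset's documented format.
rows = [
    (2004, "a.co.uk", "b.co.uk", 10),
    (2005, "a.co.uk", "b.co.uk", 40),
    (2006, "a.co.uk", "b.co.uk", 5),
    (2005, "a.co.uk", "c.co.uk", 2),
]

# Hypothetical page totals per (year, host); in practice these would have
# to be triangulated from other data.
pages = {
    (2004, "a.co.uk"): 100,
    (2005, "a.co.uk"): 800,
    (2006, "a.co.uk"): 50,
}


def persistence(rows):
    """Map each (source, target) pair to the sorted years in which it appears."""
    years = defaultdict(set)
    for year, src, dst, _count in rows:
        years[(src, dst)].add(year)
    return {pair: sorted(ys) for pair, ys in years.items()}


def links_per_page(rows, pages):
    """Divide each link count by the linking host's page total for that year.

    Rows with no known (or zero) page total are skipped rather than guessed.
    """
    rates = {}
    for year, src, dst, count in rows:
        total = pages.get((year, src))
        if total:
            rates[(year, src, dst)] = count / total
    return rates


if __name__ == "__main__":
    for pair, ys in persistence(rows).items():
        print(pair, "seen in", ys)
    for key, rate in sorted(links_per_page(rows, pages).items()):
        print(key, round(rate, 3))
```

In these made-up figures the raw count from a.co.uk to b.co.uk quadruples between 2004 and 2005, but the rate per page halves because the linking site grew eightfold. That is the reason the raw counts are hard to read on their own.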

