Reading creationism in the web archive

[UPDATE, April 2018. Since this post was published it has attracted a couple of citations in the formal academic literature. Although this research is as yet unpublished, there is available now a conference paper from 2015 which documents the case more fully: Reading British creationism in the web archive (ReSAW conference, Aarhus, 2015)]

In recent years, anti-evolutionist thinking has attracted some attention in the news, mostly because of the role of some Christian free schools in teaching anti-evolutionist ideas alongside or in place of evolution. Anti-evolutionist ideas are however by no means new, and have been a durable minority view in some of the churches, picking up speed from the 1960s onwards. (Although the term ‘creationism’ is colloquially used to cover all the particular variants of this thinking, I use the more general term ‘anti-evolutionist’ here.)

It is not always easy to gauge the strength of the movement, but the archived UK web allows a new angle of view on the question. In theory, the web allows minority views to flourish in proportion with their intrinsic attractiveness and plausibility, no longer constrained by the high barriers to entry to traditional publishing. And in the absence of publicly available web usage statistics for the main sites, it is possible to analyse the structure of links to these sites as a proxy measure of attention (both positive and negative.)

Using the Host Link Graph dataset, available from the British Library, I extracted all the unique hosts that had been found linking to any one of four prominent anti-evolutionist sites at any point between 1996 and 2010. Then, using both the live web and of the Internet Archive’s interface at http://archive.org, I examined each host in order to categorise it, which I was able to do for 91% of the results. One immediate point to note is precisely how many “false” results there are. A large proportion of the hosts (34%) are categorised as Other, most of which were links associated with search engine and other directory-type sites, rather than from any host representing an autonomous actor in the field. Excluding these as well, the analysis of the remainder is shown below:

anti-evolutionists

Of the remainder, 39% are the sites of individual congregations. A full analysis of these sites (39 in total) is yet to be done, but the majority are independent evangelical churches, with a handful of Baptist churches. They include very few indeed from Anglican, Roman Catholic or Methodist congregations. Given that at the time of writing the Evangelical Alliance has a membership of 3,500 individual congregations, the magnitude of these numbers suggests that anti-evolutionism is a minority view even amongst evangelical churches.

As might be expected, a significant proportion (17%) are other anti-evolutionist sites; a later post will explore the nature of this particular network. Interestingly, few inbound links are from secularist organisations, other than the British Centre for Science Education which exists to document (and counter) creationist ideas. Once data is available for the period after 2010, it may be that this interest grows as the schools controversy mounts. There are also very few links in from the mainstream media, which might also be expected to grow after 2010.

A complaint often heard from anti-evolutionists is that the scientific “establishment” does not engage with the critique of evolution which is being offered. That claim would seem to be confirmed here, as both the proportion and absolute number of inbound links from academic domains are also very small.

In sum, this data would suggest that between 1996 and 2010, British creationism was talking largely to itself, and was mostly ignored by academia, the media and most of the churches.

Data
You can download the data, which is in the public domain, from here . Be sure to have plenty of hard disk space as, when unzipped, the data is more than 120GB. The data looks like this:

2010 | churchtimes.co.uk | archbishopofcanterbury.org | 20

which tells you that in 2010, the Internet Archive captured 20 individual resources (usually, although not always, “pages”) in the Church Times site that linked to the archbishop of Canterbury’s site.

Assumptions

(i) that a host “abc.co.uk” held the same content as “www.abc.co.uk”.

(ii) that the Internet Archive were no more likely to miss hosts that linked to these sites than ones that did not – ie., if there are gaps in what the Internet Archive found, there is no reason to suppose that they systematically skew this particular analysis.

(iii) that my sample of four target sites was reasonably representative of the movement as a whole. It is therefore possible that the profile of inbound links is very different for another hosts of the same type.

(iv) the analysis does not include cases where a site moved from one host to another during the time period. The host URLs used are those in current use, and so if another host linked to a previous host and that link was not subsequently updated, then that linkage will not be recorded in this data.

(iv) that the inconsistency in deduplication at the British Library noted here does not affect this analysis.

Prefer to read this as an email?

Sign up to receive each new post, in full, direct to your inbox.

(And nothing else.)

5 thoughts on “Reading creationism in the web archive

  1. maxkemman November 19, 2014 / 10:16 am

    Thanks for this post, interesting approach. If you don’t mind, I have some questions; you write “Of the remainder, 39% are the sites of individual congregations. A full analysis of these sites (39 in total) is yet to be done”. Does this mean your analysis concerned 100 websites in total? Is that a lot, or a few? And are there differences in how often certain websites link to these anti-evolution pages?

    • peterwebster November 19, 2014 / 10:40 am

      Many thanks Max for your comment. I didn’t go into the significance of the absolute numbers because it is not yet clear what a high, medium or low number of inbound linking hosts might be. There just hasn’t been enough use of this data so far to get a sense; my only comparator is this earlier post on the archbishop of Canterbury’s site, which had more hosts linking to it as you might expect, but whether this is as many more as we should expect, I don’t know.
      http://peterwebster.me/2014/01/28/distant-reading-the-webarchive/

      On the number of times each site links out: there are two ways into this. One is to look at *for how long* a linkage persists (which one can do from this data, but I haven’t yet). The second is to look at how many links there were at any one time from one host to another. These numbers are declared in the data, but would only be meaningful if you could also understand how many pages the linking site had in total. That is rather more difficult, and involves some triangulation with other data that the BL provides.

Leave a Reply