Internet Geographer

Blog

Mapping the Geographies of Wikipedia Content

The Internet surrounds us like air, saturating our offices and our homes. But it’s not confined to the ether. You can touch it. You can map it. And you can photograph it - Andrew Blum 2009


The following maps represent the first stage of a project I am embarking on to map out some of the spatial contours of Wikipedia. Data were obtained from the August 2009 Wikipedia geodata dump organised by user Kolossos. The information was then ported over to a GIS. There are almost half a million geotagged Wikipedia articles (i.e. Wikipedia articles about a place or an event that occurred in a distinct place), so the preparation time alone for the files needed to create these maps was almost a week.

The map below displays the total number of Wikipedia articles tagged to each country. The country with the most articles is the United States (almost 90,000 articles). Anguilla has the fewest number of geotagged articles (4), and indeed most small island nations and city states have less than 100 articles. However, it is not just microstates that are characterised by extremely low levels of wiki representation. Almost all of Africa is poorly represented in Wikipedia. Remarkably there are more Wikipedia articles written about Antarctica than all but one of the fifty-three countries in Africa (or perhaps even more amazingly, there are more Wikipedia articles written about the fictional places of Middle Earth and Discworld than about many countries in Africa, the Americas and Asia).

When examining the data normalised by area, an entirely different pattern is evident. Central and Western Europe, Japan and Israel have the most articles per landmass, while large countries like Russia and Canada have low ratios of Wikipedia articles per area.

Finally, the data were also mapped out against population. Here countries with small populations and large landmasses rise to the top of the rankings. Canada, Australia and Greenland all have extremely high levels of articles per every 100,000 people. Smaller nations with many noteworthy features or geotaggable events also appear high in the rankings (e.g. Pitcairn or Iceland).


As I’ve previously argued, Wikipedia is an important component of the palimpsests of place. In other words, presences and absences play a fundamental role in shaping how we interpret and interact with the world. The fact that the geographies of Wikipedia content are so uneven therefore leads to worrying conclusions. As we increasingly rely on peer produced information, large parts of the world remain ‘terra incognita’ (in a similar manner to the ways in which many of those same places were represented on European maps before the 19th Century). However, it is conceivable that it will only be a matter of time until the empty spaces on the Wikipedia map are filled in by Wikipedians in Zambia, Indonesia, and much of the rest of world.

These data certainly warrant a closer look, and I’ll aim to get more maps (examining the distribution of content in specific languages, and looking in more detail at specific regions) uploaded soon.