Geographical bias on Wikipedia

Wikipedia has been criticized for the inequality in the distribution of its volunteer-created content with respect to the geographical association of article subjects. The research shows that despite considerable differences of this distribution depending on the language of Wikipedia, there is a common trend towards more content related to the United States and Western Europe coupled with a lack of information about certain regions in the rest of the world.[1]

Analysis

Several studies on internet geography and Wikipedia were published by the members of the Oxford Internet Institute (OII).

A 2 December 2009 article by Mark Graham of OII in The Guardian presented a color-coded map of the world that illustrated the disparity between the numbers of geotagged Wikipedia articles (in all languages) for countries from the Global North versus the Global South. Graham wrote: "Almost the entire continent of Africa is geographically poorly represented in Wikipedia. Remarkably, there are more Wikipedia articles written about Antarctica than all but one of the 53 countries in Africa (or perhaps more amazingly, there are more Wikipedia articles written about the fictional places of Middle Earth and Discworld than about many countries in Africa, Asia, and the Americas).[2]"A 2010 analysis of Wikipedia people edits revealed that Asia, despite being the most populous continent, was represented in only 16.67% of edits; Africa (6.35%) and South America (2.58%) were equally underrepresented.[3] A 2011 study by OII found that 84 percent of articles tagged with a location were in Europe or North America, and that Antarctica had more entries than any African or South American nation.[4] According to Adama Sanneh, the founder of the WikiAfrica Education initiative, as of 2021 there were more articles on the English Wikipedia about Paris than Africa.[5]

In 2011 a breakdown by languages (Arabic, Egyptian Arabic, English, French, Hebrew, Persian and Swahili) was reported by OII. Graham reported some unexpected patterns in distributions; for example, Swahili Wikipedia has unusually many articles geotagged in Turkey. Graham explains this and similar artifacts by dedicated editors creating numerous stub articles in the areas of their interest.[6]

A 2015 OII paper reported on a highly uneven geography of participation in editing Wikipedia.[7] In particular, it found that contributors from low-income countries contribute to geographical imbalance by writing more about high-income countries than about their own.[8][9]

A 2016 study by David Laniado and Marc Miquel Ribé empirically confirmed that cultural identity is a significant motivator for Wikipedia editors affecting their works in areas which may be associated with their identity, in particular, in Wikipedia categories "Culture" and "Geography".[10] A similar conclusion was drawn in 2010 by Brent Hecht and Darren Gergle: Wikipedians tend to work on topics related to nearby locations.[11] Laniado and Ribé suggest that in order to overcome the imbalance stemming from cultural identities, Wikipedia has to promote editors from different language Wikipedias to propagate their cultural identities into other Wikipedias. To this end, the translator and article recommendation tools developed by the Wikimedia Foundation may be of use by providing options to select preferential content based on the keywords related to cultural identity.[10]

See also

  • List of countries by number of Internet users

Further reading

  • The following references are found in Beytía's article:
  • Graham, M., Hogan, B., Straumann, R. K. & Medhat, A. Uneven geographies of user-generated information: patterns of increasing informational poverty. Annals of the Association of American Geographers, 104, 746–764 (2014).
  • It was summarized in an article by Joseph Stromberg "Wikipedia's geography problem: There are more articles about Antarctica than Egypt", Vox.com 14 September 2014
  • Graham, M. Information geographies and geographies of information. New geographies (2015).
  • Roll, U. et al. Using Wikipedia page views to explore the cultural importance of global reptiles. Biological conservation, 204, 42–50 (2016).
  • Overell, S. E. & Rüger, S. View of the world according to Wikipedia: Are we all little Steinbergs? Journal of Computational Science, 2, 193–197 (2011).
  • Graham, M., Hale, S. A. & Stephens, M. Geographies of the World's Knowledge. (2011).
  • Graham, M., De Sabbata, S. & Zook, M. A. Towards a study of information geographies: (im)mutable augmentations and a mapping of the geographies of information. Geo: Geography and environment, 2, 88–105 (2015).

References

  1. Pablo Beytía, "The Positioning Matters. Estimating Geographical Bias in the Multilingual Record of Biographies on Wikipedia", SSRN Electronic Journal, 2020^
  2. Mark Graham. Wikipedia's known unknowns The Guardian.co.uk, 2 December 2009, retrieved 12 June 2020^
  3. Randall M. Livingstone. Let's Leave the Bias to the Mainstream Media: A Wikipedia Community Fighting for Information Neutrality M/C Journal, 23 November 2010^
  4. Tom Simonite. The Decline of Wikipedia MIT Technology Review, 22 October 2013, retrieved 26 May 2021^
  5. Jonathan Greig. For Wikipedia's 20th anniversary, students across Africa add vital information to site TechRepublic, 16 April 2021, retrieved 11 May 2021^
  6. Simon Rogers. The world of Wikipedia's languages mapped The Guardian, 11 November 2011^
  7. Graham, M., Straumann, R., Hogan, B. 2015. "Digital Divisions of Labour and Informational Magnetism: Mapping Participation in Wikipedia". Annals of the Association of American Geographers. 105(6) 1158-1178. (free pre-publication version)^
  8. James Temperton "Wikipedia's world view is skewed by rich, western voices", Wired, 15 September 2015^
  9. Wikipedia's view of the world is written by the west the Guardian, 15 September 2015, retrieved 6 April 2022^
  10. David Laniado, Marc Miquel Ribé. Proceedings of the 7th 2016 International Conference on Social Media & Society - SMSociety '16 11–13 July 2016^
  11. Brent J. Hecht, Darren Gergle. Proceedings of the 2010 ACM conference on Computer supported cooperative work Association for Computing Machinery, 6 February 2010^