Multilingual historical narratives on Wikipedia

Multilingual historical narratives on Wikipedia

DSpace/Manakin Repository

Multilingual historical narratives on Wikipedia

Show full item record

Title Multilingual historical narratives on Wikipedia
URI http://doi.org/10.7802/1411
Primary Researcher Samoilenko, Anna;GESIS
Publication Year 2017
Availability Freier Zugang (ohne Registrierung)
Contributor Strohmaier, Markus;GESIS, University of Koblenz-Landau;Supervisor
Weller, Katrin;GESIS;Project Member
Zens, Maria;GESIS;Project Member
Lemmerich, Florian;GESIS, University of Koblenz-Landau;Project Member
Samoilenko, Anna;GESIS, University of Koblenz-Landau;Contact Person
Subject Area Social History, Historical Social Research
Interdisciplinary and Applied Fields of the Social Sciences
Humanities
History
Cultural Sociology, Sociology of Art, Sociology of Literature
Sociology of Knowledge
Media Contents, Content Analysis
Topic Classification Society, Culture
Historical Social Research
Communication, Public Opinion, Media
Historical Studies Data
Abstract Portrayals of history are never complete, and each description inherently exhibits a specific view- point and emphasis. In this work, we automatically identified such differences by computing time- lines and detecting temporal focal points of written history across languages on Wikipedia. In particular, we studied articles related to the history of all UN member states and compared them in 30 language editions. We developed a computational approach that allows to identify focal points quantitatively, and found that Wikipedia narratives about national histories (i) are skewed towards more recent events (recency bias) and (ii) are distributed unevenly across the continents with sig- nificant focus on the history of European countries (Eurocentric bias). Thus, our work explored how colonial ties shape popular historiography on Wikipedia. We also established that national historical timelines vary across language editions, although average interlingual consensus is rather high. We hope that this work provides a starting point for a broader computational analysis of written history on Wikipedia and elsewhere.
Universe Main text of Wikipedia articles on history of 193 UN memberstates (and their outlinks) in 30 language editions, collected in July 2016
Selection Method Live-crawling of Wikipedia pages
Data Collection Mode Content Analysis
Other
Survey Period 2016-07;2016-07
Licenses CC BY-NC 4.0
Source the free encyclopedia Wikipedia
Publications Samoilenko, Anna and Lemmerich, Florian, and Zens, Maria and Weller, Katrin and Strohmaier, Markus "Analysing Timelines of National Histories across Wikipedia Editions: A Comparative Computational Approach" to appear in the ICWSM'17 volume as a full paper.;

Files in this submission

Files Size Format Download Option Description
collected_dates_per_decade_by_country_matrices.zip 501.4Kb application/zip Download {"dbk_file_desc":"dataset 1: collected dates (counts by country, language, decade)","dbk_version_series_desc":"","dbk_version_number_desc":"","dbk_resourcetype_series_desc":"","dbk_resourcetype_number_desc":"Dataset","dc_language_iso_desc":"","dbk_dataset_variable_desc":"","dbk_dataset_item_series_desc":"","dbk_dataset_item_number_desc":"","dbk_software_desc":"","dbk_alternativeid_desc":"","dbk_relatedid_series_desc":"","dbk_relatedid_number_desc":"","description":""}
File checksum: MD5:a29f8c0a5409da6a36151151018019ca
jensen_shannon_divergence_years_matrices.zip 261.9Kb application/zip Download {"dbk_file_desc":"analysis 1: Inter-lingual similarity","dbk_version_series_desc":"","dbk_version_number_desc":"","dbk_resourcetype_series_desc":"","dbk_resourcetype_number_desc":"","dc_language_iso_desc":"","dbk_dataset_variable_desc":"","dbk_dataset_item_series_desc":"","dbk_dataset_item_number_desc":"","dbk_software_desc":"","dbk_alternativeid_desc":"","dbk_relatedid_series_desc":"","dbk_relatedid_number_desc":"","description":""}
File checksum: MD5:c26f67e5888abea94aff3d69f1700aae
z-scores.zip 157.6Kb application/zip Download {"dbk_file_desc":"Analysis 2: Significant temporal points by country, decade","dbk_version_series_desc":"","dbk_version_number_desc":"","dbk_resourcetype_series_desc":"","dbk_resourcetype_number_desc":"","dc_language_iso_desc":"","dbk_dataset_variable_desc":"","dbk_dataset_item_series_desc":"","dbk_dataset_item_number_desc":"","dbk_software_desc":"","dbk_alternativeid_desc":"","dbk_relatedid_series_desc":"","dbk_relatedid_number_desc":"","description":""}
File checksum: MD5:d9d1fc4dab7e352e69885e727d1e952b
evaluation_final_error_rates.csv 1.710Kb Unknown Download {"dbk_file_desc":"Data evaluation (by language edition and century)","dbk_version_series_desc":"","dbk_version_number_desc":"","dbk_resourcetype_series_desc":"","dbk_resourcetype_number_desc":"","dc_language_iso_desc":"","dbk_dataset_variable_desc":"","dbk_dataset_item_series_desc":"","dbk_dataset_item_number_desc":"","dbk_software_desc":"","dbk_alternativeid_desc":"","dbk_relatedid_series_desc":"","dbk_relatedid_number_desc":"","description":""}
File checksum: MD5:c07f040dad5c3376ce35388998bfba5a
dates_extraction.py 2.934Kb Unknown Download {"dbk_file_desc":"Python code for extracting the data","dbk_version_series_desc":"","dbk_version_number_desc":"","dbk_resourcetype_series_desc":"","dbk_resourcetype_number_desc":"Software","dc_language_iso_desc":"","dbk_dataset_variable_desc":"","dbk_dataset_item_series_desc":"","dbk_dataset_item_number_desc":"","dbk_software_desc":"","dbk_alternativeid_desc":"","dbk_relatedid_series_desc":"","dbk_relatedid_number_desc":"","description":""}
File checksum: MD5:5e791009507f15ce825348d76c7095bc
README.txt 3.423Kb Text file Download {"dbk_file_desc":"Notes on using the dataset","dbk_version_series_desc":"","dbk_version_number_desc":"","dbk_resourcetype_series_desc":"","dbk_resourcetype_number_desc":"Text","dc_language_iso_desc":"","dbk_dataset_variable_desc":"","dbk_dataset_item_series_desc":"","dbk_dataset_item_number_desc":"","dbk_software_desc":"","dbk_alternativeid_desc":"","dbk_relatedid_series_desc":"","dbk_relatedid_number_desc":"","description":""}
File checksum: MD5:585949120c656c8b6a2182bf2f3325d5

This item appears in the following Collection(s)

Show full item record

Search DSpace


Advanced Search

Browse

My datorium