Politicians on Wikipedia and DBpedia


Title Politicians on Wikipedia and DBpedia
URI http://doi.org/10.7802/1515
Primary Researcher Wagner, Claudia;GESIS
Publication Year 2017
Availability Free access (without registration)
Contributor Reif, Marcel;University of Koblenz;Project Member
Rajasekaran, Kandhasamy;University of Koblenz;Project Member
Iusupova, Tevriz;University of Koblenz;Project Member
Vasilev, Evgenii;University of Koblenz;Project Member
Subject Area No classification applied
Topic Classification Person, Personality, Role
Abstract This dataset contains information about politicians from DBpedia, a crowd-sourced community effort to extract structured information from Wikipedia and make it available on the Web. Important information about people that is available on DBpedia includes name, gender, nationality, occupation, birth date, death date, profession and, for many politicians, the political party they belong to. The dataset is based on the DBpedia dump from October 2015 and documents, at monthly intervals, the temporal evolution of the hyperlink network that articles about politicians formed on Wikipedia between 2001 and 2016. Wikipedia maintains revisions of each article to keep track of changes over time. The first revision of each month was used to construct the hyperlink network between articles about politicians.
Universe All Wikipedia articles that relate to an instance of class Person in DBpedia (dump October 2015) and were identified as politicians.
Selection Method Full population
Data Collection Mode Other
Survey Period 2017;2017
Licenses CC BY-SA 4.0
Notes The data is TAB-delimited. The data parsers are written in Python and convert RDF (Resource Description Framework) data to TAB-delimited data.
Source http://wiki.dbpedia.org/Downloads2015-10
https://github.com/kandy-koblenz/people-networks/tree/dbpedia-data/dbpedia-data
https://github.com/kandy-koblenz/people-networks
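
The abstract states that the first revision of each month was used to build the hyperlink network. The selection step can be sketched as follows. This is only an illustration, not the project's actual parser: the function name, the (revision id, timestamp) input shape and the ISO timestamp format are assumptions, and the record does not say how months without any revision were handled, so the sketch simply reports months that have at least one revision.

```python
from datetime import datetime

def first_revision_per_month(revisions):
    """Map (year, month) to the id of the earliest revision in that month.

    `revisions` is an iterable of (revision_id, timestamp) pairs, where
    the timestamp looks like "2001-06-02T08:30:00Z" (assumed format).
    """
    earliest = {}
    for rev_id, stamp in revisions:
        ts = datetime.strptime(stamp, "%Y-%m-%dT%H:%M:%SZ")
        key = (ts.year, ts.month)
        # Keep the revision with the smallest timestamp seen for this month.
        if key not in earliest or ts < earliest[key][0]:
            earliest[key] = (ts, rev_id)
    return {key: rev_id for key, (_, rev_id) in earliest.items()}
```

The input does not need to be sorted, because each month keeps only the smallest timestamp seen so far.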

Files in this submission

Files Size Format Download Option Description
person-data.tsv 194.0Mb Unknown Download A TAB-delimited dataset. Each Wikipedia article in DBpedia is kept only if the instance is of type person. The format is as follows: #DBpURL ID WikiURL gender name birthDate deathDate occupation nationality party. If a particular piece of information is not available, 'NA' is substituted. Version: 1.0.0; 2017-08-11. Language: eng (English). Number of variables: 10.
File checksum: MD5:0b0971c78a2d9263bb880f3648e00e83
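
A minimal sketch of reading person-data.tsv, based only on the column list and the 'NA' convention given in the file description. The function name is hypothetical, and two points are assumptions: that a header line, if present, starts with '#', and that fields contain no quoted TABs.

```python
import csv
import io

# Column order as listed in the file description.
COLUMNS = ["DBpURL", "ID", "WikiURL", "gender", "name",
           "birthDate", "deathDate", "occupation", "nationality", "party"]

def read_person_rows(handle):
    """Yield one dict per data row, with 'NA' replaced by None."""
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # Assumption: blank lines and lines starting with '#' carry no data.
        if not row or row[0].startswith("#"):
            continue
        record = dict(zip(COLUMNS, row))
        yield {key: (None if value == "NA" else value)
               for key, value in record.items()}

# Usage with a made-up row (not taken from the dataset):
sample = io.StringIO(
    "#DBpURL\tID\tWikiURL\tgender\tname\tbirthDate\tdeathDate\t"
    "occupation\tnationality\tparty\n"
    "http://dbpedia.org/resource/Example\t1\t"
    "http://en.wikipedia.org/wiki/Example\tfemale\tExample\t"
    "1900-01-01\tNA\tpolitician\tNA\tNA\n"
)
rows = list(read_person_rows(sample))
```

For the real file, the handle would typically come from open("person-data.tsv", encoding="utf-8", newline=""); the encoding is an assumption, since the record does not state it.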
politician-data-wikipedia-edge-list.zip 60.69Mb application/zip Download The zipped folder contains multiple files, each representing the link information between politicians for one month between 2001 and 2016. File names have the format 'year'_'month'.csv. For example, '2001_06.csv' contains all the links between politicians in Wikipedia that existed on 1 June 2001. Each line has the format: sourceid, targetid. The identifiers in these files correspond to the identifiers in the file 'politician-data.tsv'. That file is similar to 'person-data.tsv' but contains only politicians. The politician dataset is prepared from the 'MappingbasedLiterals' and 'Mappingbased Object' datasets of the DBpedia dump, which represent the Infobox and Category sections of Wikipedia. A person was classified as a politician if the keyword 'politician' showed up in the Occupation property of the Infobox or in the Category section. Version: 1.0.0; 2017-08-11. Language: eng (English).
File checksum: MD5:567bd323bb18d84ef621649379c3014d
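
The monthly edge lists described above can be read straight from the archive. The following is a sketch under stated assumptions rather than the project's own loader: the function names are hypothetical, the files are assumed to be UTF-8 and possibly nested in a folder inside the zip, identifiers are kept as strings, and a header row reading "sourceid, targetid" is skipped if one exists.

```python
import csv
import io
import zipfile

def parse_edges(lines):
    """Return (source_id, target_id) string pairs from comma-separated lines."""
    edges = []
    for row in csv.reader(lines):
        if len(row) < 2:
            continue  # ignore blank or malformed lines
        source, target = row[0].strip(), row[1].strip()
        # Assumption: a header row, if present, names the two columns.
        if source.lower() == "sourceid" and target.lower() == "targetid":
            continue
        edges.append((source, target))
    return edges

def read_month_edges(zip_source, year, month):
    """Read the edge list for one month, e.g. 2001_06.csv for June 2001.

    `zip_source` may be a path or a binary file-like object.
    """
    suffix = f"{year:04d}_{month:02d}.csv"
    with zipfile.ZipFile(zip_source) as archive:
        # Match on the file-name suffix so a folder prefix does not matter.
        matches = [name for name in archive.namelist() if name.endswith(suffix)]
        if not matches:
            raise KeyError(f"no file ending in {suffix} in the archive")
        with archive.open(matches[0]) as raw:
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return parse_edges(text)
```

A call such as read_month_edges("politician-data-wikipedia-edge-list.zip", 2001, 6) would then return the links that existed on 1 June 2001, ready to be joined to 'politician-data.tsv' on the identifier column.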
