Wikipedia Social Influence and Election Dataset
Social Media and Posts
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Wikipedia's administrative selection process relies on a community-driven voting system where members decide which editors should be granted admin privileges. This collection captures the dynamics of these elections, transforming over a decade of community interactions into a directed, signed network. By mapping votes as edges between members, the data provides a window into the social hierarchies, trust, and conflict resolution mechanisms within a large-scale peer-production community. The records reflect the consensus-building efforts required for an editor to transition into an administrative role.
Columns
- INDEX: A unique numerical identifier for each specific vote record.
- SOURCE: The username of the Wikipedia member casting the vote.
- TARGET: The username of the candidate running for the administrative position.
- VOTE: The specific sentiment of the vote, categorised as support (1), neutral (0), or oppose (-1).
- RESULT: The final outcome of the request for adminship, indicating whether the candidate was accepted (1) or rejected (-1).
Distribution
The information is contained within a single CSV file titled
wikiRfA.csv, with a total file size of approximately 6.46 MB. It features 198,275 records representing individual votes cast by over 11,000 unique users. The data is structured for high reliability, with 100% validity for the target, vote, and result fields. This resource is provided as an annual release to ensure historical records are maintained.Usage
This resource is ideal for social network analysis and studying the evolution of digital communities. It is well-suited for training machine learning models to predict election outcomes or for identifying patterns of collaboration and opposition. Researchers can also use the signed network structure to test theories on structural balance and social status within online environments. Furthermore, it serves as a robust foundation for sentiment analysis when paired with the original textual comments.
Coverage
The scope covers the Wikipedia Requests for Adminship process from its inception in 2003 through to May 2013. It encompasses 11,381 unique users (voters and candidates) and 189,004 distinct voter/candidate pairs. The data includes every type of vote cast during this decade, providing a thorough view of the administrative selection landscape across the entire global Wikipedia community during this period.
License
CC0: Public Domain
Who Can Use It
Sociologists can leverage these records to examine the formation of power structures and reputation in open-source projects. Data scientists may utilise the network graph to practice community detection and link prediction algorithms. Additionally, community managers can study these historical voting patterns to better understand the consensus-building process in large-scale digital ecosystems.
Dataset Name Suggestions
- Wikipedia RfA Signed Social Network
- Wikipedia Adminship Election Voting Records (2003–2013)
- Community Governance and Peer-Review Network Data
- Signed Directed Graph of Wikipedia Adminship Requests
- Wikipedia Social Influence and Election Dataset
Attributes
Original Data Source: Wikipedia Social Influence and Election Dataset
Loading...
Free
Download Dataset in CSV Format
Recommended Datasets
Loading recommendations...
