
Philippine Political Tweets Dataset

Government & Civic Records

Tags and Keywords

Tabular

Politics

NLP

Research

Asia

Free

About

This dataset provides a collection of tweets about the 2025 Philippine general election, offering insight into the public discourse and sentiment surrounding it. The election includes contests for all 317 seats in the House of Representatives, 12 of the 24 seats in the Senate, and various local offices across the country. The dataset also covers the first regular election to the Bangsamoro Parliament. Data collection ceased on 1st June 2025.

Columns

  • pseudo_id: A unique, obfuscated hash that stands in for usernames and other IDs, keeping records distinguishable while preserving anonymity. These IDs are typically up to 15 digits long.
  • text: The full content of the tweet.
  • retweetCount: The number of retweets the tweet had at the point of data extraction. It does not reflect the total lifetime count.
  • replyCount: The number of replies the tweet had at the point of data extraction. This is not a lifetime count.
  • likeCount: The number of likes the tweet had when the data was extracted. This figure is specific to the extraction time, not the tweet's entire existence.
  • quoteCount: The number of quotes the tweet had at the point of data extraction. This is a snapshot and not a lifetime total.
  • viewCount: The number of views the tweet had when extracted. This is an instantaneous value and not a running lifetime total.
  • bookmarkCount: The number of bookmarks the tweet had at the point of extraction. This is a current value, not reflecting the tweet's entire lifespan.
  • createdAt: The date and time when the tweet was originally created.
  • lang: The language in which the tweet was written.
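
As a rough illustration of how these columns might be read into typed values, the sketch below uses only Python's standard library. The sample rows are made up, the helper names `parse_row` and `load_tweets` are hypothetical, and the ISO 8601 format for createdAt is an assumption; the listing does not state the actual timestamp format.

```python
import csv
import io
from datetime import datetime

# Engagement columns named in the list above; all are snapshot counts.
COUNT_COLUMNS = [
    "retweetCount", "replyCount", "likeCount",
    "quoteCount", "viewCount", "bookmarkCount",
]


def parse_row(row):
    """Convert one CSV row (a dict of strings) into typed values."""
    parsed = dict(row)
    for col in COUNT_COLUMNS:
        raw = (row.get(col) or "").strip()
        # Empty cells become None; "12" and "12.0" both become 12.
        parsed[col] = int(float(raw)) if raw else None
    # ASSUMPTION: createdAt is ISO 8601; adjust if the real file differs.
    parsed["createdAt"] = datetime.fromisoformat(row["createdAt"])
    return parsed


def load_tweets(handle):
    """Read every row from an open CSV file handle."""
    return [parse_row(row) for row in csv.DictReader(handle)]


# Made-up sample rows for illustration only; not taken from the dataset.
SAMPLE_CSV = """pseudo_id,text,retweetCount,replyCount,likeCount,quoteCount,viewCount,bookmarkCount,createdAt,lang
100000000000001,Sample tweet one,3,1,12,0,450,2,2025-01-05T08:30:00+00:00,tl
100000000000002,Sample tweet two,0,0,5,1,,0,2025-05-20T14:00:00+00:00,en
"""

tweets = load_tweets(io.StringIO(SAMPLE_CSV))
print(tweets[0]["likeCount"], tweets[0]["createdAt"].year)
```

With the real file, the same `load_tweets` call could be given a handle from `open(path, newline="", encoding="utf-8")` in place of the in-memory sample.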

Distribution

The dataset is primarily in a tabular format, typically supplied as a CSV file. It contains approximately 217,472 records. The tweets were collected daily, covering a period from 24th December 2024 to 31st May 2025. The language distribution of the tweets indicates that about 83% are in Tagalog (tl) and 17% are in English (en), with a negligible percentage in other languages.
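
The language split quoted above could be checked with a small tally along these lines. This is a minimal sketch: `language_shares` is a hypothetical helper, and the sample codes are invented rather than drawn from the dataset.

```python
from collections import Counter


def language_shares(langs):
    """Return each language code's share of the total, largest first."""
    counts = Counter(langs)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {lang: n / total for lang, n in counts.most_common()}


# Made-up codes for illustration; real values would come from the lang column.
sample_langs = ["tl", "tl", "tl", "en"]
for lang, share in language_shares(sample_langs).items():
    print(f"{lang}: {share:.0%}")
```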

Usage

This dataset is ideal for:
  • Political discourse analysis: Understanding public conversation and trends related to the Philippine elections.
  • Sentiment analysis: Gauging public opinion and emotional responses towards candidates, parties, and election issues.
  • Social media research: Studying how political information spreads and how users engage with election-related content on the platform.
  • Identifying influential accounts: Utilising the accompanying "well-known authors" file to map and add context to high-engagement users.

Coverage

The dataset's coverage is geographically focused on the Philippines, specifically the 2025 general election. The temporal scope spans from 24th December 2024 to 31st May 2025. The data reflects the discourse among public Twitter users interacting with election-related keywords. Engagement metrics (retweetCount, replyCount, likeCount, quoteCount, viewCount, and bookmarkCount) are values captured at the time of data extraction, not lifetime totals, so tweets posted shortly before extraction may show lower engagement simply because they had less time to accumulate interactions. All user IDs and tagged usernames have been obfuscated to preserve privacy. Because of hashtag overlap, the dataset may contain some tweets related to "Pinoy Big Brother"; these have not been filtered out.
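
Two of the caveats above lend themselves to a short sketch: flagging likely "Pinoy Big Brother" tweets, and scaling a snapshot count by how long a tweet had been live. Both helpers are hypothetical. The listing does not say which hashtags overlap, so the patterns are guesses, and the extraction time is not a column in the dataset, so it is assumed to be supplied by the caller.

```python
import re
from datetime import datetime, timezone

# HYPOTHETICAL patterns: the listing does not say which hashtags overlap.
PBB_PATTERN = re.compile(
    r"\bpinoy big brother\b|#pbb\w*|\bpbb\b", re.IGNORECASE
)


def looks_like_pbb(text):
    """Return True if the text matches one of the guessed PBB patterns."""
    return bool(PBB_PATTERN.search(text))


def per_day(count, created_at, extracted_at, min_days=1.0):
    """Divide a snapshot count by the tweet's age in days at extraction.

    ASSUMPTION: extracted_at is supplied by the caller. Ages below
    min_days are raised to min_days so very new tweets do not produce
    inflated rates.
    """
    age_days = (extracted_at - created_at).total_seconds() / 86400
    return count / max(age_days, min_days)


# Made-up values for illustration only.
created = datetime(2025, 5, 21, tzinfo=timezone.utc)
extracted = datetime(2025, 5, 31, tzinfo=timezone.utc)
print(looks_like_pbb("Eviction night #PBBCollab"))
print(per_day(100, created, extracted))
```

A per-day rate is only one possible normalisation; it makes older and newer tweets roughly comparable but does not recover lifetime totals.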

License

CC-BY

Who Can Use It

This dataset is well-suited for:
  • Political scientists and academics analysing electoral processes and public opinion.
  • Data analysts seeking to quantify and visualise social media trends during an election cycle.
  • Journalists investigating current events and public sentiment leading up to the elections.
  • Researchers focusing on natural language processing (NLP) applications for social media text.

Dataset Name Suggestions

  • Tweets on Philippine Elections 2025
  • Philippine Election Discourse 2025
  • Social Media Data: Philippines General Election 2025
  • Philippine Political Tweets Dataset

Attributes

Listing Stats

  • Views: 4
  • Downloads: 0
  • Listed: 08/06/2025
  • Region: Asia
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
