Social Messaging Activity Logs
About
This dataset presents detailed metadata scraped from a personal WhatsApp chat, making it well suited to analysing communication trends and user activity patterns. It was originally developed as a learning project for those new to data science. Each record identifies who sent a message and when it was sent, down to the minute, which supports fine-grained temporal analysis.
Columns
- user: The identifier of the user who sent the message. There are 73 unique users, with one user, 'Debsrijan', accounting for the largest share of contributions (7%).
- message: The textual content of the message. Deleted messages are also indicated in this column.
- year: The year in which the message was authored. Messages span a period of several years but are heavily concentrated in 2021.
- month: The month the message was sent. All 12 calendar months are represented, with July being the most frequent.
- day: The day of the month (1 through 31) when the message was recorded.
- hour: The hour of the day (0 to 23) in which the user sent the message. The mean hour of transmission is approximately 14 (around 14:00).
- minute: The minute (0 to 59) corresponding to the time of transmission.
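Because the date and time are split across five columns, most analyses start by combining them into a single timestamp. The following is a minimal sketch using only the Python standard library. It assumes the column names listed above; whether `month` is stored as a number or as an English month name is not stated here, so the hypothetical helper `parse_month` accepts both. The example record is made up for illustration.

```python
from datetime import datetime

# Assumption: month may be stored as a number ("7") or a full English name ("July").
MONTH_NAMES = {
    name: index
    for index, name in enumerate(
        ["January", "February", "March", "April", "May", "June",
         "July", "August", "September", "October", "November", "December"],
        start=1,
    )
}


def parse_month(value: str) -> int:
    """Return the month number for a numeric string or a full English month name."""
    text = value.strip()
    if text.isdigit():
        return int(text)
    return MONTH_NAMES[text.capitalize()]


def row_to_timestamp(row: dict) -> datetime:
    """Combine the year, month, day, hour and minute fields of one record."""
    return datetime(
        int(row["year"]),
        parse_month(str(row["month"])),
        int(row["day"]),
        int(row["hour"]),
        int(row["minute"]),
    )


# Hypothetical record, made up for illustration (not taken from the dataset).
example = {"user": "alice", "message": "hello", "year": "2021",
           "month": "July", "day": "4", "hour": "14", "minute": "5"}
print(row_to_timestamp(example).isoformat())  # 2021-07-04T14:05:00
```

Abbreviated month names such as "Jul" would raise a `KeyError` in this sketch; extend the lookup table if the file uses them.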
Distribution
The data is provided as a single CSV file, "WhatsApp.csv", with a file size of 800.33 kB. It has 7 columns and contains 12,400 valid records. The data is not expected to be updated.
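A quick sanity check after download is to read the file and confirm that the seven documented columns are present. The sketch below uses the standard `csv` module; the function name `read_records` is hypothetical, and the in-memory sample stands in for the real file so the snippet runs on its own.

```python
import csv
import io

# Column names as documented in the Columns section.
EXPECTED_COLUMNS = ["user", "message", "year", "month", "day", "hour", "minute"]


def read_records(handle):
    """Read chat records from an open CSV text stream and check the header."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


# In-memory sample with made-up rows, standing in for WhatsApp.csv.
sample = io.StringIO(
    "user,message,year,month,day,hour,minute\n"
    "alice,hello,2021,July,4,14,5\n"
    "bob,hi there,2021,July,4,14,6\n"
)
records = read_records(sample)
print(len(records), records[0]["user"])  # 2 alice
```

With the real file, pass a handle opened with `open("WhatsApp.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption. The resulting list should contain about 12,400 records.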
Usage
This resource is well suited to analysis projects, particularly those studying online communication dynamics. Potential uses include building visualisations of group chat activity, tracking message frequency across different hours and days, and serving as foundational data for general text analysis studies. It is often used as a project for students learning data skills.
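Two of the uses mentioned above, message frequency by hour and each user's share of the conversation, can be sketched with `collections.Counter`. The function names and the sample records are hypothetical; the records are assumed to be dictionaries keyed by the documented column names.

```python
from collections import Counter


def messages_per_hour(records):
    """Return a 24-element list with the message count for each hour (0 to 23)."""
    counts = Counter(int(record["hour"]) for record in records)
    return [counts.get(hour, 0) for hour in range(24)]


def user_shares(records):
    """Return each user's fraction of all messages, most active user first."""
    counts = Counter(record["user"] for record in records)
    total = sum(counts.values())
    return {user: count / total for user, count in counts.most_common()}


# Made-up records for illustration (not taken from the dataset).
sample_records = [
    {"user": "alice", "hour": "14"},
    {"user": "alice", "hour": "14"},
    {"user": "bob", "hour": "9"},
    {"user": "alice", "hour": "23"},
]
hourly = messages_per_hour(sample_records)
print(hourly[14], hourly[9], hourly[0])  # 2 1 0
print(user_shares(sample_records))       # {'alice': 0.75, 'bob': 0.25}
```

The hourly list can be passed directly to a bar chart, and the share dictionary gives the kind of per-user percentage quoted for the user column.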
Coverage
The data covers chat activity primarily between 2018 and 2022, though the bulk of the records fall within 2021. All 12 calendar months are represented, with high volumes in July and April. The source context suggests the dataset originates from Asia.
License
CC0: Public Domain
Who Can Use It
- New Data Scientists and Students: Excellent for introductory projects focused on time-series analysis and string manipulation.
- Researchers: To model communication patterns within small online communities.
- Developers: For creating personal data visualisation tools based on messaging archives.
Dataset Name Suggestions
- WhatsApp Conversation Trends
- Social Messaging Activity Logs
- Chat Timeline Analysis Data
- User-Level Communication Metrics
Attributes
Original Data Source: Social Messaging Activity Logs
