Job Market Twitter Data

Education & Learning Analytics

Tags and Keywords

Education

NLP

Data

Cleaning

Psychology

Jobs

Career

Employment

Text

Pre-processing

Free

About

This dataset consists of 50,000 tweets about job vacancies and hiring. The tweets were gathered with the snscrape Python library, using the search keywords 'Job Vacancy', 'We are Hiring', and 'We're Hiring'. The primary aim of the dataset is to support practice with text pre-processing techniques and to test Natural Language Processing (NLP) skills. It can also be used to derive insights into the job market from actual job postings and to analyse company and role requirements.

Columns

  • ID: The unique identifier for each individual tweet.
  • Timestamp: The precise date and time when the tweet was posted.
  • User: The Twitter handle of the user responsible for posting the tweet.
  • Text: The actual content of the tweet itself.
  • Hashtag: Any hashtags that were included within the tweet.
  • Retweets: The total count of times the tweet had been retweeted at the point of scraping.
  • Likes: The total count of likes the tweet had accrued at the point of scraping.
  • Replies: The total count of replies to the tweet at the point of scraping.
  • Source: The application or device used to post the tweet.
  • Location: The location specified on the user's Twitter profile, if available.
  • Verified_Account: A Boolean value indicating whether the user's Twitter account was verified.
  • Followers: The number of followers the user had at the point the tweet was scraped.
  • Following: The number of accounts the user was following at the point the tweet was scraped.
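As a rough illustration of how the columns above might be read and type-checked, here is a minimal sketch using only the Python standard library. It assumes the CSV header uses exactly the column names listed above and that Verified_Account is stored as the text "True" or "False"; neither detail is confirmed by the listing. The sample row is made up for demonstration and is not taken from the dataset.

```python
import csv
import io

# Column names as given in the listing; the exact CSV header spelling is an assumption.
EXPECTED_COLUMNS = [
    "ID", "Timestamp", "User", "Text", "Hashtag", "Retweets", "Likes",
    "Replies", "Source", "Location", "Verified_Account", "Followers", "Following",
]
COUNT_COLUMNS = ("Retweets", "Likes", "Replies", "Followers", "Following")


def read_tweets(handle):
    """Yield one dict per tweet, with count columns converted to int."""
    reader = csv.DictReader(handle)
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    for row in reader:
        for col in COUNT_COLUMNS:
            # Treat an empty count as zero (an assumption about missing values).
            row[col] = int(row[col] or 0)
        # Assumes the flag is stored as the text "True" or "False".
        row["Verified_Account"] = row["Verified_Account"].strip().lower() == "true"
        yield row


# Made-up sample row for illustration only; not taken from the dataset.
SAMPLE = io.StringIO(
    ",".join(EXPECTED_COLUMNS) + "\n"
    '1,2023-01-05 09:30:00,example_user,"We are Hiring! Apply now",'
    "#hiring,3,10,1,Twitter Web App,London,False,250,180\n"
)
rows = list(read_tweets(SAMPLE))
print(rows[0]["User"], rows[0]["Likes"], rows[0]["Verified_Account"])
```

To read the real file, the same function could be given an open file handle, for example `open(path, newline="", encoding="utf-8")`, where `path` is wherever the downloaded CSV was saved.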

Distribution

The dataset is provided in CSV format and comprises 50,000 records, one per tweet.

Usage

This dataset is ideally suited for:
  • Text Pre-processing Practice: Users can experiment with various text cleaning and normalisation techniques.
  • Natural Language Processing (NLP) Skill Development: It serves as an excellent resource for developing and testing NLP models.
  • Job Market Analysis: Gaining insights into job market trends, popular roles, and hiring patterns based on social media data.
  • Company/Role Requirement Analysis: Examining the specific requirements or characteristics mentioned in job postings.
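The pre-processing and analysis uses listed above could start from something like the following sketch. The cleaning steps chosen here (removing URLs and mentions, keeping hashtag words, lower-casing, dropping punctuation) are one common choice and not a method prescribed by the dataset; the function names `clean_tweet` and `top_hashtags` are hypothetical, and the example tweet is invented.

```python
import re
from collections import Counter

URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
HASHTAG_RE = re.compile(r"#(\w+)")
NON_WORD_RE = re.compile(r"[^a-z0-9\s]")


def clean_tweet(text):
    """Normalise a tweet for simple bag-of-words style analysis."""
    text = URL_RE.sub(" ", text)        # drop links
    text = MENTION_RE.sub(" ", text)    # drop @mentions
    text = HASHTAG_RE.sub(r"\1", text)  # keep the hashtag word, drop '#'
    text = text.lower()
    text = NON_WORD_RE.sub(" ", text)   # replace punctuation with spaces
    return " ".join(text.split())       # collapse repeated whitespace


def top_hashtags(texts, n=3):
    """Count hashtags case-insensitively across raw tweet texts."""
    counts = Counter(
        tag.lower() for text in texts for tag in HASHTAG_RE.findall(text)
    )
    return counts.most_common(n)


# Invented example tweet, for illustration only.
example = "We're Hiring! Data Analyst in London @acme https://example.com/jobs #Hiring #DataJobs"
print(clean_tweet(example))
print(top_hashtags([example, "Job Vacancy: nurse #hiring"], n=1))
```

Note that replacing punctuation with spaces splits contractions such as "We're" into two tokens; a tokeniser aware of contractions may suit some NLP tasks better.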

Coverage

The tweets in this dataset were collected between 1 January 2019 and 10 April 2023. The listing gives the geographic coverage as global. It provides no further notes on data availability for particular demographic groups or years.
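A simple sanity check on the stated collection period could look like the sketch below. It assumes the Timestamp column holds ISO 8601 style strings such as "2023-01-05 09:30:00", which the listing does not confirm, and the function name `in_collection_window` is hypothetical.

```python
from datetime import date, datetime

# Collection period as stated in the listing.
COLLECTION_START = date(2019, 1, 1)
COLLECTION_END = date(2023, 4, 10)


def in_collection_window(timestamp):
    """Return True if an ISO 8601 timestamp falls inside the stated period."""
    # Comparing on the calendar date works for naive and timezone-aware values.
    posted = datetime.fromisoformat(timestamp).date()
    return COLLECTION_START <= posted <= COLLECTION_END


print(in_collection_window("2019-01-01 00:00:00"))
print(in_collection_window("2023-04-11 08:00:00"))
```

Rows that fall outside the window would be worth inspecting before any trend analysis, since they may point to a different timestamp format or timezone than assumed here.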

License

CC0

Who Can Use It

This dataset is intended for a variety of users, including:
  • Data Scientists and Analysts: For conducting text analysis, building predictive models, or extracting actionable insights.
  • NLP Researchers: For developing and refining algorithms related to text classification, topic modelling, or sentiment analysis on social media data.
  • Human Resources (HR) Professionals: To understand job market dynamics, identify hiring trends, or benchmark their own job postings.
  • Students and Educators: As a practical resource for learning and teaching about data analysis, social media data, and NLP.

Dataset Name Suggestions

  • Job Vacancy Tweets
  • Social Media Job Postings
  • Hiring Tweets Dataset
  • Employment Tweets
  • Job Market Twitter Data

Attributes

Original Data Source: Job Vacancy Tweets

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 22/06/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
