Job Posting Skills Data
E-commerce & Online Transactions
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides detailed information on job postings, including their descriptions and required skills. It is designed for machine learning initiatives, particularly those focused on job matching, skill extraction, and natural language processing. Researchers and developers can utilise this data to create and assess models for career recommendation systems, CV parsing, and skill inference. The data includes both hard and soft skills extracted using RecAI APIs.
Columns
- job_id: A unique identifier for each individual job posting.
- category: The industry sector or classification the job posting belongs to, such as Information Technology, Business Development, Finance, Sales, or HR.
- job_title: The specific title of the job position advertised.
- job_description: A detailed textual account of the job posting, outlining responsibilities and necessary qualifications.
- job_skill_set: A collection of both hard and soft skills relevant to the job, which have been extracted through RecAI API services.
Distribution
The dataset is typically provided in a tabular format, such as CSV. Specific row or record counts are not available; however, the dataset features a variety of categories, with Information Technology and Business Development accounting for 21% and 20% respectively, and other categories making up the remaining 59%. There are 1,167 unique job titles and categories within the dataset.
Usage
- Skill Extraction: Pinpointing and parsing essential skills from job descriptions.
- Job-CV Matching: Aligning job descriptions with appropriate candidate profiles.
- Recommendation Systems: Developing models that suggest suitable jobs or training programmes based on required skills.
- Natural Language Processing (NLP): Conducting experiments with text-based models for recruitment and career analytics.
Coverage
The dataset's geographic coverage is global. It was listed on 5th June 2025. Specific details regarding time range or demographic scope are not provided.
License
CC-BY-SA
Who Can Use It
This dataset is intended for developers and researchers working on machine learning projects. Ideal users include those aiming to build and evaluate models for:
- Career recommendation systems.
- CV parsing tools.
- Skill inference applications.
- Solutions for job matching and skill extraction.
Dataset Name Suggestions
- Job Posting Skills Data
- Recruitment Skill Set
- Career Skills Database
- Job Description Analysis Data
- Skill Extraction for Jobs
Attributes
Original Data Source:job-skill-set