Opendatabay APP

Job Posting Skills Data

E-commerce & Online Transactions

Tags and Keywords

Computer

Tabular

Data

Nlp

Feature

Text

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Job Posting Skills Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset provides detailed information on job postings, including their descriptions and required skills. It is designed for machine learning initiatives, particularly those focused on job matching, skill extraction, and natural language processing. Researchers and developers can utilise this data to create and assess models for career recommendation systems, CV parsing, and skill inference. The data includes both hard and soft skills extracted using RecAI APIs.

Columns

  • job_id: A unique identifier for each individual job posting.
  • category: The industry sector or classification the job posting belongs to, such as Information Technology, Business Development, Finance, Sales, or HR.
  • job_title: The specific title of the job position advertised.
  • job_description: A detailed textual account of the job posting, outlining responsibilities and necessary qualifications.
  • job_skill_set: A collection of both hard and soft skills relevant to the job, which have been extracted through RecAI API services.

Distribution

The dataset is typically provided in a tabular format, such as CSV. Specific row or record counts are not available; however, the dataset features a variety of categories, with Information Technology and Business Development accounting for 21% and 20% respectively, and other categories making up the remaining 59%. There are 1,167 unique job titles and categories within the dataset.

Usage

  • Skill Extraction: Pinpointing and parsing essential skills from job descriptions.
  • Job-CV Matching: Aligning job descriptions with appropriate candidate profiles.
  • Recommendation Systems: Developing models that suggest suitable jobs or training programmes based on required skills.
  • Natural Language Processing (NLP): Conducting experiments with text-based models for recruitment and career analytics.

Coverage

The dataset's geographic coverage is global. It was listed on 5th June 2025. Specific details regarding time range or demographic scope are not provided.

License

CC-BY-SA

Who Can Use It

This dataset is intended for developers and researchers working on machine learning projects. Ideal users include those aiming to build and evaluate models for:
  • Career recommendation systems.
  • CV parsing tools.
  • Skill inference applications.
  • Solutions for job matching and skill extraction.

Dataset Name Suggestions

  • Job Posting Skills Data
  • Recruitment Skill Set
  • Career Skills Database
  • Job Description Analysis Data
  • Skill Extraction for Jobs

Attributes

Original Data Source:job-skill-set

Listing Stats

VIEWS

1

DOWNLOADS

4

LISTED

05/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format