Opendatabay APP

Student Placement Prediction Dataset

Education & Learning Analytics

Tags and Keywords

Placement

Student

Prediction

Academic

Career

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Student Placement Prediction Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset aims to predict whether a student will be placed or not within a company. It provides essential information regarding students' academic performance, training, and placement status, serving as a valuable resource for developing predictive models. The core objective is to determine the 'PlacementStatus' of students, which is categorised as either 'placed' or 'not placed'.

Columns

  • StudentID: A unique identifier for each student, ranging from 1 to 10000.
  • CGPA: Represents the student's overall grades, with values typically between 6.5 and 9.1.
  • Internships: Indicates the number of internships a student has completed, usually 0 to 2.
  • Projects: Specifies the number of academic projects undertaken by a student, ranging from 0 to 3.
  • Workshops/Certifications: Denotes the count of workshops or certifications a student has acquired, typically between 0 and 3.
  • AptitudeTestScore: The score obtained in aptitude tests, usually falling between 60 and 90.
  • SoftSkillsRating: A rating of a student's soft skills, particularly communication, on a scale of 3 to 4.8.
  • ExtraCurricularActivities: A boolean indicator (true/false) showing whether a student participates in extracurricular activities.
  • PlacementTraining: A boolean indicator (true/false) denoting if a student has received placement training.
  • SSC_Marks: Marks obtained in Senior Secondary examinations, typically from 55 to 90.
  • HSC_Marks: Marks obtained in Higher Secondary examinations, ranging from 57 to 88.
  • PlacementStatus: The target column, indicating whether the student is 'Placed' or 'NotPlaced'.

Distribution

This dataset is provided as a CSV file, named placementdata.csv, with a size of 447.3 kB. It comprises 12 distinct columns and contains 10,000 records. A sample file will be made available on the platform separately.

Usage

This dataset is ideal for building and evaluating machine learning models, particularly for student placement prediction using techniques like CatBoost. It is also suitable for data visualisation, classification tasks, and exploratory data analysis to uncover insights into factors influencing student placements.

Coverage

The dataset focuses on student academic and training profiles, including their performance in CGPA, internships, projects, and various assessment scores. While specific geographic or time-range details are not provided, the data broadly covers student demographics relevant to their placement prospects.

License

CC0: Public Domain

Who Can Use It

This dataset is highly valuable for data scientists, machine learning engineers, and researchers interested in predictive analytics within the educational sector. It is particularly well-suited for beginners looking to practice with real-world classification problems. Educational institutions and career counselling services could also utilise this data to better understand placement trends and advise students.

Dataset Name Suggestions

  • Student Placement Prediction Dataset
  • Academic and Placement Status Data
  • CatBoost Student Placement Predictor
  • Higher Education Placement Outcomes

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

19/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format