Opendatabay APP

Glassdoor Salary Prediction Data

LLM Fine-Tuning Data

Tags and Keywords

Salary

Jobs

Tech

Glassdoor

Employment

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Glassdoor Salary Prediction Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset provides a collection of tech job positions and associated salaries scraped from Glassdoor.com in 2017. Its primary purpose is to enable in-depth analysis of current trends within the tech job market, including the influence of company size on salaries and other pertinent job information.

Columns

The dataset is structured across several files, featuring a variety of attributes related to job postings and potential applicant characteristics:
  • job_id: A unique identifier for each job posting (Numeric).
  • job_state: The state where the job is located (String).
  • same_state: A binary indicator showing whether the job is in the same state as the person viewing it (String).
  • age: The age of the person viewing the job (Numeric).
  • python_yn: A binary indicator of whether the person viewing the job knows Python (String).
  • R_yn: A binary indicator of whether the person viewing the job knows R (String).
  • spark: A binary indicator of whether the person viewing the job knows Spark (String).
  • aws: A binary indicator of whether the person viewing the job knows AWS (String).
  • excel: A binary indicator of whether the person viewing the job knows Excel (String).
  • job_simp: A simplified job title (String).
  • seniority: The seniority level of the job (String).
  • desc_len: The length of the job description (Numeric).
  • num_comp: The number of competitors for the specific job (Numeric).
  • Salary Estimate: Estimated salary range for the job.
  • Job Description: The detailed text description of the job.
  • Rating: The company rating.
  • Company Name: The name of the hiring company.
  • Location: The physical location of the job.
  • Headquarters: The location of the company's headquarters.
  • Size: The size of the company by number of employees.
  • Founded: The year the company was founded.
  • Type of ownership: The legal ownership structure of the company.
  • Industry: The industry sector of the company.
  • Sector: The broader economic sector of the company.
  • Revenue: The company's reported revenue.
  • Competitors: A list of the company's known competitors.
  • hourly: A binary indicator if the salary is hourly.
  • employer_provided: A binary indicator if the salary is employer-provided.
  • min_salary: The minimum estimated salary for the job (Numeric).
  • max_salary: The maximum estimated salary for the job (Numeric).
  • avg_salary: The average estimated salary for the job (Numeric).
  • company_txt: Text field for company name.

Distribution

The dataset is provided in CSV format, with eda_data.csv being approximately 3.12 MB in size. It contains 742 records across its various files, offering detailed information for each entry.

Usage

This dataset is ideal for:
  • Analysing current trends based on job positions and company size within the tech industry.
  • Identifying factors that significantly influence data science salaries.
  • Determining states and cities that offer the highest-paying data science jobs.
  • Predicting job salaries for data science positions based on job descriptions and other features.
  • Investigating salary variations influenced by company size and other job-related data.

Coverage

The dataset primarily covers tech job postings from Glassdoor.com for the year 2017. Geographic coverage includes various states and cities within the United States, with notable concentrations in locations such as New York, NY and San Francisco, CA. The data offers insights into different company sizes, industries, and job seniorities.

License

**CC0 1.0 Universal (CC0 1.0) - Public Domain

Who Can Use It

This dataset is suitable for:
  • Data Scientists and Machine Learning Engineers for building predictive models for salary estimation.
  • HR Professionals and Recruiters seeking insights into salary benchmarks and job market trends.
  • Job Seekers looking to understand salary expectations and regional variations in tech roles.
  • Researchers and Analysts interested in exploring the dynamics of the tech job market.
  • Students learning about data analysis, statistics, and labour market economics.

Dataset Name Suggestions

  • Glassdoor Tech Job Salary Predictor
  • 2017 Tech Job Market Insights
  • Glassdoor Salary Prediction Data
  • US Tech Job Salaries 2017
  • Tech Industry Compensation Trends

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

08/07/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format