Opendatabay APP

Brazil Data Professionals Survey 2022

NLP / Natural Language Processing

Tags and Keywords

Brazil

Data

Jobs

Survey

Career

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Brazil Data Professionals Survey 2022 Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset presents the State of Data Brazil 2022 survey, offering a detailed overview of the Brazilian data job market. It is the result of a collaborative effort between Data Hackers, Brazil's largest data community, and Bain & Company, a global consultancy. The survey, conducted between 10 October and 28 November 2022, gathered insights from 4,271 respondents across Brazil. It includes indicators related to demographic profiles, education, sector performance, remuneration, job turnover, and satisfaction factors, including the impact of remote work preferences. The dataset reflects the views of various professional roles such as data analysts, data scientists, and data engineers, spanning junior, mid-level, senior, and managerial experience levels. It serves as a valuable resource for understanding the dynamics and characteristics of the Brazilian data professional landscape.

Columns

The dataset contains 353 columns, with column names identified by a tuple (e.g., ('P3a_1')), where the first identifier refers to the question and the second to the chosen alternative for multi-valued responses. Below are descriptions of a sample of the columns included:
  • ('P0', 'id'): Anonymised ID for each survey response.
  • ('P1_a ', 'Idade'): The respondent's age in years.
  • ('P1_a_1 ', 'Faixa idade'): The respondent's age presented as a range.
  • ('P1_b ', 'Genero'): The respondent's gender.
  • ('P1_c ', 'Cor/raca/etnia'): The respondent's self-declared colour, race, or ethnicity.
  • ('P1_d ', 'PCD'): Indicates if the respondent identifies as a Person with Disability.
  • ('P1_e ', 'experiencia_profissional_prejudicada'): Information on whether the respondent believes their professional experience has been negatively impacted by certain factors.
  • ('P1_f ', 'aspectos_prejudicados'): Details on specific aspects, such as selection processes or interviews, that may have been negatively impacted.
  • ('P1_g ', 'vive_no_brasil'): A Boolean indicating if the respondent lives in Brazil.
  • ('P1_i ', 'Estado onde mora'): The state in Brazil where the respondent resides.
The questionnaire was divided into eight parts covering: Demographic Data, Career Data, Challenges for Data Team Managers, Data Knowledge, Data Career Goals, Data Engineering Knowledge, Data Analysis Knowledge, and Data Science Knowledge. For privacy, the dataset has been anonymised, which involved removing outliers and, in some cases, only indicating the region for states with lower response rates.

Distribution

The dataset is provided as a CSV file, named State_of_data_2022.csv. Its size is approximately 9.8 MB. It contains 4,271 records (rows), representing the total number of survey respondents, and features 353 columns.

Usage

This dataset is ideal for:
  • Market research and trend analysis on the Brazilian data job market.
  • Academic studies focusing on labour market dynamics in the data sector.
  • Talent acquisition strategies for companies looking to hire data professionals in Brazil.
  • Professional development planning for individuals in the data field.
  • Participating in data analysis competitions, such as the State of Data Challenge 2022, which encourages analysis using this dataset.

Coverage

The dataset's geographic scope encompasses all of Brazil, with respondents from various states. The time range for the data is from 10 October to 28 November 2022, reflecting the survey period. Demographically, it covers 4,271 respondents and includes a variety of data professional roles and experience levels, alongside demographic profiles. Due to anonymisation for privacy, some specific data points, such as detailed state information for low-incidence regions, are generalised to regions.

License

Attribution-NonCommercial-ShareAlike 3.0 IGO (CC BY-NC-SA 3.0 IGO)

Who Can Use It

Intended users for this dataset include:
  • Data professionals and enthusiasts from the Data Hackers community.
  • Consulting firms like Bain & Company, for market analysis and strategic planning.
  • Researchers and academics studying labour markets and professional trends in data.
  • HR professionals and recruiters for understanding talent pools and compensation benchmarks.
  • Organisations aiming to promote and understand changes in the business landscape.
  • Participants in data challenges or competitions involving Brazilian market data.

Dataset Name Suggestions

  • State of Data Brazil 2022-2023 Survey
  • Brazilian Data Job Market Overview 2022
  • Brazil Data Professionals Survey 2022
  • Data Careers in Brazil 2022

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

1

LISTED

14/07/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format