State of Data Brazil Industry Analysis
Government & Civic Records
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset presents a detailed mapping of the Brazilian data job market for 2023-2024, derived from the annual State of Data Brazil survey. This fourth edition of the survey, conducted by the Data Hackers community in partnership with Bain & Company between October and December 2023, gathered insights from 5,293 professionals across Brazil. It provides a valuable snapshot of the industry, covering demographic profiles, educational backgrounds, professional roles, compensation trends, job satisfaction factors (including the impact of remote work and layoffs), and a new focus on the adoption of Generative AI and Large Language Models (LLMs) within companies. The data is anonymised to protect respondent privacy, with certain outliers and geographical details generalised where necessary.
Columns
The dataset contains 399 columns, with examples including:
- P0_id: A unique identifier for each survey response.
- P1_a: The respondent's age in years.
- P1_a_1: The respondent's age range, such as '25-29' or '30-34'.
- P1_b: The respondent's gender, with options like 'Masculino' (Male) and 'Feminino' (Female).
- P1_c: The respondent's colour, race, or ethnicity, including 'Branca' (White) and 'Parda' (Brown).
- P1_d: Indicates if the respondent identifies as a Person with Disability (PCD).
- P1_e: Reflects whether the respondent believes their professional experience has been negatively impacted by certain factors.
- P1_e_1: A specific indicator if the respondent does not believe their professional experience is affected by the listed factors.
- P1_e_2: Indicates if professional experience is affected due to colour, race, or ethnicity.
- P1_e_3: Indicates if professional experience is affected due to gender identity.
- P1_e_4: Indicates if professional experience is affected due to being a Person with Disability (PCD). Many questions with multi-valued answers are represented across several columns, identified by a tuple format such as P3a_1 (Part 3, question (a), option (1)).
Distribution
This dataset is provided in a CSV format (df_survey_2023.csv) and has a file size of 15.24 MB. It comprises 5,293 individual records, each representing a survey respondent. The data is structured in a tabular format, with 399 columns in total, reflecting the extensive questionnaire covering eight distinct parts related to demographics, career, management challenges, and specific knowledge areas within data. Due to anonymisation, certain specific details like states with lower response rates have been generalised to their respective regions, and some outliers have been removed or transformed.
Usage
This dataset is ideal for:
- Analysing trends in the Brazilian data job market, including demographics, education, and compensation.
- Benchmarking salaries and career progression paths for data professionals in Brazil.
- Understanding the adoption and impact of Generative AI and LLMs in the workplace.
- Investigating factors influencing job satisfaction, turnover, and remote work in the data sector.
- Informing recruitment strategies, educational curriculum development, and public policy related to the data industry in Brazil.
- Academic research on labour market dynamics and technological impact in emerging economies.
Coverage
The dataset's geographic scope is Brazil, with responses gathered from professionals across the country. The time range of the survey data collection was between 16 October and 6 December 2023, representing the market state for late 2023 and early 2024. The demographic scope includes over 5,200 data professionals of various ages, genders, races/ethnicities, and disability statuses. It covers a wide array of professional roles, including data analysts, data scientists, and data engineers, as well as different experience levels from junior to senior and management. Note that some specific geographical information (for states with few responses) and outliers were anonymised.
License
Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
Who Can Use It
- Data Professionals: For career planning, understanding market demands, and comparing their profiles.
- Human Resources & Recruiters: To gain insights into talent pools, salary benchmarks, and retention strategies.
- Data Science Educators & Students: For curriculum development and understanding industry entry requirements and career paths.
- Market Researchers & Consultants: To conduct analyses on industry trends and inform business strategies.
- Policy Makers & Government Agencies: For developing initiatives to support the tech labour market and foster diversity.
Dataset Name Suggestions
- Brazilian Data Workforce Survey 2023-2024
- Brazil Data Professionals Outlook
- State of Data Brazil Industry Analysis
- Brazilian Data Ecosystem Report 2023
- Data Job Market Brazil Insights
Attributes
Original Data Source: State of Data Brazil Industry Analysis