Opendatabay APP

São Paulo State Air Pollution Analysis

Public Health & Epidemiology

Tags and Keywords

Pollution

Brazil

Health

Air

Environment

Trusted By
Trusted by company1Trusted by company2Trusted by company3
São Paulo State Air Pollution Analysis Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Data details air pollution measurements collected across the state of São Paulo, Brazil. This resource is valuable for performing exploratory data analysis aimed at developing a clear overview of the atmospheric pollution situation. By tracking pollutant concentrations over a long period, the material helps justify the environmental and public health impacts and allows researchers to verify the evolution of this significant problem. The dataset includes measurements for 9 distinct pollutants, such as MP10, O3, NO2, MP2.5, CO, SO2, NO, FMC, and PTS.

Columns

The dataset contains 10 attributes detailing the measurement events:
  • ID: The unique integer index identifying each specific record (primary key). This field is 100% valid across all 11 million records.
  • Data: The date on which the pollutant concentration measurement was taken. Records span from the beginning of 2015 to the end of 2021.
  • Hora: The specific time when the measurement was recorded.
  • Estação: The physical location or station where the measurement was performed. There are 87 unique stations identified, with 'Santos - Ponta da Praia' being one of the most frequently logged locations (3%).
  • Código: The specific code associated with the measuring station. There are 87 unique codes recorded, with 'SP64' being a common example (3%).
  • Poluente: The specific pollutant whose concentration was measured. There are 9 unique pollutants in the dataset, with MP10 (26%) and O3 (24%) being the most common.
  • Valor: The concentration value of the pollutant measured. All records are valid, with the mean value recorded at 83.3.
  • Unidade: The unit of concentration used for the measurement. This value is uniformly recorded as ug/m3 (100%).
  • Tipo: Indicates how the measurement was conducted (e.g., automatic or manual). The 'automatica' type accounts for 100% of the recorded entries.

Distribution

The material is stored as a CSV file named SP_poluicao_dados.csv, which has a large file size of 902.98 MB. The dataset structure includes over 10 million total lines (11.0m valid records). All documented columns maintain perfect data quality, showing 100% validity with no missing or mismatched entries.

Usage

This resource is best suited for conducting exploratory data analysis (EDA) related to air quality. It allows users to track the concentration of 9 key pollutants over time to assess the scale of environmental issues and their influence on public health. Analysts can specifically analyze pollutant concentration intervals against known health danger levels.

Coverage

The geographic scope covers the entire state of São Paulo, Brazil, utilising data from 87 distinct measuring stations. The time period recorded spans from the start of 2015 through to the end of 2021. The resource captures concentration data for nine primary atmospheric pollutants.

License

CC0: Public Domain

Who Can Use It

The dataset is intended for researchers, environmental analysts, and public health officials. It is highly suitable for studying urban areas, environmental trends, and the long-term impact of pollution on health. The material holds a maximum usability rating of 10.00.

Dataset Name Suggestions

  • São Paulo State Air Pollution Analysis
  • Brazil Air Quality Measurements 2015-2021
  • Exploratory Analysis of São Paulo Pollutants

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

17/12/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format