São Paulo State Air Pollution Analysis
Public Health & Epidemiology
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Data details air pollution measurements collected across the state of São Paulo, Brazil. This resource is valuable for performing exploratory data analysis aimed at developing a clear overview of the atmospheric pollution situation. By tracking pollutant concentrations over a long period, the material helps justify the environmental and public health impacts and allows researchers to verify the evolution of this significant problem. The dataset includes measurements for 9 distinct pollutants, such as MP10, O3, NO2, MP2.5, CO, SO2, NO, FMC, and PTS.
Columns
The dataset contains 10 attributes detailing the measurement events:
- ID: The unique integer index identifying each specific record (primary key). This field is 100% valid across all 11 million records.
- Data: The date on which the pollutant concentration measurement was taken. Records span from the beginning of 2015 to the end of 2021.
- Hora: The specific time when the measurement was recorded.
- Estação: The physical location or station where the measurement was performed. There are 87 unique stations identified, with 'Santos - Ponta da Praia' being one of the most frequently logged locations (3%).
- Código: The specific code associated with the measuring station. There are 87 unique codes recorded, with 'SP64' being a common example (3%).
- Poluente: The specific pollutant whose concentration was measured. There are 9 unique pollutants in the dataset, with MP10 (26%) and O3 (24%) being the most common.
- Valor: The concentration value of the pollutant measured. All records are valid, with the mean value recorded at 83.3.
- Unidade: The unit of concentration used for the measurement. This value is uniformly recorded as ug/m3 (100%).
- Tipo: Indicates how the measurement was conducted (e.g., automatic or manual). The 'automatica' type accounts for 100% of the recorded entries.
Distribution
The material is stored as a CSV file named
SP_poluicao_dados.csv, which has a large file size of 902.98 MB. The dataset structure includes over 10 million total lines (11.0m valid records). All documented columns maintain perfect data quality, showing 100% validity with no missing or mismatched entries.Usage
This resource is best suited for conducting exploratory data analysis (EDA) related to air quality. It allows users to track the concentration of 9 key pollutants over time to assess the scale of environmental issues and their influence on public health. Analysts can specifically analyze pollutant concentration intervals against known health danger levels.
Coverage
The geographic scope covers the entire state of São Paulo, Brazil, utilising data from 87 distinct measuring stations. The time period recorded spans from the start of 2015 through to the end of 2021. The resource captures concentration data for nine primary atmospheric pollutants.
License
CC0: Public Domain
Who Can Use It
The dataset is intended for researchers, environmental analysts, and public health officials. It is highly suitable for studying urban areas, environmental trends, and the long-term impact of pollution on health. The material holds a maximum usability rating of 10.00.
Dataset Name Suggestions
- São Paulo State Air Pollution Analysis
- Brazil Air Quality Measurements 2015-2021
- Exploratory Analysis of São Paulo Pollutants
Attributes
Original Data Source: São Paulo State Air Pollution Analysis
Loading...
