New York City Road Traffic Accident Data
Public Safety & Security
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides detailed information on motor vehicle collision events reported by the police across New York City (NYC). Each entry represents a single crash incident. The data is collected for collisions involving injuries, fatalities, or property damage exceeding $1000, as per police reporting requirements. It offers a crucial insight into road safety dynamics within the city.
Columns
- CRASH DATE: The date of the collision event, spanning from 01 July 2012 to 25 April 2023.
- CRASH TIME: The time of the collision event.
- BOROUGH: The borough in NYC where the collision occurred. Approximately 31% of entries have missing borough information, with Brooklyn being the most frequently recorded.
- ZIP CODE: The postal code of the collision location.
- LATITUDE: The geographical latitude coordinate of the collision.
- LONGITUDE: The geographical longitude coordinate of the collision.
- LOCATION: A textual description or coordinate pair indicating the collision site.
- ON STREET NAME: The name of the street on which the collision occurred. About 21% of this data is missing.
- CROSS STREET NAME: The name of the cross street involved in the collision. Roughly 37% of this data is missing.
- OFF STREET NAME: The name of any off-street location related to the collision. A significant portion (84%) of this data is missing.
- NUMBER OF PERSONS INJURED: The total number of individuals who sustained injuries in the collision, ranging from 0 to 43.
- NUMBER OF PERSONS KILLED: The total number of individuals who were fatally injured in the collision, ranging from 0 to 8.
- NUMBER OF PEDESTRIANS INJURED: The number of pedestrians injured in the collision, ranging from 0 to 27.
- NUMBER OF PEDESTRIANS KILLED: The number of pedestrians fatally injured in the collision, ranging from 0 to 6.
- NUMBER OF CYCLIST INJURED: The number of cyclists injured in the collision, ranging from 0 to 4.
- NUMBER OF CYCLIST KILLED: The number of cyclists fatally injured in the collision, ranging from 0 to 2.
- NUMBER OF MOTORIST INJURED: The number of motorists injured in the collision, ranging from 0 to 43.
- NUMBER OF MOTORIST KILLED: The number of motorists fatally injured in the collision, ranging from 0 to 5.
- CONTRIBUTING FACTOR VEHICLE 1: The primary contributing factor attributed to vehicle 1 in the collision. 'Unspecified' and 'Driver Inattention/Distraction' are common entries.
- CONTRIBUTING FACTOR VEHICLE 2: The contributing factor for vehicle 2. Many entries are 'Unspecified' or missing.
- CONTRIBUTING FACTOR VEHICLE 3: The contributing factor for vehicle 3. This column has a high percentage of missing data (93%).
- CONTRIBUTING FACTOR VEHICLE 4: The contributing factor for vehicle 4, with 98% missing data.
- CONTRIBUTING FACTOR VEHICLE 5: The contributing factor for vehicle 5, with 100% missing data.
- COLLISION_ID: A unique identifier for each collision event.
- VEHICLE TYPE CODE 1: The type of vehicle 1 involved in the collision. 'Sedan' and 'Station Wagon/Sport Utility Vehicle' are frequently seen.
- VEHICLE TYPE CODE 2: The type of vehicle 2 involved. Many entries are missing or are 'Sedan'.
- VEHICLE TYPE CODE 3: The type of vehicle 3 involved. This column has a high percentage of missing data (93%).
- VEHICLE TYPE CODE 4: The type of vehicle 4 involved, with 98% missing data.
- VEHICLE TYPE CODE 5: The type of vehicle 5 involved, with 100% missing data.
Distribution
The dataset is typically provided in a CSV format and includes 29 columns. The file size is approximately 423.58 MB. Each row within the dataset represents a distinct crash event. Many columns contain approximately 1.99 million valid records, though some location-based and additional vehicle factor columns exhibit a notable percentage of missing values. The dataset is expected to be updated annually.
Usage
This dataset is ideal for:
- Analysing traffic accident patterns and identifying high-risk locations within NYC.
- Investigating the contributing factors to motor vehicle collisions.
- Developing and evaluating road safety policies and interventions.
- Urban planning to improve infrastructure and pedestrian/cyclist safety.
- Academic research into transportation, public safety, and urban dynamics.
Coverage
The dataset covers motor vehicle collisions across New York City. The time range for collision dates extends from 01 July 2012 to 25 April 2023, with an expected annual update frequency. While specific demographics are not directly included, the data pertains to all police-reported motor vehicle collisions involving individuals across NYC boroughs. Data is available for collisions that meet specific reporting criteria, such as injuries, fatalities, or at least $1000 in property damage.
License
CC0: Public Domain
Who Can Use It
This dataset is suitable for:
- Data Scientists and Researchers looking to model accident probabilities or analyse trends.
- Urban Planners and Transportation Departments seeking to identify areas for infrastructure improvements.
- Public Safety Officials aiming to inform targeted intervention strategies and public awareness campaigns.
- Journalists and Policy Makers interested in understanding the impact of road safety initiatives.
Dataset Name Suggestions
- NYC Motor Vehicle Collisions Records
- New York City Road Traffic Accident Data
- NYC Crash Data Annual Archive
- Motor Vehicle Incidents NYC
Attributes
Original Data Source: New York City Road Traffic Accident Data