Opendatabay APP

NYC Taxi Passenger Anomaly Data

Data Science and Analytics

Tags and Keywords

Taxi

Nyc

Anomaly

Traffic

Passengers

Trusted By
Trusted by company1Trusted by company2Trusted by company3
NYC Taxi Passenger Anomaly Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset is designed for time-series analysis and anomaly detection, focusing on the number of taxi passengers in New York City. It highlights five distinct anomalies associated with significant events: the NYC marathon, Thanksgiving, Christmas, New Year's Day, and a snow storm. The raw data, originally from the NYC Taxi and Limousine Commission, has been aggregated into 30-minute buckets to provide a clear view of passenger traffic patterns.

Columns

  • timestamp: Represents the specific time when the passenger count was calculated. The data spans from 1st July 2014 to 1st February 2015, with 10.3k valid entries.
  • value: Indicates the total number of taxi passengers recorded during each 30-minute interval. Values range from 8 to 39.2k, with a mean of 15.1k and a standard deviation of 6.94k. There are 10.3k valid entries for this column.

Distribution

The dataset is provided in a CSV file named dataset.csv, with a size of 316.58 kB. It contains 3 columns, including the timestamp and value described above, and aggregates taxi passenger counts into 30-minute periods. The dataset includes 10.3k records.

Usage

This dataset is ideal for various analytical purposes, including time-series analysis, identifying unusual patterns in passenger numbers, and developing anomaly detection models. It can also be used for broader studies related to urban transportation and the impact of holidays or extreme weather on public transit.

Coverage

The dataset covers taxi passenger activity specifically within New York City. The time range extends from 1st July 2014 to 1st February 2015. It explicitly captures unusual spikes or drops in passenger numbers during specific events such as the NYC marathon, Thanksgiving, Christmas, New Year's Day, and a snow storm, offering insights into their impact on transportation.

License

CC0: Public Domain

Who Can Use It

This dataset is particularly useful for data scientists, analysts, urban planners, researchers in transportation and logistics, and anyone interested in studying urban mobility patterns or developing predictive models for passenger demand and anomaly detection.

Dataset Name Suggestions

  • NYC Taxi Passenger Anomaly Data
  • New York City Taxi Traffic Anomalies
  • NYC Taxi Ridership Time Series with Events
  • Urban Taxi Passenger Anomaly Detection Dataset

Attributes

Original Data Source: NYC Taxi Passenger Anomaly Data

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

31/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format