Opendatabay APP

NYC 2017 Yellow Taxi Trip Data

Data Science and Analytics

Tags and Keywords

Taxi

Limousine

Nyc

Fare

Trip

Trusted By
Trusted by company1Trusted by company2Trusted by company3
NYC 2017 Yellow Taxi Trip Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Provides detailed trip and fare information for taxi and limousine services within New York City during the year 2017. The data originates from the New York City Taxi and Limousine Commission (TLC), the agency responsible for licensing and regulating the city's for-hire vehicles. The original data is drawn from millions of combined daily trips made by over 200,000 taxi and limousine licensees. This resource captures critical details related to pickup and dropoff times, distance travelled, passenger count, payment types, and all associated fare components, including taxes and surcharges. Please note that this particular dataset was prepared for pedagogical purposes and may not fully reflect typical New York City taxi cab riders' behaviour.

Columns

  • ID: Trip identification number.
  • VendorID: A code indicating the TPEP provider that supplied the record (1=Creative Mobile Technologies, LLC; 2=VeriFone Inc.).
  • tpep_pickup_datetime: The date and time when the taximeter was engaged.
  • tpep_dropoff_datetime: The date and time when the taximeter was disengaged.
  • Passenger_count: The number of passengers in the vehicle, recorded by the driver.
  • Trip_distance: The elapsed trip distance in miles, reported by the taximeter.
  • PULocationID: The TLC Taxi Zone ID in which the taximeter was engaged (pickup location).
  • DOLocationID: The TLC Taxi Zone ID in which the taximeter was disengaged (dropoff location).
  • RateCodeID: The final rate code effective at the end of the journey, including standard rate, JFK, Newark, Nassau or Westchester, Negotiated fare, and Group ride.
  • Store_and_fwd_flag: Indicates whether the trip record was temporarily held in vehicle memory (Y=store and forward trip, N=not a store and forward trip).
  • Payment_type: A numeric code signifying the method of payment (1=Credit card, 2=Cash, 3=No charge, 4=Dispute, 5=Unknown, 6=Voided trip).
  • Fare_amount: The time-and-distance fare amount calculated by the meter.
  • Extra: Miscellaneous extras and surcharges, such as $0.50 and $1 rush hour and overnight charges.
  • MTA_tax: The $0.50 MTA tax automatically applied based on the metered rate.
  • Improvement_surcharge: The $0.30 improvement surcharge assessed on trips at the flag drop, which began being levied in 2015.
  • Tip_amount: The gratuity amount (automatically populated for credit card tips only; cash tips are not included).
  • Tolls_amount: The total amount of all tolls paid during the trip.
  • Total_amount: The final amount charged to passengers, excluding any cash tips.

Distribution

The data file is typically stored in CSV format. The sample data provided contains 18 columns and is observed to have approximately 22.7 thousand valid records, with a total file size of 2.3 MB. The data is expected to be updated annually.

Usage

Ideal applications include analysing urban mobility patterns across New York City zones, creating predictive models for forecasting trip distances or total fare amounts, studying the adoption rates of different payment types, and assessing the impact of various surcharges (like rush hour extras and MTA taxes) on the final cost of a ride.

Coverage

The data focuses geographically on taxi and limousine trips within New York City. The temporal scope covers the entire year of 2017, running from January 1st through to December 31st.

License

CC0: Public Domain

Who Can Use It

The dataset is highly useful for transportation researchers seeking to understand urban travel logistics, data scientists developing dynamic pricing algorithms, urban planners studying population flow between specific TLC Taxi Zones, and students undertaking educational projects related to regression and geographical analysis of large trip records.

Dataset Name Suggestions

  • NYC 2017 Yellow Taxi Trip Data
  • New York City TLC Fare & Trip Data 2017
  • NYC Taxi Records 2017

Attributes

Original Data Source: NYC 2017 Yellow Taxi Trip Data

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

29/10/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in ZIP Format