Skip to content

Latest commit

 

History

History
7 lines (5 loc) · 381 Bytes

nyctaxi.md

File metadata and controls

7 lines (5 loc) · 381 Bytes

NYC Taxi Dataset

The integration tests and some examples refer to the "NYC Taxi" data set. This is a public data set containing information about yellow and green taxi trips in NYC.

  • The data can be downloaded in CSV format here.
  • Spark code for converting to Parquet can be found here.