Steps: - load station data - label using kmeans model - bucket into 5min intervals - aggregate into odt - store

Load Station Locations

wget https://raw.githubusercontent.com/huevosabio/notebooks/master/notebooks/assets/manhattan_demands_50.json -P /media/Big_Data/rdit/nyc-tlc/manhattan/