Date: 2019-11-20 10:50

New York Taxi DataSet

In this exercise, we will explore a dataset of the taxi trips made in New York City in 2013. We will analyze a subset containing 0.5% of all trips (about 850,000 rides). Compressed, this subset is a little under 100 MB.

The NYC Taxi and Limousine Commission (TLC) is the technology provider for the trip data.

A data dictionary for the yellow taxi trip data is available from the TLC.

An interactive web application based on this dataset: http://hubcab.org

Assignment Details:

Please complete the following:

Download and unzip the NYC taxi dataset from Cyrille Rossant on GitHub: https://github.com/ipython-books/minibook-2nd-data

Open the notebook file attached below. Add your code to the existing code, include headers and comments, and keep your code well organized.

Please upload the data and display the data columns, number of rows, variable types, numeric statistics, and categorical variable frequencies.
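
One way to gather this overview with pandas is sketched below. It is only a sketch: the column names (`vendor_id`, `trip_distance`, `passenger_count`) and the file name mentioned in the comment are assumptions about the downloaded dataset, and `summarize` is a hypothetical helper, not part of the provided notebook.

```python
import pandas as pd


def summarize(df: pd.DataFrame) -> dict:
    """Collect columns, row count, dtypes, numeric stats, and frequencies."""
    numeric = df.select_dtypes(include="number")
    non_numeric = df.select_dtypes(exclude="number")
    return {
        "columns": list(df.columns),
        "n_rows": len(df),
        "dtypes": df.dtypes,
        # count, mean, std, min, quartiles, max for each numeric column
        "numeric_stats": numeric.describe(),
        # one frequency table per non-numeric column
        "frequencies": {c: non_numeric[c].value_counts() for c in non_numeric.columns},
    }


# Tiny made-up sample standing in for the real file. With the real data you
# would start from something like pd.read_csv("nyc_data.csv") (assumed name).
sample = pd.DataFrame({
    "vendor_id": ["CMT", "VTS", "CMT"],
    "trip_distance": [1.2, 0.8, 3.5],
    "passenger_count": [1, 2, 1],
})
overview = summarize(sample)
print(overview["columns"], overview["n_rows"])
print(overview["numeric_stats"])
```

On the full dataset, frequency tables for identifier-like columns can be very long, so it may be worth restricting them to the few columns that are genuinely categorical.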

Display a scatter plot of pickup locations. For which vendor is it easiest to find a cab?
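
A possible starting point for the scatter plot, drawing one point cloud per vendor so the two can be compared visually. The column names `pickup_longitude`, `pickup_latitude`, and `vendor_id` are assumptions about the dataset, the sample coordinates are made up, and `plot_pickups_by_vendor` is a hypothetical helper.

```python
import matplotlib.pyplot as plt
import pandas as pd


def plot_pickups_by_vendor(df: pd.DataFrame, ax=None):
    """Scatter pickup coordinates, one color per vendor."""
    if ax is None:
        _, ax = plt.subplots(figsize=(8, 8))
    # groupby sorts the vendor keys, so the legend order is stable
    for vendor, group in df.groupby("vendor_id"):
        ax.scatter(
            group["pickup_longitude"],
            group["pickup_latitude"],
            s=1,        # very small markers: the real data has many points
            alpha=0.3,  # transparency makes dense areas stand out
            label=str(vendor),
        )
    ax.set_xlabel("Pickup longitude")
    ax.set_ylabel("Pickup latitude")
    ax.legend(title="Vendor", markerscale=8)
    return ax


# Made-up coordinates roughly around Manhattan, for illustration only.
sample = pd.DataFrame({
    "vendor_id": ["CMT", "VTS", "CMT", "VTS"],
    "pickup_longitude": [-73.98, -73.97, -73.99, -73.95],
    "pickup_latitude": [40.75, 40.76, 40.74, 40.78],
})
ax = plot_pickups_by_vendor(sample)
```

The real data may contain a few coordinates far outside the city, so setting axis limits with `ax.set_xlim` and `ax.set_ylim` can help keep the plot readable.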

Display a histogram of trip distances. What is the most common trip distance?
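
The most common distance can be read off the histogram as its tallest bin. A minimal sketch, assuming the distances are in miles in a column such as `trip_distance` (an assumed name); `modal_bin` is a hypothetical helper, and the bin width of one mile is an arbitrary choice that changes the answer's resolution.

```python
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd


def modal_bin(values, bins):
    """Return (left edge, right edge, count) of the fullest histogram bin."""
    counts, edges = np.histogram(values, bins=bins)
    i = int(np.argmax(counts))
    return float(edges[i]), float(edges[i + 1]), int(counts[i])


# Made-up distances standing in for df["trip_distance"].
distances = pd.Series([0.4, 0.9, 1.1, 1.3, 1.8, 2.5, 6.0])
bins = np.arange(0, 11, 1)  # one-mile bins from 0 to 10 miles

fig, ax = plt.subplots(figsize=(8, 4))
ax.hist(distances, bins=bins)  # values outside the bin range are ignored
ax.set_xlabel("Trip distance (miles)")
ax.set_ylabel("Number of trips")

left, right, count = modal_bin(distances, bins)
print(f"Most common distance: {left} to {right} miles ({count} trips)")
```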

Display a histogram of the fare total amounts. What can you say about the data?
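
For the fare totals, a logarithmic count axis together with a comparison of mean and median is one way to make skew visible. The sketch below assumes the totals live in a column such as `total_amount` (an assumed name); the 100-unit cap, the bin width, and both helper functions are illustrative choices rather than anything prescribed by the assignment.

```python
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd


def plot_total_amounts(total: pd.Series, upper=100.0, bin_width=2.0, ax=None):
    """Histogram of totals; amounts above `upper` are piled into the last bin."""
    if ax is None:
        _, ax = plt.subplots(figsize=(8, 4))
    bins = np.arange(0.0, upper + bin_width, bin_width)
    ax.hist(total.clip(upper=upper), bins=bins)
    ax.set_yscale("log")  # keeps rare, expensive trips visible
    ax.set_xlabel("Total amount")
    ax.set_ylabel("Number of trips (log scale)")
    return ax


def center_and_skew(total: pd.Series) -> dict:
    """Mean, median, and sample skewness of the totals."""
    return {
        "mean": float(total.mean()),
        "median": float(total.median()),
        "skew": float(total.skew()),
    }


# Made-up totals standing in for the fare file's total column.
totals = pd.Series([6.5, 8.0, 9.5, 12.0, 57.3])
ax = plot_total_amounts(totals)
stats = center_and_skew(totals)
print(stats)
```

A mean well above the median, together with positive skewness, indicates a long right tail of expensive trips.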

How many unusually long trips (of greater than 100 miles) do you see?
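
Counting these trips comes down to a boolean filter. A minimal sketch, assuming distances are recorded in miles in a column named `trip_distance` (an assumed name); `count_long_trips` is a hypothetical helper.

```python
import pandas as pd


def count_long_trips(df: pd.DataFrame, threshold_miles=100.0, column="trip_distance"):
    """Count rows whose distance is strictly greater than the threshold."""
    return int((df[column] > threshold_miles).sum())


# Made-up sample; a trip of exactly 100 miles is not counted.
sample = pd.DataFrame({"trip_distance": [0.5, 2.0, 100.0, 120.4, 350.0]})
n_long = count_long_trips(sample)
print(n_long)
```

Inspecting the matching rows with `sample[sample["trip_distance"] > 100]` can help decide whether they are genuine long trips or recording errors.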

Please upload your notebook in either PDF or ipynb format. To convert your notebook to PDF, use the File > Export as PDF option in your browser.

NY TAXI EZ Source Code

Rubric

NY Taxi Assignment


Criteria and points:

Upload the data and display data columns, number of rows, variable types, and numeric statistics + categorical variable frequencies: 20.0 pts

Display a scatter plot of pick up locations. For which vendor is it easier to find a cab?: 20.0 pts

Display a histogram of trip distances. What is the most common trip distance?: 20.0 pts

Display a histogram of the fare total amounts. What can you say about the data?: 20.0 pts

How many unusually long trips (of greater than 100 miles) do you see?: 20.0 pts

Total Points: 100.0
