Assessing a Dataset

A client has reached out to your team for assistance in analyzing the seasonal spending habits of taxi customers in New York City.

They want to know: Do yellow taxi passengers in New York City tip drivers more in the winter or summer?

Your team is currently in the Capturing phase of the Data Science Lifecycle, and you are responsible for managing the dataset. You have been provided with a notebook and data to examine.

In this directory, there is a notebook that uses Python to load yellow taxi trip data from the NYC Taxi & Limousine Commission. You can also open the taxi data file using a text editor or spreadsheet software like Excel.

Instructions

  • Evaluate whether this dataset contains enough data to answer the client's question.
  • Explore the NYC Open Data catalog. Identify an additional dataset that might be useful in addressing the client's question.
  • Formulate three questions to ask the client to clarify the request and deepen your understanding of the problem.

Refer to the dataset's dictionary and user guide for more details about the data.
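
One way to approach the first instruction is to check whether the data covers both seasons before comparing tips at all. The sketch below is only an illustration under several assumptions: the column names `tpep_pickup_datetime` and `tip_amount` are assumed to match the data dictionary, winter and summer are assumed to mean December to February and June to August, and the rows in `SAMPLE_CSV` are invented toy values, not real trip records. The helper names `season_of` and `summarize_tips` are hypothetical.

```python
import csv
import io
from collections import defaultdict
from datetime import datetime

# Toy rows invented for illustration only; they are NOT real TLC records.
# Column names are assumed to match the yellow taxi data dictionary.
SAMPLE_CSV = """tpep_pickup_datetime,fare_amount,tip_amount
2019-01-05 08:15:00,10.0,2.0
2019-01-20 22:40:00,20.0,3.0
2019-07-04 13:05:00,10.0,1.0
2019-07-18 18:30:00,30.0,6.0
2019-04-02 09:00:00,12.0,2.5
"""

WINTER_MONTHS = {12, 1, 2}  # assumption: confirm the definition with the client
SUMMER_MONTHS = {6, 7, 8}   # assumption: confirm the definition with the client


def season_of(timestamp: str) -> str:
    """Map a pickup timestamp to 'winter', 'summer', or 'other'."""
    month = datetime.strptime(timestamp, "%Y-%m-%d %H:%M:%S").month
    if month in WINTER_MONTHS:
        return "winter"
    if month in SUMMER_MONTHS:
        return "summer"
    return "other"


def summarize_tips(csv_file) -> dict:
    """Return the trip count and mean tip for each season found in the file."""
    tips = defaultdict(list)
    for row in csv.DictReader(csv_file):
        season = season_of(row["tpep_pickup_datetime"])
        tips[season].append(float(row["tip_amount"]))
    return {
        season: {"trips": len(values), "mean_tip": sum(values) / len(values)}
        for season, values in tips.items()
    }


summary = summarize_tips(io.StringIO(SAMPLE_CSV))

# If either season is missing, the data cannot answer the client's question.
missing = {"winter", "summer"} - set(summary)

print(summary)
print("Seasons with no trips:", missing)
```

To try this on the provided data, pass an open file handle for the taxi data file to `summarize_tips` in place of the `io.StringIO` wrapper. A missing season, or a very small trip count in one season, would suggest the dataset alone is not sufficient.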

Rubric

Exemplary | Adequate | Needs Improvement

Disclaimer:
This document has been translated using the AI translation service Co-op Translator. While we aim for accuracy, please note that automated translations may include errors or inaccuracies. The original document in its native language should be regarded as the authoritative source. For critical information, professional human translation is advised. We are not responsible for any misunderstandings or misinterpretations resulting from the use of this translation.