10 Weeks, 20 Lessons, Data Science for All!
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
Go to file
Jen Looper e1b4562ddf
file edits for DS curriculum
3 years ago
.github file edits for DS curriculum 3 years ago
00-sketchnotes organize directories 3 years ago
01-defining-data-science directory and doc formatting 3 years ago
02-ethics directory and doc formatting 3 years ago
03-defining-data defining data 3 years ago
04-stats-and-probability directory and doc formatting 3 years ago
data-science-in-the-cloud directory and doc formatting 3 years ago
data-science-in-the-wild directory and doc formatting 3 years ago
data-science-lifecycle directory and doc formatting 3 years ago
docs file edits for DS curriculum 3 years ago
quiz-app file edits for DS curriculum 3 years ago
translations file edits for DS curriculum 3 years ago
visualizations creating quiz app 3 years ago
working-with-data directory and doc formatting 3 years ago
.gitignore update readme 3 years ago
CODE_OF_CONDUCT.md
CONTRIBUTING.md file edits for DS curriculum 3 years ago
LICENSE Initial LICENSE commit 4 years ago
README.md file edits for DS curriculum 3 years ago
SECURITY.md Initial SECURITY.md commit 4 years ago
SUPPORT.md file edits for DS curriculum 3 years ago
docsifytopdf.js file edits for DS curriculum 3 years ago
for-teachers.md file edits for DS curriculum 3 years ago
index.html file edits for DS curriculum 3 years ago
package-lock.json file edits for DS curriculum 3 years ago
package.json file edits for DS curriculum 3 years ago

README.md

Data Science for Beginners - A Curriculum

Azure Cloud Advocates at Microsoft are pleased to offer a 12-week, 24-lesson curriculum all about Data Science. Each lesson includes pre-lesson and post-lesson quizzes, written instructions to complete the lesson, a solution, and an assignment. Our project-based pedagogy allows you to learn while building, a proven way for new skills to 'stick'.

Hearty thanks to our authors:

Getting Started

Teachers, we have included some suggestions on how to use this curriculum. We'd love your feedback in our discussion forum!

Students, to use this curriculum on your own, fork the entire repo complete the exercises on your own, starting with a pre-lecture quiz, then reading the lecture completing the rest of the activities. Try to create the projects by comprehending the lessons rather than copying the solution code; however that code is available in the /solutions folders in each project-oriented lesson. Another idea would be to form a study group with friends go through the content together. For further study, we recommend Microsoft Learn by watching the videos mentioned below.

Pedagogy

We have chosen two pedagogical tenets while building this curriculum: ensuring that it is project-based that it includes frequent quizzes. By the end of this series, students will have ...

In addition, a low-stakes quiz before a class sets the intention of the student towards learning a topic, while a second quiz after class ensures further retention. This curriculum was designed to be flexible fun can be taken in whole or in part. The projects start small become increasingly complex by the end of the 12 week cycle.

Find our Code of Conduct, Contributing, Translation guidelines. We welcome your constructive feedback!

Each lesson includes:

  • optional sketchnote
  • optional supplemental video
  • pre-lesson warmup quiz
  • written lesson
  • for project-based lessons, step-by-step guides on how to build the project
  • knowledge checks
  • a challenge
  • supplemental reading
  • assignment
  • post-lesson quiz

A note about quizzes: All quizzes are contained in this app, for 48 total quizzes of three questions each. They are linked from within the lessons but the quiz app can be run locally; follow the instruction in the quiz-app folder. They are gradually being localized.

Lessons

Project Name Concepts Taught Learning Objectives Linked Lesson Author
01 Defining Data Science TBD
02 Data Science Ethics TBD
03 Defining Data TBD
04 Introduction to Statistics Probability TBD
05 Working with Data Spreadsheets
06 Working with Data Relational Databases
07 Working with Data NoSQL
08 Working with Data Python Data
09 Working with Data Cleaning Transformations
10 Visualizing Data Quantities
11 Visualizing Data Distributions
12 Visualizing Data Proportions
13 Visualizing Data Relationships
14 Visualizing Data Making meaningful visualizations
15 The Data Science Lifecycle Capturing
16 The Data Science Lifecycle Processing
17 The Data Science Lifecycle Analyzing
18 The Data Science Lifecycle Communication
19 The Data Science Lifecycle Maintaining
20 Data Science in the Cloud TBD
21 Data Science in the Cloud TBD
22 Data Science in the Cloud TBD
23 Data Science in the Wild TBD
24 Data Science in the Wild TBD

Offline access

You can run this documentation offline by using Docsify. Fork this repo, install Docsify on your local machine, then in the root folder of this repo, type docsify serve. The website will be served on port 3000 on your localhost: localhost:3000.

PDF

A PDF of all of the lessons can be found here