6.3 KiB
Installation Guide
This guide will help you set up your environment to work with the Data Science for Beginners curriculum.
Table of Contents
Prerequisites
Before starting, you should have:
- Basic knowledge of command line/terminal usage
- A GitHub account (free)
- A stable internet connection for the initial setup
Quick Start Options
Option 1: GitHub Codespaces (Recommended for Beginners)
The simplest way to get started is by using GitHub Codespaces, which provides a complete development environment directly in your browser.
- Go to the repository
- Click the Code dropdown menu
- Select the Codespaces tab
- Click Create codespace on main
- Wait for the environment to initialize (2-3 minutes)
Your environment is now ready with all dependencies pre-installed!
Option 2: Local Development
If you prefer working on your own computer, follow the detailed instructions below.
Local Installation
Step 1: Install Git
Git is required to clone the repository and manage your changes.
Windows:
- Download from git-scm.com
- Run the installer with default settings
macOS:
- Install via Homebrew:
brew install git - Or download from git-scm.com
Linux:
# Debian/Ubuntu
sudo apt-get update
sudo apt-get install git
# Fedora
sudo dnf install git
# Arch
sudo pacman -S git
Step 2: Clone the Repository
# Clone the repository
git clone https://github.com/microsoft/Data-Science-For-Beginners.git
# Navigate to the directory
cd Data-Science-For-Beginners
Step 3: Install Python and Jupyter
Python 3.7 or higher is required for the data science lessons.
Windows:
- Download Python from python.org
- During installation, check "Add Python to PATH"
- Verify installation:
python --version
macOS:
# Using Homebrew
brew install python3
# Verify installation
python3 --version
Linux:
# Most Linux distributions come with Python pre-installed
python3 --version
# If not installed:
# Debian/Ubuntu
sudo apt-get install python3 python3-pip
# Fedora
sudo dnf install python3 python3-pip
Step 4: Set Up Python Environment
It is recommended to use a virtual environment to keep dependencies isolated.
# Create a virtual environment
python -m venv venv
# Activate the virtual environment
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate
Step 5: Install Python Packages
Install the required data science libraries:
pip install jupyter pandas numpy matplotlib seaborn scikit-learn
Step 6: Install Node.js and npm (For Quiz App)
The quiz application requires Node.js and npm.
Windows/macOS:
- Download from nodejs.org (LTS version recommended)
- Run the installer
Linux:
# Debian/Ubuntu
# WARNING: Piping scripts from the internet directly into bash can be a security risk.
# It is recommended to review the script before running it:
# curl -fsSL https://deb.nodesource.com/setup_lts.x -o setup_lts.x
# less setup_lts.x
# Then run:
# sudo -E bash setup_lts.x
#
# Alternatively, you can use the one-liner below at your own risk:
curl -fsSL https://deb.nodesource.com/setup_lts.x | sudo -E bash -
sudo apt-get install -y nodejs
# Fedora
sudo dnf install nodejs
# Verify installation
node --version
npm --version
Step 7: Install Quiz App Dependencies
# Navigate to quiz app directory
cd quiz-app
# Install dependencies
npm install
# Return to root directory
cd ..
Step 8: Install Docsify (Optional)
For offline access to documentation:
npm install -g docsify-cli
Verify Your Installation
Test Python and Jupyter
# Activate your virtual environment if not already activated
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate
# Start Jupyter Notebook
jupyter notebook
Your browser should open with the Jupyter interface. You can now navigate to any lesson's .ipynb file.
Test Quiz Application
# Navigate to quiz app
cd quiz-app
# Start development server
npm run serve
The quiz app should be accessible at http://localhost:8080 (or another port if 8080 is already in use).
Test Documentation Server
# From the root directory of the repository
docsify serve
The documentation should be accessible at http://localhost:3000.
Using VS Code Dev Containers
If you have Docker installed, you can use VS Code Dev Containers:
- Install Docker Desktop
- Install Visual Studio Code
- Install the Remote - Containers extension
- Open the repository in VS Code
- Press
F1and select "Remote-Containers: Reopen in Container" - Wait for the container to build (only required the first time)
Next Steps
- Explore the README.md for an overview of the curriculum
- Read USAGE.md for common workflows and examples
- Check TROUBLESHOOTING.md if you encounter issues
- Review CONTRIBUTING.md if you want to contribute
Getting Help
If you run into issues:
- Check the TROUBLESHOOTING.md guide
- Search existing GitHub Issues
- Join our Discord community
- Create a new issue with detailed information about your problem
Disclaimer:
This document has been translated using the AI translation service Co-op Translator. While we aim for accuracy, please note that automated translations may contain errors or inaccuracies. The original document in its native language should be regarded as the authoritative source. For critical information, professional human translation is recommended. We are not responsible for any misunderstandings or misinterpretations resulting from the use of this translation.