Welcome to the official code repository for the RobotX Computer Vision Workshop! This repo is designed to provide you with all the necessary code, resources, and project structures to master the fundamentals of computer vision and apply them to robotics technology.
This workshop is a hands-on journey into the world of AI-powered robotics. We will move from the basic principles of image processing to advanced topics like real-time object detection and Simultaneous Localization and Mapping (SLAM). Each module is designed to be practical, building on the previous one.
- Introduction to Computer Vision & Canny Edge Detection: Learn the fundamentals of image processing.
- Face Detection: Use pre-trained models to detect human faces in real-time.
- Object Detection with YOLO (Part 1): Get an introduction to the powerful YOLO algorithm.
- Object Detection with YOLO (Part 2): Fine-tune YOLO for custom object detection tasks.
- Visual SLAM: Explore how robots can map and navigate their environment using computer vision.
This workshop is designed for Python 3.8 to 3.11. We recommend using Python 3.10 for the best compatibility with all dependencies.
Note: Python 3.12+ may have compatibility issues with some dependencies (particularly dlib and face-recognition). If you're using Python 3.12, you may need to install these packages using conda instead of pip.
- OS: Windows 10+, macOS 10.14+, or Ubuntu 18.04+
- Memory: 4GB RAM minimum (8GB recommended)
- Storage: 2GB free disk space
- Camera: Optional (for real-time exercises, but not required)
-
Check Python Version: Ensure you're using Python > 3.8:
python --version
-
Clone the Repository:
git clone [Your Repository URL] cd [Your Repository Name] -
Set Up Your Environment: We recommend using a virtual environment to manage dependencies.
python -m venv venv source venv/bin/activate # On Windows, use `venv\Scripts\activate`
-
Install Dependencies:
Option A - Install all dependencies at once (recommended):
pip install -r requirements.txt
Option B - Install module by module: Navigate into each module's directory and install its specific requirements:
cd 01-intro-canny-edge pip install -r requirements.txt cd ../02-face-detection pip install -r requirements.txt # ... continue for each module
Alternative for Python 3.12 users: If you encounter dependency issues, try using conda:
conda create -n computer-vision python=3.10 conda activate computer-vision pip install -r requirements.txt
The repository is divided into folders, one for each topic. Inside each folder, you will find:
- A
README.mdfile with a detailed explanation of the topic. - A Python script (
.py) with example code. - A
requirements.txtfile with the necessary libraries.
Let's start building the future of robotics, right here in Berlin!