Over the past few weeks, I explored Computer Vision using Python and Google’s MediaPipe library — running entirely on CPU.
I started with core concepts like landmark detection, hand & face tracking, pose estimation, and real-time image processing.
Then, I applied them to build these exciting real-time applications:
Tracks hand landmarks in real-time and counts the number of raised fingers.
Detects faces and maps 468 facial landmarks, useful for AR filters or expression analysis.
Tracks body posture and counts workout reps — a basic fitness assistant.
Controls system volume by moving fingers closer or farther apart.
Controls the mouse cursor and clicks using hand gestures.
| Hand Tracker & Finger Counter | Face Mesh | AI Personal Trainer |
|---|---|---|
![]() |
![]() |
![]() |
| Gesture Volume Control | AI Virtual Mouse |
|---|---|
![]() |
![]() |
- Landmark detection and tracking
- Pose estimation for human body movement
- Real-time image & gesture processing
- Applying computer vision in interactive AI applications
I’m excited to explore advanced Computer Vision & AI-powered vision systems.
💡 Suggestions for what I should build next are welcome!




