Computer Pointer Controller is an app that moves the mouse pointer according to the direction of the user's eye gaze and head pose. It can take a video file or a webcam feed as input and acts accordingly. It uses a combination of different models to produce the desired output.
Note: You must have OpenVINO successfully installed on your local system. Here is the installation guide for it.
You can run it on Linux, Windows, as well as macOS.
- After you are done with that, you can follow the steps mentioned below to run the project.
- Clone this repository:- https://github.com/DhruvKinger/Pointer-Controller
- Now you have to initialize the OpenVINO environment. You can do this with the following commands:
- cd C:\Program Files (x86)\IntelSWTools\openvino\bin\
- setupvars.bat
Note: One of the most important steps is downloading the models. You can skip this step, as I have already downloaded them and included them in the models folder.
- Still, if you want to download them yourself, delete the models folder from the downloaded zip file and follow the steps mentioned below.
- You can either download the models manually from here:- https://download.01.org/opencv/2020/openvinotoolkit/2020.3/open_model_zoo/models_bin/1/ or use the OpenVINO Model Downloader with the commands below:
- python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "face-detection-adas-binary-0001"
- python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "head-pose-estimation-adas-0001"
- python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "landmarks-regression-retail-0009"
- python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "gaze-estimation-adas-0002"
- Open a new terminal and run the following commands:
- cd C:\Program Files (x86)\IntelSWTools\openvino\bin\
- setupvars.bat
With these commands your path is initialized. Now change to the source (src) directory of the cloned project.
- cd /src
- Run the main.py file:
- python main.py -fd models\Face_detection\face-detection-adas-binary-0001.xml -fl models\Landmarks_detection\FP32\landmarks-regression-retail-0009.xml -hp models\Head_Pose\FP32\head-pose-estimation-adas-0001.xml -ge models\Gaze_Estimation\FP32\gaze-estimation-adas-0002.xml -i bin\demo.mp4 -l opt/intel/openvino/deployment_tools/inference_engine/lib/intel64/libinference_engined.dylib -d CPU -pt 0.6
- python main.py -fd 'Path of xml file of face detection model' -fl 'Path of xml file of facial landmarks detection model' -hp 'Path of xml file of head pose estimation model' -ge 'Path of xml file of gaze estimation model' -i 'Path of input video file or enter cam for taking input video from webcam' -d 'GPU'
- python main.py -fd 'Path of xml file of face detection model' -fl 'Path of xml file of facial landmarks detection model' -hp 'Path of xml file of head pose estimation model' -ge 'Path of xml file of gaze estimation model' -i 'Path of input video file or enter cam for taking input video from webcam' -d 'HETERO:FPGA,CPU'
Following are the command line arguments that can be used when running the main.py file (a minimal argparse sketch is shown after this list):
- -fd (required) : Path to the .xml file of the Face Detection model
- -fl (required) : Path to the .xml file of the Facial Landmarks Detection model
- -hp (required) : Path to the .xml file of the Head Pose Estimation model
- -ge (required) : Path to the .xml file of the Gaze Estimation model
- -i (required) : Path to the input video file, or enter cam to take input from the webcam
- -d (optional) : Target device to infer on; CPU, GPU, FPGA or MYRIAD is acceptable. It looks for a suitable plugin for the specified device (CPU by default).
- -l (optional) : Absolute path to a CPU extension library, if some layers of the models are not supported on the device.
- -pt (optional) : Probability threshold for the face detection model to accept a face detected in the video frame.
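For reference, these flags can be wired up with Python's argparse roughly as follows. This is only a sketch: the long option names, defaults and help text here are illustrative assumptions and may not match the repository's main.py exactly.

```python
# Possible argparse setup matching the flags listed above.
# Long option names and defaults are illustrative assumptions.
from argparse import ArgumentParser


def build_argparser():
    parser = ArgumentParser(description="Computer Pointer Controller")
    parser.add_argument("-fd", "--face_detection_model", required=True, type=str,
                        help="Path to the Face Detection model .xml file")
    parser.add_argument("-fl", "--facial_landmarks_model", required=True, type=str,
                        help="Path to the Facial Landmarks Detection model .xml file")
    parser.add_argument("-hp", "--head_pose_model", required=True, type=str,
                        help="Path to the Head Pose Estimation model .xml file")
    parser.add_argument("-ge", "--gaze_estimation_model", required=True, type=str,
                        help="Path to the Gaze Estimation model .xml file")
    parser.add_argument("-i", "--input", required=True, type=str,
                        help="Path to the input video file, or 'cam' for webcam")
    parser.add_argument("-d", "--device", default="CPU", type=str,
                        help="Target device: CPU, GPU, FPGA or MYRIAD")
    parser.add_argument("-l", "--cpu_extension", default=None, type=str,
                        help="Absolute path to a CPU extension library, if needed")
    parser.add_argument("-pt", "--prob_threshold", default=0.6, type=float,
                        help="Probability threshold for face detection")
    return parser


if __name__ == "__main__":
    args = build_argparser().parse_args()
    print(args)
```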
- The project directory contains a bin folder with a demo.mp4 file, which can be used as the input file for the project.
- It also has a requirements.txt file listing all the dependencies that need to be installed before running the project.
- The src folder in the project directory contains the following Python files:
- input_feeder.py: takes the input, such as a video file or a webcam feed, and yields frames for running inference.
- mouse_controller.py: takes the x, y coordinates produced by the gaze estimation step and moves the mouse.
- face_detection.py, head_pose_estimation.py, facial_landmarks_detection.py, gaze_estimation.py: each contains a class with functions to preprocess the inputs, run inference on them, and pass the results along so the mouse pointer can be moved (see the sketch after this list).
- main.py: the file the user runs to start the app.
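To illustrate what such a model class can look like, here is a minimal sketch of a generic wrapper around the OpenVINO 2020.x Inference Engine. The class name, method names and preprocessing details are assumptions for illustration; the actual files in src may be structured differently.

```python
# Minimal sketch of a model wrapper using the OpenVINO 2020.x Python API.
# Class/method names and preprocessing details are illustrative assumptions.
import cv2
from openvino.inference_engine import IECore


class GenericModel:
    def __init__(self, model_xml, device="CPU", extension=None):
        self.model_xml = model_xml
        self.model_bin = model_xml.replace(".xml", ".bin")
        self.device = device
        self.extension = extension

    def load_model(self):
        ie = IECore()
        if self.extension and "CPU" in self.device:
            ie.add_extension(self.extension, "CPU")
        self.net = ie.read_network(model=self.model_xml, weights=self.model_bin)
        self.exec_net = ie.load_network(network=self.net, device_name=self.device)
        self.input_name = next(iter(self.net.input_info))
        self.output_name = next(iter(self.net.outputs))
        self.input_shape = self.net.input_info[self.input_name].input_data.shape

    def preprocess_input(self, frame):
        # Resize to the model's expected H x W and reorder to NCHW.
        n, c, h, w = self.input_shape
        image = cv2.resize(frame, (w, h))
        return image.transpose((2, 0, 1)).reshape(n, c, h, w)

    def predict(self, frame):
        image = self.preprocess_input(frame)
        outputs = self.exec_net.infer({self.input_name: image})
        return outputs[self.output_name]
```

Note that the gaze estimation model takes several inputs (both eye crops plus the head-pose angles), so its real wrapper needs a richer predict() than this single-input sketch.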
- The benchmark results of running on CPU with multiple model precisions are:
- The benchmark results of running on GPU with multiple model precisions are:
- I have run my models on different hardware with different precisions. For the CPU, I tried 8 combinations with different precisions such as INT8, FP16 and FP32. I tried reducing the precision, but lowering precision also reduces accuracy.
Note: So when you use a lower precision model, you may get lower accuracy than with a higher precision model.
- I tried 8 combinations on the GPU as well.
- The GPU processed more frames, especially when the precision was FP16, because the GPU has several execution units and their instruction sets are optimized for 16-bit floating point data types.
- For the CPU, FP32 was the best fit; even though I tried reducing the precision, FP32 gave the best results whether comparing FPS, model load time or inference time.
- For the GPU, most combinations averaged about 2.08 FPS, while FP32 gave 2.62 FPS. Inference time averaged about 28 s across all combinations on the GPU.
- For the CPU, FPS averaged around 1.9 for the combinations I tried.
- I have built an inference pipeline that accepts both a video file and a webcam feed as input.
- The user selects the input via the command line arguments: the -i argument takes either a video file or the webcam. For a video file the command is -i "path of video file", whereas for the webcam it is -i "cam".
- Depending on the chosen option, the pipeline works accordingly.
- If the model cannot detect a face, it prints "Unable to detect the face" and reads the next frame, until a face is detected or the user closes the window.
- If more than one face is detected in a frame, the model takes the first detected face and uses it to control the mouse pointer (a sketch of this loop is shown below).
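To make this flow concrete, below is a minimal sketch of such a frame loop. The model objects, their predict() return values, the scaling of the gaze vector and the use of cv2/pyautogui are illustrative assumptions rather than the exact code in main.py.

```python
# Sketch of the main frame loop: read frames, detect a face, estimate gaze,
# and move the mouse. Model APIs and scaling factors are assumptions.
import cv2
import pyautogui


def run_pipeline(input_path, face_model, landmarks_model, head_pose_model, gaze_model):
    # "cam" switches the source to the default webcam; otherwise open the video file.
    cap = cv2.VideoCapture(0 if input_path == "cam" else input_path)

    while cap.isOpened():
        ok, frame = cap.read()
        if not ok:
            break  # end of stream

        faces = face_model.predict(frame)      # hypothetical: list of cropped face images
        if len(faces) == 0:
            print("Unable to detect the face")
            continue                           # try the next frame

        face_crop = faces[0]                   # use only the first detected face
        left_eye, right_eye = landmarks_model.predict(face_crop)
        head_pose_angles = head_pose_model.predict(face_crop)
        x, y = gaze_model.predict(left_eye, right_eye, head_pose_angles)

        # Move the pointer relative to its current position; the scaling and
        # inverted y-axis are arbitrary choices for illustration.
        pyautogui.moveRel(x * 100, -y * 100, duration=0.1)

        cv2.imshow("Computer Pointer Controller", frame)
        if cv2.waitKey(1) & 0xFF == 27:        # Esc closes the window
            break

    cap.release()
    cv2.destroyAllWindows()
```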