We designed and implemented a finger-tracking virtual mouse application that uses a regular webcam, and integrated a speech-to-text converter alongside it. While touchscreen technology is popular on mobile devices, it is costly for desktop systems. We therefore explored computer vision techniques as an alternative way to build virtual human-computer interaction devices: a mouse driven by a webcam and voice commands captured by a microphone.
The gesture-based desktop control system is built on a webcam and the MediaPipe library. Below are the essential mathematical concepts and formulas used in this system:
Distance Calculations:

- Euclidean Distance: To find the distance between two points (x1, y1) and (x2, y2) on a 2D plane, the following formula is used:

      distance = sqrt((x2 - x1)^2 + (y2 - y1)^2)

  This calculates the straight-line distance between the points.

  Code:

      dist = (self.hand_result.landmark[point[0]].x - self.hand_result.landmark[point[1]].x)**2
      dist += (self.hand_result.landmark[point[0]].y - self.hand_result.landmark[point[1]].y)**2
      dist = math.sqrt(dist)
- Signed Euclidean Distance: The signed distance between two points also encodes direction and is computed as:

      signed_distance = sign * sqrt((x2 - x1)^2 + (y2 - y1)^2)

  Here, sign is determined by comparing the y-coordinates of the points. If the y-coordinate of the first point is less than that of the second point, sign is set to 1; otherwise, it is set to -1.

  Code:

      sign = -1
      if self.hand_result.landmark[point[0]].y < self.hand_result.landmark[point[1]].y:
          sign = 1
      dist = (self.hand_result.landmark[point[0]].x - self.hand_result.landmark[point[1]].x)**2
      dist += (self.hand_result.landmark[point[0]].y - self.hand_result.landmark[point[1]].y)**2
      dist = math.sqrt(dist)
      return dist * sign
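The two distance formulas above can be tried outside the application with a minimal sketch. The `Point` tuple is a hypothetical stand-in for a MediaPipe landmark (only its `x` and `y` fields are used), and the function names are illustrative, not the project's own.

```python
import math
from collections import namedtuple

# Hypothetical stand-in for a hand landmark: only x and y are needed here.
Point = namedtuple("Point", ["x", "y"])


def euclidean_dist(p1, p2):
    """Straight-line distance between two 2D points."""
    return math.hypot(p2.x - p1.x, p2.y - p1.y)


def signed_dist(p1, p2):
    """Euclidean distance, positive when p1.y < p2.y and negative otherwise."""
    sign = 1 if p1.y < p2.y else -1
    return sign * euclidean_dist(p1, p2)


if __name__ == "__main__":
    first = Point(0.0, 0.0)
    second = Point(0.3, 0.4)
    print(round(euclidean_dist(first, second), 3))
    print(round(signed_dist(first, second), 3))   # first point has the smaller y
    print(round(signed_dist(second, first), 3))   # order swapped, so the sign flips
```

Swapping the two arguments changes only the sign, never the magnitude, which is what lets the sign carry direction.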
Ratio Calculation:

- Distance Ratio: To evaluate whether a finger is open or closed, the ratio of two distances is used:

      ratio = dist1 / dist2

  In this case, dist1 represents the distance from the fingertip to the middle knuckle, and dist2 is the distance from the middle knuckle to the base knuckle. A ratio greater than 0.5 typically indicates an open finger. In the code, dist holds dist1, and the result is rounded to one decimal place.

  Code:

      ratio = round(dist / dist2, 1)
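As a rough illustration of the ratio test, the sketch below classifies a finger from the two distances. The function names and the zero-distance guard are assumptions made for this example; the 0.5 threshold and one-decimal rounding come from the description above.

```python
def finger_ratio(dist1, dist2):
    """Ratio of the two segment distances, rounded to one decimal place."""
    if dist2 == 0:
        # Assumption for this sketch: treat a zero-length segment as "closed".
        return 0.0
    return round(dist1 / dist2, 1)


def is_finger_open(dist1, dist2, threshold=0.5):
    """True when the rounded ratio exceeds the threshold."""
    return finger_ratio(dist1, dist2) > threshold


if __name__ == "__main__":
    print(is_finger_open(0.12, 0.10))  # long first segment relative to the second
    print(is_finger_open(0.03, 0.10))  # short first segment relative to the second
```

Because both distances are measured on the same hand in the same frame, the ratio stays roughly stable as the hand moves closer to or farther from the camera.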
Pinch Gesture Control:

- Quantified Displacement: For pinch gestures, the displacement magnitude is used to adjust system settings such as volume and brightness:

      pinch_lv = current_position - start_position

  Here, current_position is the hand's current position, while start_position is the position where the pinch gesture began. In the code, the position is taken from landmark 8 (the index fingertip in MediaPipe's hand model), and the displacement is scaled by 10 and rounded to one decimal place so that it can be used as a control level.

  Code:

      pinchlv = round((hand_result.landmark[8].x - Controller.pinchstartxcoord) * 10, 1)
Cursor Movement Stabilization:

- Damping Adjustment: To smooth out cursor movements and reduce jitter, a damping factor is applied:

      ratio = factor * sqrt(distance)

  where distance is the squared change in cursor position (distsq in the code). The ratio depends on how far the cursor moved:

      ratio = 0                      if distsq <= 25 (movement ignored as jitter)
      ratio = 0.07 * sqrt(distsq)    if 25 < distsq <= 900
      ratio = 2.1                    if distsq > 900

  The last two cases agree at distsq = 900, since 0.07 * sqrt(900) = 2.1, so the ratio does not jump at that threshold.

  Code:

      distsq = delta_x**2 + delta_y**2
      ratio = 1
      if distsq <= 25:
          ratio = 0
      elif distsq <= 900:
          ratio = 0.07 * (distsq ** (1/2))
      else:
          ratio = 2.1
      x, y = x_old + delta_x * ratio, y_old + delta_y * ratio
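The damping rule can be wrapped in a small standalone function to see how each distance band behaves. This is a sketch under the thresholds listed above; the function name and the choice to pass old and new positions as arguments are assumptions for illustration.

```python
import math


def damped_position(x_old, y_old, x_new, y_new):
    """Return the stabilized cursor position for a move from old to new."""
    delta_x = x_new - x_old
    delta_y = y_new - y_old
    distsq = delta_x ** 2 + delta_y ** 2

    if distsq <= 25:
        ratio = 0.0                        # dead zone: ignore tiny movements
    elif distsq <= 900:
        ratio = 0.07 * math.sqrt(distsq)   # scale grows with the distance moved
    else:
        ratio = 2.1                        # constant gain for large movements

    return x_old + delta_x * ratio, y_old + delta_y * ratio


if __name__ == "__main__":
    print(damped_position(100, 100, 103, 104))  # distsq = 25, inside the dead zone
    print(damped_position(0, 0, 6, 8))          # distsq = 100, middle band
    print(damped_position(0, 0, 30, 40))        # distsq = 2500, constant gain
```

The dead zone keeps the cursor still while the hand trembles, and the gain above 1 for larger movements lets a short hand motion cover more of the screen.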
Pinch Gesture Levels:

- Vertical and Horizontal Levels: To assess the level of pinch gestures along different axes:

  - Vertical Pinch Level:

        pinch_ylv = pinchstart_y - current_y

    Code:

        dist = round((Controller.pinchstartycoord - hand_result.landmark[8].y) * 10, 1)

  - Horizontal Pinch Level:

        pinch_xlv = current_x - pinchstart_x

    Code:

        dist = round((hand_result.landmark[8].x - Controller.pinchstartxcoord) * 10, 1)

  Here, pinchstart_x and pinchstart_y are the coordinates at the start of the pinch gesture, and current_x and current_y are the current hand coordinates.
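A minimal sketch of both pinch levels computed together is shown below. The function name and argument layout are hypothetical; the scale factor of 10 and the one-decimal rounding follow the code lines above. The vertical level subtracts in the opposite order because image y-coordinates grow downward, so moving the hand up yields a positive level.

```python
def pinch_levels(start_x, start_y, current_x, current_y, scale=10):
    """Return (horizontal, vertical) pinch levels relative to the pinch start."""
    pinch_xlv = round((current_x - start_x) * scale, 1)  # rightward is positive
    pinch_ylv = round((start_y - current_y) * scale, 1)  # upward is positive
    return pinch_xlv, pinch_ylv


if __name__ == "__main__":
    # Pinch started at the frame centre, then moved right and up.
    print(pinch_levels(0.50, 0.50, 0.62, 0.35))
```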
These mathematical concepts and formulas enable the accurate detection and interpretation of hand gestures, allowing users to control various system functions through gestures effectively.
1. IDLE:- Python version 3.8 and above.
2. OS:- Windows 8 and above.
3. RAM:- 4GB and above.
4. WEBCAM:- A webcam is necessary for input.
{ If a webcam is not available, you can use a third-party application such as DroidCam, or any other tool that lets you use your mobile phone's camera. DroidCam must be installed on both the Android device and the PC. }
5. MICROPHONE
For best results, create a virtual environment.
- Navigate to your project directory:

      cd path/to/your/project

- Create a virtual environment. You can use the venv module that comes with Python:

      python -m venv venv

- Activate the virtual environment:

      .\venv\Scripts\activate

- Install the requirements:

      pip install -r requirements.txt

- Run the main.py file:

      python main.py
{ If a Python library fails to install, particularly with an error related to permissions or the install location, try the following steps:
Open Command Prompt as Administrator: Right-click the Command Prompt icon and select ‘Run as Administrator’.
Install the Library: Run pip install followed by the name of the problematic library. }
For Demo Video Click Here
Additionally, we have developed a feature to control the mouse from a mobile device's screen. When the user selects "Mobile Controller," a QR code is displayed on the PC screen. Scanning this code opens a browser page on the phone where the user can move the cursor by dragging a finger across the page. For this to work, both the PC and the mobile device must be on the same network. This mode currently supports only cursor movement and left click.
[Screenshots: QR Code to Scan, Mobile Browser Page]
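The same-network requirement suggests that the QR code carries an address on the local network that the phone can reach. The sketch below shows one way such an address could be built, under that assumption; the port 5000, the path, and both function names are hypothetical and not taken from the project.

```python
import socket


def local_ip(fallback="127.0.0.1"):
    """Best-effort lookup of this machine's address on the local network."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        # Connecting a UDP socket sends no packets; it only makes the OS
        # pick the outgoing interface, whose address is then read back.
        sock.connect(("10.255.255.255", 1))
        return sock.getsockname()[0]
    except OSError:
        return fallback
    finally:
        sock.close()


def controller_url(ip, port=5000, path="/"):
    """Build the page address a QR code could encode (port and path are assumed)."""
    return f"http://{ip}:{port}{path}"


if __name__ == "__main__":
    print(controller_url(local_ip()))
```

A phone on a different network cannot route to such a private address, which would explain why both devices need to share the same network.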
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Name | Email | Git ID |
---|---|---|
Vivek Kalwar | vivekkalwar95@gmail.com | Viivveekk |
Prabel Pandey | prabel397@gmail.com | HiPrabel |
Nadeem Khan | kk0078841@gmail.com | Luciferkhan007 |
MIT Licensed