Face morphing is a process that involves the gradual transformation of one face into another. This technique is widely used in various fields such as animation, film, photography, and social media. It typically involves blending two or more facial images to create a smooth transition from one face to another. The process requires sophisticated algorithms to accurately detect and map facial features, ensuring a realistic and natural-looking morph.
The project consists of an app written in Java Android, using OpenCV and the native Dlib library.
OpenCV (Open Source Computer Vision Library) is an open-source computer vision and machine learning software library. It's widely used for various applications, including image and video Processing (providing tools for basic image processing operations, such as filtering, transformation, and analysis) and feature detection and matching, including algorithms for detecting and matching features in images, such as corners, edges, and contours. It also supports advanced machine learning algorithms for object detection, face recognition, and object tracking.
Dlib is a modern C++ toolkit that contains machine learning algorithms and tools for creating complex software in various domains, such as image processing, computer vision, and robotics. It is designed to be highly modular and efficient, allowing developers to build high-performance applications.
-
For the process to compile and embed Dlib in my Android Studio project, I am reffering to this online guide: https://ashiqulislamshaon.medium.com/setting-up-dlib-in-your-android-projects-using-android-studio-for-windows-and-mac-linux-3cb29273fdd0
-
To include OpenCV in my Android Studio project, I referred to this online guide (but any guide that brings the version of OpenCV you want is fine): https://medium.com/@sdranju/android-studio-step-by-step-guide-for-setting-up-opencv-sdk-4-9-on-android-740547f3260b
-
Finally, for the process and steps that lead to the morphing of the two faces, I am referring to the official guide of LearnOpenCV by @BIGVision: https://learnopencv.com/face-morph-using-opencv-cpp-python/
The guide reports the general steps to get a correct morphing result, including the code written in C++/alternatively in Python. In this project, I follow the theoretical steps to take two input images as an example, identify the facial key points and the correspondences in both, and follow the warping steps of the two images, to then merge them into a third.
git clone https://github.com/ptrespidi/face-morphing-java-android.git
Once you have downloaded the project, you can run the app and test its functionality through the FROM FILE button on the homepage. Inside face-morphing-java-android/app/src/main/res/drawable there are two images for testing (the same ones used in the official guide), namely ted_cruz.jpg and hillary_clinton.jpg. The CAMERA button has not been implemented, but what it should do is the same function, but enables the camera, takes two photos and places them in two imageViews, morphing them accordingly.
For the simplicity of the project, I did not focus on adding functionality to the app, or making it more aesthetically pleasing, but rather the aim is to show the technical functioning and the process that leads to the manipulation of the two images using algorithmic computer vision techniques. To test the app you can add some images in /drawable, and set them in the two imageViews
in /face-morphing-java-android/app/src/main/res/layout/testactivity.xml.
The images of Secretary Hillary Clinton and Senator Ted Cruz are in the Public Domain.
.
├──
│ └── app
│ ├── src
│ │ └── main
│ │ └── assets
│ │ ├── shape_predictor_68_face_landmarks_GTX.dat: Pre-trained model used by the Dlib library for facial landmark detection. This model has been trained to identify 68 specific points on the human face, which correspond to facial features such as eyes, eyebrows, nose, mouth, and facial contour.
│ │ ├── cpp
│ │ │ ├── dlib_android
│ │ │ │ ├── CMakeLists.txt: It configure the build for the project, include Dlib and link the native library.
│ │ │ │ ├── dlib_utils.hpp: It contains declarations for utility functions to initialize Dlib and detect facial landmarks.
│ │ │ │ └── dlib_face_result.cpp: It contains code to handle the face detection results and return them in a form that can be easily consumed from the Java side of the application.
│ │ ├── java
│ │ │ ├── com.android.facemorphing
│ │ │ │ ├── DlibResult.java: Java class that interacts with native libraries (C++) using JNI (Java Native Interface) to perform face detection with Dlib.
│ │ │ │ ├── FaceDetectorOpenCV.java: It uses OpenCV to detect faces in an image. It is designed to configure OpenCV, load the Haar cascade classifier for face detection, and provide a method for detecting faces in a bitmap image.
│ │ │ │ ├── Face.java: Designed to represent a detected face in an image. It contains information about specific positions of the face (landmarks), a lip mask, and other features related to face processing.
│ │ │ │ └── Test.java: It contains the face morphing process and steps.
│ │ └── jniLibs
│ │ └── arm64-v8a
│ │ └── libopencv_java3.so: OpenCV native library for Android that contains code compiled for the ARM64 architecture (arm64-v8a). Depending on the architecture of the Android device on which you decide to run and use the native libraries (arm64, armeabi, mips, x86), there is a different OS (see the other folders).
│ └── openCVLibrary3414: Folder of integrated OpenCV library.
└── External Libraries
└── Android API 30 Platform: The project is using the set of SDK libraries and resources corresponding to Android version 30 (Android R).
Let's see below the steps that lead to the morphing of the two faces (code is inside Test.java).
The dLibResult is responsible for handling the results of dlib's facial detection and recognition algorithms. After processing the image, we retrieve a list of detected faces using dLibResult.getFaces()
. For each face, we retrieve the positions of various facial landmarks (keypoints). We identify 68 facial landmarks using the pretrained shape_predictor_68_face_landmarks_GTX.dat.
// Processing the image
dLibResult.processFrame(img1);
ArrayList<Face> faces = dLibResult.getFaces();
// Iterating through faces
for (Face face : faces) {
// Obtaining facial position for each face
ArrayList<Position> facePositions = face.getPositions();
// Creating an array for current face
KeyPoint[] keypointsArray = new KeyPoint[facePositions.size()];
// Adding keypoints to an array
for (int i = 0; i < facePositions.size(); i++) {
Position position = facePositions.get(i);
keypointsArray[i] = new KeyPoint((float) position.getX(), (float) position.getY(), 5);
}
// Adding keypoints to Matofkeypoints
firstKeyPoints.fromArray(keypointsArray);
}
To consider the whole image, we manually add the 4 points in the corners, plus the ones in the middle:
// Adding points of angles of image and their middle
keyPointList1 = firstKeyPoints.toList();
Point topLeft = new Point(0, 0);
Point topRight = new Point(firstImage.cols() - 1, 0);
Point bottomLeft = new Point(0, firstImage.rows() - 1);
Point bottomRight = new Point(firstImage.cols() - 1, firstImage.rows() - 1);
Point middleTop = new Point((topLeft.x + topRight.x) / 2, (topLeft.y + topRight.y) / 2);
Point middleBottom = new Point((bottomLeft.x + bottomRight.x) / 2, (bottomLeft.y + bottomRight.y) / 2);
Point middleLeft = new Point((topLeft.x + bottomLeft.x) / 2, (topLeft.y + bottomLeft.y) / 2);
Point middleRight = new Point((topRight.x + bottomRight.x) / 2, (topRight.y + bottomRight.y) / 2);
Delaunay triangulation is a technique used in computational geometry to triangulate a set of points in a plane in such a way that no point is inside the circumcircle of any triangle formed by the points. In simpler terms, it creates a network of triangles connecting a given set of points such that the triangles do not overlap or have excessively acute angles. To apply the technique, and apply it to both images, we average the coordinates between the two sets of points, and use the new calculated coordinates:
int numPunti = arrayKeyPoints1.length;
KeyPoint[] arrayMediaKeyPoints = new KeyPoint[numPunti];
for (int i = 0; i < numPunti; i++) {
float xMedia = (float) ((arrayKeyPoints1[i].pt.x + arrayKeyPoints2[i].pt.x) / 2.0);
float yMedia = (float) ((arrayKeyPoints1[i].pt.y + arrayKeyPoints2[i].pt.y) / 2.0);
float sizeMedia = (float) ((arrayKeyPoints1[i].size + arrayKeyPoints2[i].size) / 2.0);
float angleMedia = (float) ((arrayKeyPoints1[i].angle + arrayKeyPoints2[i].angle) / 2.0);
float responseMedia = (float) ((arrayKeyPoints1[i].response + arrayKeyPoints2[i].response) / 2.0);
int octaveMedia = (arrayKeyPoints1[i].octave + arrayKeyPoints2[i].octave) / 2;
arrayMediaKeyPoints[i] = new KeyPoint(xMedia, yMedia, sizeMedia, angleMedia, responseMedia, octaveMedia);
}
Then, we exploit the subdiv2D object for Delaunay triangulation, which is a method to divide the plane into triangles with specific properties.
triangleList
is a MatOfFloat6
object, which will store the resulting triangles from the triangulation. Each triangle is represented by six floating-point values (the coordinates of its three vertices):
Size size = firstImage.size();
rect = new Rect(0, 0, (int) size.width, (int) size.height);
Subdiv2D subdiv2D = new Subdiv2D(rect);
FSkeyPoints = mediaKeyPoints.toArray();
FSarrayOfPoints = new ArrayList<>();
for(int i = 0; i < FSkeyPoints.length; i++) {
FSarrayOfPoints.add(FSkeyPoints[i].pt);
}
fsMatOfPoint2f = new MatOfPoint2f();
fsMatOfPoint2f.fromList(FSarrayOfPoints);
subdiv2D.insert(fsMatOfPoint2f);
triangleList = new MatOfFloat6();
subdiv2D.getTriangleList(triangleList);
org.opencv.core.Point[] pt = new org.opencv.core.Point[3];
triangles = triangleList.toArray();
Given two images I and J, we want to create an in-between image M by blending the two images. The blending is controlled by a parameter α between 0 and 1. When α is 1, M looks exactly like J. Naively, we can blend the images using the following equation at every pixel (x, y): M(x, y) = (1-α)I(x, y) + αJ(x, y).
Before doing this, we firstly need to establish pixel corrispondence between the two images. In other words, for every pixel (xi, yi) in image I, we need to find it's corresponding pixel (xj, yj) in image J. We calculate the location (xm, ym):
for(int i = 0; i < firstM2F.length; i++) {
double x, y;
x = (1 - alpha) * firstM2F[i].x + alpha * secondM2F[i].x;
y = (1 - alpha) * firstM2F[i].y + alpha * secondM2F[i].y;
TarrayOfPoints.add(new org.opencv.core.Point(x, y));
}
After obtaining the triangles indexes from the previous step, we extract Delaunay triangle indexes from the given triangleList and returns them as a MatOfInt. We loop through the triangles in triangleList, where each triangle is represented by six floats (presumably three pairs of x-y coordinates):
public static MatOfInt getDelaunayIndexes(MatOfFloat6 triangleList, MatOfPoint2f fsMatOfPoint2f) {
Point[] pointArray = fsMatOfPoint2f.toArray();
float[] trianglesArray = triangleList.toArray();
MatOfInt triangles = new MatOfInt();
for (int i = 0; i < trianglesArray.length; i += 6) {
int[] vertexes = new int[3];
for (int v = 0; v < 3; v++) {
double x = trianglesArray[i + v * 2];
double y = trianglesArray[i + v * 2 + 1];
for (int j = 0; j < pointArray.length; j++) {
if (x == pointArray[j].x && y == pointArray[j].y) {
vertexes[v] = j;
break;
}
}
}
triangles.push_back(new MatOfInt(vertexes));
}
return triangles;
}
The following method reshapes the vertices from a MatOfInt object into a format suitable for further processing. It ensures that the number of rows in the input vertices matrix is a multiple of 3.
We create a new MatOfInt
(reshapedVertices) with the appropriate dimensions to hold the reshaped vertices, and iterate through the rows of the input vertices matrix, each representing a triangle. For each triangle, we copy the vertex indices and converts them from double to int, and puts the reshaped vertex indices into the reshapedVertices matrix.
public static MatOfInt reshapeVertices(MatOfInt vertices) {
// Verifica se il numero di righe in vertices è multiplo di 3
if (vertices.rows() % 3 != 0) {
throw new IllegalArgumentException("Il numero di righe in vertices deve essere multiplo di 3.");
}
// Crea la nuova MatOfInt con 3 colonne
MatOfInt reshapedVertices = new MatOfInt();
int numRows = vertices.rows() / 3;
reshapedVertices.create(numRows, 3, CvType.CV_32S);
// Copia i dati da vertices alla nuova MatOfInt
for (int i = 0; i < numRows; i++) {
double[] row1 = vertices.get(i * 3, 0);
double[] row2 = vertices.get(i * 3 + 1, 0);
double[] row3 = vertices.get(i * 3 + 2, 0);
// Converte ogni elemento dell'array double[] a int
int[] intRow1 = new int[row1.length];
int[] intRow2 = new int[row2.length];
int[] intRow3 = new int[row3.length];
for (int j = 0; j < row1.length; j++) {
intRow1[j] = (int) row1[j];
intRow2[j] = (int) row2[j];
intRow3[j] = (int) row3[j];
}
reshapedVertices.put(i, 0, intRow1);
reshapedVertices.put(i, 1, intRow2);
reshapedVertices.put(i, 2, intRow3);
}
return reshapedVertices;
}
// Applying the morphing for each triangle
for (int i = 0; i < vertices.rows(); i++) {
int[] vertexIndexes = new int[3];
vertices.get(i, 0, vertexIndexes);
Point[] t1 = {firstM2F[vertexIndexes[0]], firstM2F[vertexIndexes[1]], firstM2F[vertexIndexes[2]]};
Point[] t2 = {secondM2F[vertexIndexes[0]], secondM2F[vertexIndexes[1]], secondM2F[vertexIndexes[2]]};
Point[] t = {thirdM2F[vertexIndexes[0]], thirdM2F[vertexIndexes[1]], thirdM2F[vertexIndexes[2]]};
morphTriangle(t1, t2, t);
}
We calculate the bounding rectangles for three given triangles t1, t2, and t. These rectangles enclose each triangle, defining their respective regions in the image:
private void morphTriangle(Point[] t1, Point[] t2, Point[] t) {
Rect r1 = Imgproc.boundingRect(new MatOfPoint(t1));
Rect r2 = Imgproc.boundingRect(new MatOfPoint(t2));
Rect r = Imgproc.boundingRect(new MatOfPoint(t));
Then, we adjust the points of the triangles t, t1, and t2 relative to the top-left corner of their respective bounding rectangles r, r1, and r2. We create new arrays tRect, t1Rect, and t2Rect to store the adjusted points:
// Sposta i punti rispetto all'angolo in alto a sinistra del rettangolo
Point[] t1Rect = new Point[3];
Point[] t2Rect = new Point[3];
Point[] tRect = new Point[3];
for (int i = 0; i < 3; i++) {
tRect[i] = new Point(t[i].x - r.x, t[i].y - r.y);
t1Rect[i] = new Point(t1[i].x - r1.x, t1[i].y - r1.y);
t2Rect[i] = new Point(t2[i].x - r2.x, t2[i].y - r2.y);
}
MatOfPoint tRectInt = new MatOfPoint();
tRectInt.fromArray(tRect);
We proceeding creating two empty images (warpImage1 and warpImage2) with the same dimensions as a bounding rectangle r. Then, we apply affine transformations to the regions defined by triangles t1Rect and t2Rect from images img1Rect and img2Rect, respectively, to align them with a target triangle tRect:
warpImage1 = Mat.zeros(r.height, r.width, img1Rect.type());
warpImage2 = Mat.zeros(r.height, r.width, img2Rect.type());
applyAffineTransform(warpImage1, img1Rect, t1Rect, tRect);
applyAffineTransform(warpImage2, img2Rect, t2Rect, tRect);
Finally, we perform the final steps of blending and copying triangular regions. We an empty matrix imgRect to hold the blended rectangular patches, blends the warped images warpImage1 and warpImage2 using alpha blending, and stores the result in imgRect. It multiplies the triangular region of the morphed image imgMorph with the mask, and adds the blended rectangular patch imgRect to the triangular region of the morphed image imgMorph:
// Alpha blend rectangular patches
Mat imgRect = new Mat();
Core.addWeighted(warpImage1, 1.0 - alpha, warpImage2, alpha, 0, imgRect);
// Copy triangular region of the rectangular patch to the output image
Imgproc.cvtColor(imgRect, imgRect, Imgproc.COLOR_BGRA2BGR);
Core.multiply(imgMorph.submat(r), mask, imgMorph.submat(r));
Core.add(imgMorph.submat(r), imgRect, imgMorph.submat(r));
It is possible to use a SeekBar
to control the α level dynamically to be used in the warping equation of the two images:
seekBarAlpha.setOnSeekBarChangeListener(new SeekBar.OnSeekBarChangeListener() {
@Override
public void onProgressChanged(SeekBar seekBar, int progress, boolean fromUser) {
alpha = progress / 100.0;
textViewAlpha.setText(getString(R.string.label_value, String.valueOf(alpha)));
generate_morphImage();
}
The result is as follows: