Video Stabilizer

TODO: add CI badges

An application of Lowe's Scale-Invariant Feature Transform (SIFT)¹ for feature detection and Fischler and Bolles' RAndom SAmple Consesus (RANSAC)² algorithm to compute robust homography matrices, used as part of an algorithm to stabilize shaky video footage.

Example

Original	Stabilized

How Does it Work?

When stabilizing a video, our goal is to remove unwanted camera motion (e.g. shaking or jittering) while preserving the intended motion (e.g. panning or zooming). The algorithm can be broken down in to these steps:

Detect Features: key features are detected in each video frame using feature detection techniques like SIFT, SURF, or ORB (in this project, we use Lowe's SIFT technique from the '04 paper on feature detection). These are distinct points in the frame, like corners or edges, that are easy to track across frames.

Match Features Across Frames: The features from one frame are matched to the corresponding features in the next frame. This match up allows us to estimate how the frame has shifted, rotated, or transformed between frames. Not all discovered key points are valid matches, so the invalid matches, pictured below in red, are discarded.

Estimate the Homography Matrix: Using the matched feature points and the RANSAC algorithm, a homography matrix (shortened to "H matrix") is computed for every pair of consecutive frames to model the transformation between them. This estimated transform matrix includes translations (i.e. how the camera moved), rotations (i.e. how the frame tilted), and other distortions like perspective shifts.
Stabilization: To stabilize the video, the calculated H matrix is applied to every pair of consecutive frames to correct the unwanted motion. If a frame shifted slightly, e.g. due to hand tremors, applying the H matrix would shift the frame back into alignment, as well as correcting rotation or perspective distortions.
Apply Smoothing: The calculated H matrices are smoothed out over several frames to ensure the stabilization is gradual and not abrupt. This prevents a harsh "jumping" effect in the video, leading to smoother output.
Crop the video: During stabilization, the algorithm attempts to counteract shaky movements by shifting the frames accordingly. These shifts can create gaps or borders (show below in green) around the edges of the video. To avoid showing these unwanted borders, stabilized videos are slightly cropped—a small trade-off to hide the artifacts introduced.

Before Cropping	After Cropping

Future Improvements

Attributions

The user interface was created using ImGui, and the stabilization algorithm was implemented using the OpenCV library.

Lowe, David G. "Distinctive image features from scale-invariant keypoints." International Journal of Computer Vision, vol. 60, no. 2, 2004, pp. 91-110. https://doi.org/10.1023/B:VISI.0000029664.99615.94 ↩
Fischler, Martin A., and Robert C. Bolles. "Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography." Communications of the ACM, vol. 24, no. 6, 1981, pp. 381-395. https://doi.org/10.1145/358669.358692. ↩

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
docs		docs
include		include
src		src
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
vcpkg-configuration.json		vcpkg-configuration.json
vcpkg.json		vcpkg.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Stabilizer

Example

How Does it Work?

Future Improvements

Attributions

About

Releases

Languages

License

tessapower/video-stabilizer

Folders and files

Latest commit

History

Repository files navigation

Video Stabilizer

Example

How Does it Work?

Future Improvements

Attributions

Footnotes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Languages