OpenCV Kalman filter tips

3 min readMar 12, 2024

While object detection can provide us with information about where objects are inside one frame, when we try to associate the same object between multiple frames, detection itself might not be enough. If we use a naive approach, for example checking if there are overlapped bounding boxes in consecutive frames, it may work in some cases, but what if an occlusion happens? What if our detection doesn’t work well when in some aspect angle? This is where tracking methods come in handy, they can help us create a hypothetical trajectory to which direction a “lost” object may have been moving.

While there are many exotic machine learning visual multi-object tracking methods readily available on the internet. For many edge devices, that require less computation load, lighter methods are much more desirable. This is why Kalman filter may be considered a good potential method.

This article is mainly about some digging I did to utilize OpenCV’s Kalman filter.

The code

While there is a one-value Kalman filter example given by Opencv, examples of two values (namely x and y for a single point in a frame) are mostly in Python.

// Initialze
cv::KalmanFilter kalman = cv::KalmanFilter(4,2,0) ;

// Setting the states
kalman.statePre = (cv::Mat_<float>(4, 1) << StartX + StartW / 2, StartY + StartH / 2, 0, 0);
kalman.statePost = (cv::Mat_<float>(4, 1) << StartX + StartW / 2, StartY + StartH / 2, 0, 0);

kalman.transitionMatrix = (cv::Mat_<float>(4, 4) << 1, 0, 1, 0, 0, 1, 0, 1, 0, 0, 1, 0, 0, 0, 0, 1);
kalman.processNoiseCov = (cv::Mat_<float>(4, 4) << 0.3f, 0, 0, 0, 0, 0.3f, 0, 0, 0, 0, 0.3f, 0, 0, 0, 0, 0.3f);
kalman.measurementMatrix = (cv::Mat_<float>(2, 4) << 1, 0, 0, 0, 0, 1, 0, 0);
kalman.measurementNoiseCov = cv::Mat::eye(2, 2, CV_32F);

Why it is initiated with cv::KalmanFilter(4,2,0)? The “4” is for dynamic parameters, “2” for measurement parameters, and “0” for control parameters. Since my usage doesn’t involve actual inputting to affect the result, “0” is put into control parameters. “2” for measurement parameters represents X and Y, the center point of our tracked object, and “4” means X, y, 🛆X and 🛆Y. Thus, if we only want to estimate a single value we should use cv::KalmanFilter(2,1,0).

Then we set “statePre” and “statePost” which are the predicted and corrected result. By assigning our position in the X and Y, we set our initial position into the function.

Other parameters require you to dig further into how the Kalman filter works, “transitionMatrix” is the state transition model applied to the previous state. “processNoiseCov” is the process noise. “measurementMatrix” is the observation model, and finally “measurementNoiseCov” is the observed noise.

Voila, initiating is done and we can now start to use it.

Predict with

cv::Mat Predictions= Predictions = kalman.predict();
// Predictions.at<float>(0) is x
// Predictions.at<float>(1) is y

Correct with

// put your x,y into a cv::Mat
// e.g. cv::Mat Center = cv::Mat_<float>(2, 1) << x + w / 2, y + h / 2);
kalman.correct(Center);

Then we are done.

Epilogue

Now we can use Kalman filter for a single point in a picture.

I have a sample code that can provide a more comprehensive view of how it can be used in code, it is built on the ONNXruntime example I created earlier.

Thank you for your watching. Bye!

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Opencv

Written by Scott Jin

8 Followers

2 Following

Graduate student from Taiwan in Computer Science at the University of California, Riverside. Passionate about HPC, ML, and embedded software development.

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

Recommended from Medium

YOLOv12: Redefining Real-Time Object Detection 🚀

Henry Navarro

YOLOv12: Redefining Real-Time Object Detection 🚀

Introducing the Pioneering Features and Performance of YOLOv12 from the Latest Research

Feb 19

195

3D Gaussian Splatting model of Sascha Kirch

TDS Archive

Sascha Kirch

Turn Yourself into a 3D Gaussian Splat

A Hands-on Guide for Practitioners

Mar 14, 2024

536

Lists

Staff picks

827 stories1648 saves

Stories to Help You Level-Up at Work

19 stories948 saves

Self-Improvement 101

20 stories3355 saves

Productivity 101

20 stories2819 saves

Understanding Vision Transformers: A Game-Changer in Computer Vision

Generative AI

Nick Pai

Understanding Vision Transformers: A Game-Changer in Computer Vision

When you think about computer vision, CNNs (Convolutional Neural Networks) likely come to mind as the go-to architecture. However, recent…

Oct 24, 2024

Comprehensive Guide to Real-Time Car License Plate Detection with YOLO, .bt

Mohindra Jain

Comprehensive Guide to Real-Time Car License Plate Detection with YOLO, .bt

License plate detection has broad applications, from automated traffic management to secure entry systems. This guide walks you through…

Nov 1, 2024

How To Train Your PyTorch Models (Much) Faster

Level Up Coding

Sahib Dhanjal

How To Train Your PyTorch Models (Much) Faster

Tips and tricks I learnt while working with the best in the industry

Feb 10

705

Lidar series (1): principle, classification and development trend

PointCloud-Slam-Image-Web3

Lidar series (1): principle, classification and development trend

This is the first article in the LiDAR series, which mainly introduces the basic principle, classification and development trend of LiDAR.

Dec 12, 2024

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams