Visual object tracking performance evaluation

Introduction

Visual tracking is one of the rapidly evolving fields of computer vision. Every year, literally dozens of new tracking algorithms are presented and evaluated in journals and at conferences. When considering the evaluation of these new trackers and comparison to the state-of-the-art, several questions arise. Is there a standard set of sequences that we can use for the evaluation? Is there a standardized evaluation protocol? What kind of performance measures should we use? Unfortunately, there are currently no definite answers to these questions. Unlike some other fields of computer vision, like object detection and classification, optical-flow computation, and automatic segmentation, where widely adopted evaluation protocols are used, visual tracking is still largely lacking these features.

Methodology

The problem of visual tracking evaluation is sporting an abundance of performance measures, which are used by various authors, and largely suffers from lack of consensus about which measures should be preferred. This is hampering the cross-paper tracker comparison and faster advancement of the field. In our research we show that several measures are equivalent from the point of information they provide for tracker comparison and, crucially, that some are more brittle than the others. Based on this analysis we narrow down the set of potential measures to only two complementary ones that can be intuitively interpreted and visualized, thus pushing towards homogenization of the tracker evaluation methodology. More details are presented in the paper Visual object tracking performance measures revisited, published in IEEE Transactions on Image processing. This paper also marks the beggining of our effort on promoting good evaluation methodology which evolved into VOT Innitiative. Within the innitiative we have further developed ranking methodology for large-scale visual tracker comparison that takes into account different aspects of tracking as well as statistical significance of performance difference. We made several iterations of the evaluation methodology that evolved with times and usage scenarios.

Visual Object Tracking Challenge (VOT)

The advances in evaluation methodology are promoted by the Visual Object Tracking Challenge that we organize. The results of the first VOT challenge in 2013 were presented at a workshop at the ICCV2013 in Sydney, Australia. In total, nine challenges were organized annually up to 2021. More details about the VOT Challenges can be found on VOT Challenge webpage.

Apparent Motion Patterns (AMP)

This approach uses omnidirectional videos to generate various motion patterns in a controlled manner. More information available here.

A Color and Depth Visual Object Tracking Dataset and Benchmark (CDTB)

A new color-and-depth visual object tracking dataset (CDTB) is recorded by several passive and active RGB-D setups and contains indoor as well as outdoor sequences acquired in direct sunlight. The sequences have been carefully recorded to contain significant object pose change, clutter, occlusion, and periods of long-term target absence to enable tracker evaluation under realistic conditions. Sequences are per-frame annotated with 13 visual attributes for detailed analysis.

Dataset and Experimental Results
The benchmark results and source code of all tested RGB-D trackers are available on the VOT Challenge 2019 webpage. The CDTB dataset can be downloaded automatically using the VOT evaluation toolkit (select vot2019_rgbd stack). More information about the RGB-D tracking benchmark can be found in the paper.

Alan Lukezic, Ugur Kart, Jani Kapyla, Ahmed Durmush, Joni-Kristian Kamarainen, Jiri Matas, Matej Kristan.
CDTB: A Color and Depth Visual Object Tracking Dataset and Benchmark.
The IEEE International Conference on Computer Vision (ICCV) 2019

Long-term Visual Object Tracking Performance Evaluation

A long-term visual object tracking performance evaluation methodology and a benchmark are proposed. Performance measures are designed by following a long-term tracking definition to maximize the analysis probing strength. The new measures outperform existing ones in interpretation potential and in better distinguishing between different tracking behaviors. We show that these measures generalize the short-term performance measures, thus linking the two tracking problems. Furthermore, the new measures are highly robust to temporal annotation sparsity and allow annotation of sequences hundreds of times longer than in the current datasets without increasing manual annotation labor. A new challenging dataset of carefully selected sequences with many target disappearances is proposed. A new tracking taxonomy is proposed to position trackers on the short-term/long-term spectrum. The benchmark contains an extensive evaluation of the largest number of long-term trackers and comparison to state-of-the-art short-term trackers. We analyze the influence of tracking architecture implementations to long-term performance and explore various re-detection strategies as well as influence of visual model update strategies to long-term tracking drift. The methodology is integrated in the VOT toolkit to automate experimental analysis and benchmarking and to facilitate future development of long-term trackers.
More information can be found in the paper.

Dataset and Experimental Results The benchmark results and source code of all tested long-term trackers are available on the VOT Challenge webpage (2018 or newer). The LTB50 dataset can be downloaded automatically using the VOT evaluation toolkit (use the votlt20xy stack)

Alan Lukežič, Luka Čehovin Zajc, Tomáš Vojíř, Jiří Matas, and Matej Kristan.
Performance Evaluation Methodology for Long-Term Single Object Tracking.
IEEE Transactions on Cybernetics, 2020

Publications

The Tenth Visual Object Tracking VOT2022 Challenge Results

Matej Kristan, Aleš Leonardis, Jiri Matas, Michael Felsberg, Roman Pflugfelder, Joni-Kristian Kamarainen, Hyung Jin Chang, Martin Danelljan, Luka Čehovin Zajc, et al.

ECCV Workshops 2022, 2022
The Ninth Visual Object Tracking VOT2021 Challenge Results

Matej Kristan, Jirı Matas, Aleš Leonardis, Michael Felsberg, Roman Pflugfelder, Joni-Kristian Kamarainen, Hyung Jin Chang, Martin Danelljan, Luka Čehovin Zajc, et al.

VOT2021 challenge workshop, ICCV workshops, 2021
The Eighth Visual Object Tracking VOT2020 Challenge Results

Matej Kristan, Aleš Leonardis, Jiri Matas, Michael Felsberg, Roman Pflugfelder, Joni-Kristian Kamarainen, Luka Čehovin Zajc, Martin Danelljan, Alan Lukezic, et al.

ECCV2020 workshops, 2020
The Seventh Visual Object Tracking VOT2019 Challenge Results

Matej Kristan, Jiri Matas, Aleš Leonardis, Michael Felsberg, Roman Pflugfelder, Joni-Kristian Kamarainen, Luka Čehovin Zajc, Ondrej Drbohlav, Alan Lukezic, et al.

ICCV 2019 workshops, 2019
The sixth Visual Object Tracking VOT2018 challenge results

Matej Kristan, Aleš Leonardis, Jiri Matas, Michael Felsberg, Roman Pfugfelder, Luka Čehovin Zajc, Tomas Vojir, Goutam Bhat, Alan Lukezic, et al.

VOT2018 workshop, ECCV2018, 2018
The Visual Object Tracking VOT2017 Challenge Results

Matej Kristan, Aleš Leonardis, Jiri Matas, Michael Felsberg, Roman Pflugfelder, Luka Čehovin Zajc, Tomas Vojir, Gustav Häger, Alan Lukežič, et al.

VOT workshop 2017, ICCV workshops 2017, 2017
TraX: The visual Tracking eXchange Protocol and Library

Luka Čehovin Zajc

Neurocomputing, 2017
A Novel Performance Evaluation Methodology for Single-Target Trackers

Matej Kristan, Jiri Matas, Aleš Leonardis, Tomas Vojir, Roman Pflugfelder, Gustavo Fernandez, Georg Nebehay, Fatih Porikli and Luka Čehovin Zajc

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016
The Visual Object Tracking VOT2016 challenge results

Matej Kristan, Aleš Leonardis, Jiri Matas, Michael Felsberg, Roman Pflugfelder, Luka Čehovin Zajc, Tomas Vojir, Gustav Häger, Alan Lukežič and Gustavo Fernandez

Computer Vision – ECCV 2016 Workshops, Springer, 2016
Visual object tracking performance measures revisited

Luka Čehovin Zajc, Aleš Leonardis and Matej Kristan

IEEE Transactions on Image Processing, 2016
The Visual Object Tracking VOT2015 challenge results

Matej Kristan, Jiri Matas, Aleš Leonardis, Michael Felsberg, Luka Čehovin Zajc, Gustavo Fernandez, Tomas Vojir, Gustav Häger, Georg Nebehay, et al.

Visual Object Tracking Workshop 2015 at ICCV2015, 2015
Is my new tracker really better than yours?

Luka Čehovin Zajc, Matej Kristan and Aleš Leonardis

WACV 2014: IEEE Winter Conference on Applications of Computer Vision, IEEE, 2014
The Visual Object Tracking VOT2014 challenge results

Matej Kristan, Roman Pflugfelder, Aleš Leonardis, Jiri Matas, Luka Čehovin Zajc, Georg Nebehay, Tomas Vojir, Gustavo Fernandez, Alan Lukežič, et al.

Visual Object Tracking Workshop 2014 at ECCV2014, 2014
The VOT2013 challenge: overview and additional results

Matej Kristan, Roman Pflugfelder, Aleš Leonardis, Jiri Matas, Fatih Porikli, Luka Čehovin Zajc, Georg Nebehay, Gustavo Fernandez and Tomas Vojir

Proceedings of the Nineteenth Computer Vision Winter Workshop (CVWW2014), 2014
TraX: Visual Tracking eXchange Protocol

Luka Čehovin Zajc

2014
The Visual Object Tracking VOT2013 challenge results

Matej Kristan, Roman Pflugfelder, Aleš Leonardis, Jiri Matas, Fatih Porikli, Luka Čehovin Zajc, Georg Nebehay, Gustavo Fernandez, Tomas Vojir, et al.

ICCV2013 Workshops, Workshop on Visual Object Tracking Challenge, 2013