ViLLarD
Maintenance of large databases based on visual information using incremental learning

applied research project

July 2014 - June 2017

Collaborating partners

University of Ljubljana, Faculty of Computer and Information Science
DFG CONSULTING informacijski sistemi d.o.o.

Funding

ARRS (L2-6765)

Researchers

Danijel Skočaj, PhD

Matej Kristan, PhD

Domen Tabernik, PhD

univ. dipl. inž. Matjaž Majnik

dr. Rok Mandeljc

mag. Peter Uršič

Domen Rački, PhD

mag. Tomaž Gvozdanović (DFG Consulting d.o.o)

univ. dipl. inž. Simon Jud (DFG Consulting d.o.o)

mag. Uroš Ranfl (DFG Consulting d.o.o)

dr. Rok Vezočnik (DFG Consulting d.o.o)

mag. Domen Smole (DFG Consulting d.o.o)

univ. dipl. inž. Marko Mahnič (DFG Consulting d.o.o)

Project overview

We live in the era of information abundance. However, rather than quantity, the central concern is becoming the quality and credibility of the acquired data. This is especially true for visual information databases. Although the field of computer vision has achieved a significant progress recently, the methods for automatic image interpretation are still not sufficiently reliable for fully autonomous annotation and maintenance of image and video databases (e.g. databases of detected objects). On the other hand, manual annotation of video sequences with relevant objects is very time consuming, expensive, as well as tedious and therefore prone to errors.

In this project we aspired to combine two approaches: computer-based automation of image interpretation that is necessary for database maintenance as well as suitable introduction of a human verifier into the loop. Such combination is of central importance for developing a methodology suitable for semi-automatic maintenance of traffic signalization records, which is partially our project’s practical goal. Even the database of such records for only state roads in the Republic of Slovenia may contain more than 250.000 entries, obtained by processing image sequences along with additional information. Automation is therefore crucial for continuous maintenance of such databases.

The main goal of the project was to develop a framework for semi-supervised incremental learning as well as specific methods for visual learning and recognition that will increase the quality and efficiency of large visual information databases maintenance. We developed efficient methods that address this problem. We based our research on both, hierarchical compositional models and on deep learning methods, as well as on methods that combine the best of both worlds. We also considered different kinds of context, i.e., temporal, spatial as well as semantic context, to narrow down the search area in the images to improve the recognition results. The main research contribution to science has been therefore made in the field of modelling visual information, more specifically in the development of methods for learning object representations for detection and recognition.

The developed methodology was applied to the use case of maintaining the records of traffic signalization, which is very suitable for evaluation of the developed algorithms. For this purpose we also built a comprehensive image database containing annotated traffic signs. We therefore also expect a significant contribution of our research towards improving the efficiency of traffic signalization monitoring that would in the long run significantly reduce the cost of some elements of the traffic infrastructure.

Publications:

Improving Traffic Sign Detection with Temporal Information

Domen Tabernik, Jon Muhovič, Alan Lukežič and Danijel Skočaj

Proceedings of the 23rd Computer Vision Winter Workshop, 2018
Spatially-Adaptive Filter Units for Deep Neural Networks

Domen Tabernik, Matej Kristan and Aleš Leonardis

Computer Vision and Pattern Recognition, 2018
Deformable Parts Correlation Filters for Robust Visual Tracking

Alan Lukežič, Luka Čehovin Zajc and Matej Kristan

IEEE Transactions on Cybernetics, 2017
Detekcija napak na površinah z uporabo anotiranih slik in globokim učenjem

Domen Tabernik and Danijel Skočaj

Proceedings of the 26th International Electrotechnical and Computer Science Conference, ERK 2017, 2017
Detekcija točkovnih horizontalnih prometnih znakov, Tehnično poročilo, TR-LUVSS-17/02

Jon Muhovič, Domen Tabernik and Danijel Skočaj

Technical Report, 2017
Learning part-based spatial models for laser-vision-based room categorization

Peter Uršič, Aleš Leonardis, Danijel Skočaj and Matej Kristan

International Journal of Robotics Reseach, Sage, 2017
Towards large-scale traffic sign detection and recognition

Peter Uršič, Domen Tabernik, Rok Mandeljc and Danijel Skočaj

Proceedings of the 22nd Computer Vision Winter Workshop, 2017
A Novel Performance Evaluation Methodology for Single-Target Trackers

Matej Kristan, Jiri Matas, Aleš Leonardis, Tomas Vojir, Roman Pflugfelder, Gustavo Fernandez, Georg Nebehay, Fatih Porikli and Luka Čehovin Zajc

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016
An integrated system for interactive continuous learning of categorical knowledge

Danijel Skočaj, Alen Vrečko, Marko Mahnič, Miroslav Janiček, Geert-Jan M. Kruijff, Marc Hanheide, Nick Hawes, Jeremy L Wyatt, Thomas Keller, et al.

Journal of Experimental & Theoretical Artificial Intelligence, 2016
Hierarchical Spatial Model for 2D Range Data Based Room Categorization

Peter Uršič, Aleš Leonardis, Danijel Skočaj and Matej Kristan

IEEE International Conference on Robotics and Automation (ICRA), 2016
Part-Based Room Categorization for Household Service Robots

Peter Uršič, Rok Mandeljc, Aleš Leonardis and Matej Kristan

IEEE International Conference on Robotics and Automation (ICRA), 2016
Towards Deep Compositional Networks

Domen Tabernik, Matej Kristan, Jeremy L. Wyatt and Aleš Leonardis

23rd International Conference on Pattern Recognition, 2016
Adding discriminative power to a generative hierarchical compositional model using histograms of compositions

Domen Tabernik, Aleš Leonardis, Marko Boben, Danijel Skočaj and Matej Kristan

Computer Vision and Image Understanding, 2015
Domain-specific adaptations for region proposals

Domen Tabernik, Rok Mandeljc, Danijel Skočaj and Matej Kristan

Proceedings of the 20th Computer Vision Winter Workshop, 2015, 2015
Efficient spring system optimization for part-based visual tracking

Alan Lukežič, Luka Čehovin Zajc and Matej Kristan

Proceedings of the 24th International Electrotechnical and Computer Science Conference (ERK), 2015
Filtering out nondiscriminative keypoints by geometry based keypoint constellations

Domen Rački and Matej Kristan

Proceedings of the 24th International Electrotechnical and Computer Science Conference (ERK), 2015
Quality of region proposals in traffic sign detection and recognition

Domen Tabernik, Rok Mandeljc and Danijel Skočaj

Proceedings of the 24th International Electrotechnical and Computer Science Conference, ERK 2015, 2015
Traffic sign classification with batch and on-line linear support vector machines

Rok Mandeljc, Domen Tabernik, Matej Kristan and Danijel Skočaj

Proceedings of the 24th International Electrotechnical and Computer Science Conference (ERK), 2015

Financer:

arrs