Image-based Video Search Engine

Data Compression and Nearest Neighbour Search

More Info
expand_more

Abstract

One of the main problems with Instance-level Image Retrieval in video data is that for longer query videos or large amount of image queries, comparing all of the query images to every extracted frame is time-inefficient. This thesis aims to solve this problem by implementing Nearest Neighbour Search (NNS) algorithms and data compression methods, significantly reducing total comparison time. In most NNS use cases, the reference data is provided before reaching the user, allowing methods such as ANNOY or HNSW to partition the data beforehand. However, little research has been done into partitioning the data during run-time. In this thesis, the use of Nearest Neighbor Search and Data Compression methods are discussed for the purposes of matching a query image to a query video, both of which are provided at run-time. The result is an implementation of several state-of-the-art NNS and data compression methods in a system which, based on the amount of query images and the amount of extracted keyframes, selects the optimal comparison method to be used, as well as its optimal parameters if applicable.