Querying Sparse Matrices for Information Retrieval

Doctoral Thesis (2012)

Author(s)

R. Cornacchia

Contributor(s)

A.P. De Vries – Promotor

Keywords

Database, Information retrieval, Array database, Sparse matrices

To reference this document use:

https://resolver.tudelft.nl/uuid:d0ac16ca-3143-4a2f-9c7f-6e6eb480e6b5

Publication Year

2012

Abstract

For many years, information retrieval (IR) systems could be adequately described as applications that assign a relevance estimate to a document-query pair, with both document and query represented as a 'bag-of-words'. Implementing such search systems has been relatively straightforward, and most engineers code retrieval models directly on top of an inverted file structure. However, trends in research and industry motivate a reconsideration of this characterisation of IR. This thesis proposes an innovation in the search system engineering process: a layered approach, typical of database systems, which allows more flexibility in the IR system's architecture. The increased flexibility aims to reduce the effort of parametrising search functionalities for optimal effectiveness, so that they can be adapted to the work task and user context, optimised for specific types of content in the collection, and specialised to exploit domain knowledge. The thesis investigates a possible solution based on the array paradigm, which is used to model IR concepts and to bridge the gap with the underlying relational database layer. Finally, the proposed approach is evaluated in terms of flexibility and run-time efficiency.

Files

22283_thesis_20120502.pdf

(pdf | 2.9 Mb)

License info not available