Search Engine Entity Cards

Bachelor Thesis (2021)
Author(s)

Y. Kalia (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

C. Hauff – Mentor (TU Delft - Web Information Systems)

George Iosifidis – Graduation committee member (TU Delft - Embedded Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2021 Yash Kalia
More Info
Publication Year
2021
Language
English
Graduation Date
01-07-2021
Awarding Institution
Delft University of Technology
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Search engine Entity Cards (ECs) display concise information from the web about a topic or subject in response to a user query. The topic or subject can be a person, an organization, etc., and is referred to as an “Entity”. The specific topic under research is how to determine which entity is most relevant for the query in terms of helping the user find the information he/she is looking for, and what information about the chosen entity to display to answer the query. The information can be in the form of, but is not limited to, text, images and hyperlinks. Research into the concepts of ECs focuses on different components of the EC widget, for example entity linking, tagging, extraction and fact summary generation. In the developed “EC algorithm” these concepts are combined into an implementation of an Entity Card widget and then evaluated. The EC algorithm utilizes tools such as DBpedia, DBpedia Spotlight and the Bing Web Search API to generate an entity ranking for a query. The results of evaluating the top-ranked entity imply that the EC algorithm retrieves, on average, a slightly to moderately relevant entity for the user. The fact retrieval algorithm had predictably worse results given the complexity of finding truly relevant facts about entities.

Files

RP_Report.pdf
(pdf | 1.44 Mb)
License info not available