Query Answerability Classifier for Direct Answer Module in Web Search Engines
More Info
expand_more
expand_more
Abstract
In order determine when we can show direct answer module to user queries in web search engine, an independent classifier is designed in this study to assess the answerability of each user query. Real user queries are sampled from MS MARCO Question Answering and Natural Langauge Generation dataset \cite{MSMARCO} and manually labelled with query answerability to train and evaluate the classifier. As a result, the XGboost model has an overall better performance than the random forest model with prediction accuracy score 0.83 and F1 score 0.89. Once the classifier determines the user query is answerable, a MRC model may be used to find the direct answer within provided passages. Else, no direct answer shall be provided to this query.
Files
Research_Project_19_3_.pdf
(.pdf | 0.548 Mb)