Knowledge Graph-based Recommender Systems (KGRS) have attracted significant attention due to their potential to enhance both recommendation accuracy and interpretability by leveraging structured relational knowledge. Despite the widespread use of reasoning paths as explanations,
...
Knowledge Graph-based Recommender Systems (KGRS) have attracted significant attention due to their potential to enhance both recommendation accuracy and interpretability by leveraging structured relational knowledge. Despite the widespread use of reasoning paths as explanations, systematic evaluation of explanation quality across different KGRS paradigms, and its relationship with recommendation correctness, remains limited. This study provides a comprehensive empirical analysis of three representative KGRS paradigms, including path-based, embedding-based, and hybrid methods, focusing on multiple dimensions of explanation quality, including temporal relevance, popularity, diversity, semantic consistency, and faithfulness.
The analysis investigates differences in explanation characteristics across paradigms, revealing that models such as TMER achieve high recommendation accuracy through constrained path structures, whereas reinforcement learning-based models, including TPRec and PGPR, generate explanations with stronger lexical alignment to user review rationales. The study also examines explanation quality for correctly recommended (relevant) and incorrectly recommended (irrelevant) items. The results show that explanation-ground truth consistency metrics (Precision, Recall, and F1) exhibit greater disparities between correctly and incorrectly recommended items than other evaluation metrics. In RippleNet, the impact of ripple set size and explanation-oriented neighbor sampling strategies on recommendation performance and explanation quality is analyzed. Non-uniform sampling guided by temporal relevance, popularity, or diversity effectively shapes ripple sets to enhance explanation properties without significantly affecting overall recommendation accuracy.
The findings highlight trade-offs between recommendation accuracy and explanation quality, demonstrating that careful model design and sampling strategies can produce interpretable, user-aligned recommendations while maintaining high performance. These insights provide guidance for developing KGRS capable of delivering accurate predictions accompanied by semantically rich, temporally relevant, and user-preferred explanations, thereby improving transparency, trust, and user satisfaction in real-world applications.