YL
Y. LI
info
Please Note
<p>This page displays the records of the person named above and is not linked to a unique person identifier. This record may need to be merged to a profile.</p>
1 records found
1
GNN-LLM Hybrids for Node Classification
A Comparative Study of GNN, LLM-based, and Hybrid Models
Node classification on text-attributed graphs requires both structural reasoning and rich semantic understanding.
While Graph Neural Networks (GNNs) have become the dominant solution by leveraging graph topology, they often rely on limited textual representations.
Recent advances, therefore, explore integrating large language models (LLMs) to provide stronger semantic encoding for graph learning. However, existing work often evaluates different LLM integration paradigms under individually designed experimental settings, making it difficult to assess their relative strengths for node classification tasks. In this work, we present a controlled empirical comparison between classical message-passing graph neural networks, parameter-efficient LLM-GNN integration (ENGINE), prompt-based generative reasoning (LLaGA), and a lightweight hybrid model that combines structural message passing with label-aware semantic alignment. We evaluate all models on two widely used textual graph benchmarks, \textsc{Cora} and \textsc{WikiCS}, under a unified transductive evaluation protocol and varying levels of training supervision. Our results show that the performance of ENGINE consistently outperforms strong GNN baselines, whereas LLaGA is more sensitive to inference constraints and evaluation protocols. The additional hybrid model we propose in this thesis further demonstrates complementary benefits, particularly in low-supervision regimes. These findings clarify practical trade-offs between discriminative and generative LLM-based graph models and highlight hybrid designs as a promising direction for efficient and robust node classification on textual graphs. ...
While Graph Neural Networks (GNNs) have become the dominant solution by leveraging graph topology, they often rely on limited textual representations.
Recent advances, therefore, explore integrating large language models (LLMs) to provide stronger semantic encoding for graph learning. However, existing work often evaluates different LLM integration paradigms under individually designed experimental settings, making it difficult to assess their relative strengths for node classification tasks. In this work, we present a controlled empirical comparison between classical message-passing graph neural networks, parameter-efficient LLM-GNN integration (ENGINE), prompt-based generative reasoning (LLaGA), and a lightweight hybrid model that combines structural message passing with label-aware semantic alignment. We evaluate all models on two widely used textual graph benchmarks, \textsc{Cora} and \textsc{WikiCS}, under a unified transductive evaluation protocol and varying levels of training supervision. Our results show that the performance of ENGINE consistently outperforms strong GNN baselines, whereas LLaGA is more sensitive to inference constraints and evaluation protocols. The additional hybrid model we propose in this thesis further demonstrates complementary benefits, particularly in low-supervision regimes. These findings clarify practical trade-offs between discriminative and generative LLM-based graph models and highlight hybrid designs as a promising direction for efficient and robust node classification on textual graphs. ...
Node classification on text-attributed graphs requires both structural reasoning and rich semantic understanding.
While Graph Neural Networks (GNNs) have become the dominant solution by leveraging graph topology, they often rely on limited textual representations.
Recent advances, therefore, explore integrating large language models (LLMs) to provide stronger semantic encoding for graph learning. However, existing work often evaluates different LLM integration paradigms under individually designed experimental settings, making it difficult to assess their relative strengths for node classification tasks. In this work, we present a controlled empirical comparison between classical message-passing graph neural networks, parameter-efficient LLM-GNN integration (ENGINE), prompt-based generative reasoning (LLaGA), and a lightweight hybrid model that combines structural message passing with label-aware semantic alignment. We evaluate all models on two widely used textual graph benchmarks, \textsc{Cora} and \textsc{WikiCS}, under a unified transductive evaluation protocol and varying levels of training supervision. Our results show that the performance of ENGINE consistently outperforms strong GNN baselines, whereas LLaGA is more sensitive to inference constraints and evaluation protocols. The additional hybrid model we propose in this thesis further demonstrates complementary benefits, particularly in low-supervision regimes. These findings clarify practical trade-offs between discriminative and generative LLM-based graph models and highlight hybrid designs as a promising direction for efficient and robust node classification on textual graphs.
While Graph Neural Networks (GNNs) have become the dominant solution by leveraging graph topology, they often rely on limited textual representations.
Recent advances, therefore, explore integrating large language models (LLMs) to provide stronger semantic encoding for graph learning. However, existing work often evaluates different LLM integration paradigms under individually designed experimental settings, making it difficult to assess their relative strengths for node classification tasks. In this work, we present a controlled empirical comparison between classical message-passing graph neural networks, parameter-efficient LLM-GNN integration (ENGINE), prompt-based generative reasoning (LLaGA), and a lightweight hybrid model that combines structural message passing with label-aware semantic alignment. We evaluate all models on two widely used textual graph benchmarks, \textsc{Cora} and \textsc{WikiCS}, under a unified transductive evaluation protocol and varying levels of training supervision. Our results show that the performance of ENGINE consistently outperforms strong GNN baselines, whereas LLaGA is more sensitive to inference constraints and evaluation protocols. The additional hybrid model we propose in this thesis further demonstrates complementary benefits, particularly in low-supervision regimes. These findings clarify practical trade-offs between discriminative and generative LLM-based graph models and highlight hybrid designs as a promising direction for efficient and robust node classification on textual graphs.