Topological properties and organizing principles of semantic networks


Abstract

Interpreting natural language is an increasingly important task for computer algorithms due to the growing availability of unstructured textual data. Natural Language Processing (NLP) applications rely on semantic networks for structured knowledge representation. The fundamental properties of semantic networks must be taken into account when designing NLP algorithms, yet their structure has not been systematically investigated. We study the properties of semantic networks from ConceptNet, defined by 7 semantic relations in 11 different languages. We find that semantic networks have universal basic properties: they are sparse, highly clustered, and many exhibit power-law degree distributions; indeed, the majority of the considered networks are scale-free. Some networks exhibit language-specific properties determined by grammatical rules, for example networks from highly inflected languages, such as Latin, German, French, and Spanish, show peaks in the degree distribution that deviate from a power law. We find that, depending on the semantic relation type and the language, link formation in semantic networks is guided by different principles. In some networks the connections are similarity-based, while in others they are more complementarity-based. Finally, we demonstrate how knowledge of similarity and complementarity in semantic networks can improve NLP algorithms for missing link inference.
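
The kind of basic topological checks summarized above (sparsity, clustering, power-law degree distributions) can be reproduced on any semantic network loaded as a graph. The following is a minimal sketch only, assuming the networkx and powerlaw Python packages and a hypothetical edge-list file; it is not the authors' actual pipeline.

    # Minimal sketch: basic topological checks of the kind described in the abstract.
    # The edge-list file name below is an illustrative assumption.
    import networkx as nx
    import powerlaw

    # Hypothetical edge list: one "concept_a concept_b" pair per line.
    G = nx.read_edgelist("conceptnet_synonym_en.edgelist")

    density = nx.density(G)                 # sparsity: density close to 0
    clustering = nx.average_clustering(G)   # high average clustering coefficient

    degrees = [d for _, d in G.degree()]
    fit = powerlaw.Fit(degrees, discrete=True)
    # Compare a power-law fit against an exponential alternative;
    # R > 0 with a small p favours the power law (scale-free behaviour).
    R, p = fit.distribution_compare("power_law", "exponential")

    print(f"density={density:.4f}, clustering={clustering:.3f}")
    print(f"alpha={fit.power_law.alpha:.2f}, R={R:.2f}, p={p:.3f}")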