Print Email Facebook Twitter Semantic enrichment and exploration process on domain specific digital libraries Title Semantic enrichment and exploration process on domain specific digital libraries Author Fragkeskos, Kyriakos (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Lofi, Christoph (mentor) Houben, Geert-Jan (graduation committee) Venkatesha Prasad, Ranga Rao (graduation committee) Degree granting institution Delft University of Technology Date 2017-06-26 Abstract With the growing number of scientific publications, the conceptof navigating effectively and searching for domain specific informationis rather significant and highly important for the scientificcommunity [2]. For instance, to search by topics, research methods,used datasets, or scientific objectives. Such deep meta-data increaseour perception for a given domain (i.e. Data Processing Pipelines) andfacilitate us to understand and visualize the evolution of researchtopics and venues over time. Nevertheless, the extraction of suchdeep meta-data from text-based documents is notorious challengingand demanding due to the unstructured and ambiguous languageof the text in different publications.The work in this paper has already contributed into two publicationattempts; with one published paper at ESWC conference,and one accepted paper at the TPDL conference. Furthermore, thework in this project extends the analysis of previous attempts byadding more data from the domain of Data Processing Pipelines andby including one additional domain for analysis (i.e. the domainof Robotics). Moreover, this work provides justifications for all theimplementation decisions and proposes a refined version of theonline domain-aware semantic enrichment framework (SmartPub),that automates the generation of deep meta-data by utilizing keyfacets from the domains of Data Processing Pipelines and Robotics.The goal is to generate structured meta-data (i.e. named entities orphrases), from full-text scientific publications, with respect to a setof domain aware facets (i.e. Objective, Methods, Dataset, Software,and Results), and afterwards, to construct groups of facet-terms (i.e.facet-topics) according to their semantic similarity for allowingdata exploration and navigation. Finally, the proposed frameworkis evaluated both quantitatively and qualitatively on seventeen conferenceseries from the domains of Data Processing Pipelines andRobotics. Subject Meta-Data ExtractionDigital LibraryFacet EmbeddingNatural Language ProcessingTopic Modeling To reference this document use: http://resolver.tudelft.nl/uuid:4475a83f-d0e8-485a-ba74-5478f43c32aa Part of collection Student theses Document type master thesis Rights © 2017 Kyriakos Fragkeskos Files PDF Kyriakos_Fragkeskos_Final ... Thesis.pdf 3.89 MB Close viewer /islandora/object/uuid:4475a83f-d0e8-485a-ba74-5478f43c32aa/datastream/OBJ/view