Towards an evolutionary-based approach for natural language processing

Conference Paper (2020)
Author(s)

Luca Manzoni (University of Trieste)

Domagoj Jakobovic (University of Zagreb)

L. Mariot (TU Delft - Cyber Security)

Stjepan Picek (TU Delft - Cyber Security)

Mauro Castelli (Universidade Nova de Lisboa)

Research Group
Cyber Security
Copyright
© 2020 Luca Manzoni, Domagoj Jakobovic, L. Mariot, S. Picek, Mauro Castelli
DOI related publication
https://doi.org/10.1145/3377930.3390248
More Info
expand_more
Publication Year
2020
Language
English
Copyright
© 2020 Luca Manzoni, Domagoj Jakobovic, L. Mariot, S. Picek, Mauro Castelli
Research Group
Cyber Security
Bibliographical Note
Accepted author manuscript@en
Pages (from-to)
985-993
ISBN (print)
978-1-4503-7128-5
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Tasks related to Natural Language Processing (NLP) have recently been the focus of a large research endeavor by the machine learning community. The increased interest in this area is mainly due to the success of deep learning methods. Genetic Programming (GP), however, was not under the spotlight with respect to NLP tasks. Here, we propose a first proof-of-concept that combines GP with the well established NLP tool word2vec for the next word prediction task. The main idea is that, once words have been moved into a vector space, traditional GP operators can successfully work on vectors, thus producing meaningful words as the output. To assess the suitability of this approach, we perform an experimental evaluation on a set of existing newspaper headlines. Individuals resulting from this (pre-)training phase can be employed as the initial population in other NLP tasks, like sentence generation, which will be the focus of future investigations, possibly employing adversarial co-evolutionary approaches.

Files

License info not available