Classification of Covert Vowels in Spanish and Dutch

None, None

Classification of Covert Vowels in Spanish and Dutch

What do brain signals say about inner speech?

Master Thesis (2023)

Author(s)

I. Kyriazis (TU Delft - Mechanical Engineering)

Contributor(s)

AC Schouten – Mentor (TU Delft - Biomechanical Engineering)

O.E. Scharenborg – Mentor (TU Delft - Multimedia Computing)

A. Seth – Graduation committee member (TU Delft - Biomechatronics & Human-Machine Control)

Y. B. B. Eisma – Graduation committee member (TU Delft - Human-Robot Interaction)

Faculty

Mechanical Engineering

Copyright

Machine learning Deep Learning Electroencephalography (EEG) Brain Computer Interface (BCI) Dutch covert speech Spanish covert speech Inner speech

To reference this document use:

https://resolver.tudelft.nl/uuid:3b28b84f-f36f-476c-9cca-d507173f5155

More Info

expand_more

Publication Year

2023

Language

English

Copyright

Graduation Date

23-08-2023

Awarding Institution

Delft University of Technology

Programme

['Biomedical Engineering']

Faculty

Mechanical Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Patients with neuromuscular diseases that are unable to speak, but whose cognitive ability has been maintained, can be benefited from Brain Computer Interfaces (BCIs). The decoding of inner (covert) speech from EEGs consists of one of the state of the art methods that aim to tackle this issue. High variability between subjects, as well as low signal to noise ratio (SNR) undermine the methods used, and introduce the need for computer assisted solutions. Thus, machine learning models as well as large amounts of recorded data are required to design effective algorithms and produce substantial results. In this study, covert vowel classification was performed in a systematic way, by making use of two openly shared databases from literature; the Coretto database, that contains EEG recordings of native Spanish speakers, and the DAIS dataset, which includes EEG recordings of native Dutch speakers. Six classifiers were initially selected to perform 5-class classification: a Random Forest (RF), a k Nearest Neighbours (kNN), a Gaussian Naive Bayers (GNB), a Deep Convolutional Neural Network (DCNN), a Shallow Convolutional Neural Network (SCNN) and a Long Short Term Memory Recurrent Neural Network (LSTM). The DCNN outperformed the other methods, with average intra-subject accuracies of 35% for Coretto and 39% for DAIS (chance level 20%). Afterwards, an Overt versus Covert trials experiment was implemented, to test the limits of overt speech decoding from EEGs. The overt result was slightly higher than covert, with an intra-subject average value of 37.8% for Coretto and 40.5% for DAIS (chance level 20%). Finally, binary classification was performed to identify those pairs of vowels that can be classified more efficiently. Vowels /a/ and /u/ seemed to perform better in average in both datasets (average of 64.8% for Coretto and 64.4% for DAIS with a chance accuracy of 50%). Future work should focus on identifying the useful parts of the EEG recordings, increasing the SNR and the resolution of the electrodes, and defining the most appropriate dictionaries of words/vowels for a BCI. Also, more studies should follow systematic ways of comparisons between datasets, to obtain less ambiguous insights and lead this field to improvements.

Files

MScThesis_IoannisKyriazis_Cove... (pdf)

(pdf | 5.07 Mb)

- Embargo expired in 28-08-2024

License info not available