Automatic Psychological Text Analysis using Support Vector Machine Classification

Bachelor Thesis (2021)
Author(s)

J. Park (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

W.P. Brinkman – Mentor (TU Delft - Interactive Intelligence)

M. Bruijnes – Mentor (TU Delft - Interactive Intelligence)

H.S. Hung – Coach (TU Delft - Pattern Recognition and Bioinformatics)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2021 Jeongwoo Park
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 Jeongwoo Park
Graduation Date
01-07-2021
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In recent years, there has been an increasing number of patients with mental disorders. A conversational agent is being developed to ensure an easier diagnosis based on the chat between a patient and the agent. The objective of this research is to assess how well Support Vector Machine (SVM) classifies text into its corresponding schema, which are the mental states of the patient. In total, three different classifications have been attempted, Binary, Ordinal, and Per-Questionnaire. The experimental result indicated that SVM is possible to classify 2 out of 7 schema modes, but in general, the performance of SVM was not outperforming with a low f1-score. At the end of the research, SVM was compared to Recurrent Neural Network (RNN) and k-Nearest-Neighbour (kNN) and it turned out that RNN gives the best performance. One of the limitations affecting the result is the quality of the data set. With more correlated labels and a greater size of the data set, improved results can be expected.

Files

Research_Project_2020_1.pdf
(pdf | 0.396 Mb)
License info not available