Embodied Conversational Agent for Mental Health Intervention

Master Thesis (2018)
Author(s)

M.A. Sarder (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

W.P. Brinkman – Mentor

N. Tintarev – Graduation committee member

Franziska Burger – Graduation committee member

Joost Broekens – Graduation committee member

Herman Spliethoff – Graduation committee member

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2018 Mehedi Anam Sarder
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 Mehedi Anam Sarder
Graduation Date
29-08-2018
Awarding Institution
Delft University of Technology
Project
['Digital Media Technology']
Sponsors
EIT Digital
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Embodied Conversational Agents (ECA) seek to provide a more natural means of interaction for a user through verbal and non-verbal properties of human face-to-face communication. For this reason, these systems are found to bring benefits in different mental health related interventions. However, a key challenge in developing agents to replace the human interlocutor in a dyadic conversation, is to simulate appropriate attentive listening behaviors. In this thesis work, we explored different backchannel strategies and studied their effects in terms of likability and engagement. We built a fully embodied conversational agent with three different levels of backchannel strategies and ran a within-subject study with a convenience sample of 24 participants. The results showed that the amount of emotional words in the speech of users increased if the attentive listening capabilities of the agent were improved. In addition, the capability to trigger both verbal and nonverbal backchannels with proper timing was found to be a relevant feature in terms of improved speech rate and emotional words. Contrary to our hypothesis, backchannels based on actual emotion and sentiment analysis of the speech content were not found to be significantly influential on the quality of interaction. Multi-modal approaches are suggested for future works in order to overcome limitations of this work due to potential lack in emotion detection accuracy.

Files

Sarder_MSc_thesis.pdf
(pdf | 4.41 Mb)
License info not available