Estimating intentions to speak using Lexical information

Leveraging Lexical Information to Facilitate Social Interactions with Artificial Agents

More Info
expand_more

Abstract

This research paper implements, evaluates, and compares two approaches, a machine learning (ML) approach and a rule-based approach, aimed to estimate intentions to speak. The ML approach trains lexical information extracted from time windows surrounding speech events. The rule-based approach looks for specific keywords or utterances to identify intentions to speak. The results show that the ML approach is a more favourable solution to the problem due to its adaptability and potential for improvement. Sample generation and parameter tweaking showed to be vital to the performance of the model, with its best performance being when it predicted unsuccessful intentions to continue speaking. This study concludes that a machine learning approach can be a viable solution for estimating intentions to continue speaking, with there being future use cases in conversational systems and human-computer interactions.