Improving Adaptive Learning Models Using Prosodic Speech Features

None, None; None, None; None, None; None, None

Improving Adaptive Learning Models Using Prosodic Speech Features

Conference Paper (2023)

Author(s)

Thomas Wilschut (University Medical Center Groningen)

Florian Sense (LLC)

Odette Scharenborg (TU Delft - Multimedia Computing)

Hedderik van Rijn (University Medical Center Groningen)

Research Group

Multimedia Computing

DOI related publication

https://doi.org/10.1007/978-3-031-36272-9_21

Machine learning Intensity Automatic Speech Recognition Pitch Adaptive Learning Cognitive Modeling Speaking Speed Speech prosody

To reference this document use:

https://resolver.tudelft.nl/uuid:977dd986-58c9-4df9-9336-7bcb31a1b58a

More Info

expand_more

Publication Year

2023

Language

English

Research Group

Multimedia Computing

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Pages (from-to)

255-266

Publisher

Springer

ISBN (print)

9783031362712

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Cognitive models of memory retrieval aim to describe human learning and forgetting over time. Such models have been successfully applied in digital systems that aid in memorizing information by adapting to the needs of individual learners. The memory models used in these systems typically measure the accuracy and latency of typed retrieval attempts. However, recent advances in speech technology have led to the development of learning systems that allow for spoken inputs. Here, we explore the possibility of improving a cognitive model of memory retrieval by using information present in speech signals during spoken retrieval attempts. We asked 44 participants to study vocabulary items by spoken rehearsal, and automatically extracted high-level prosodic speech features—patterns of stress and intonation—such as pitch dynamics, speaking speed and intensity from over 7,000 utterances. We demonstrate that some prosodic speech features are associated with accuracy and response latency for retrieval attempts, and that speech feature informed memory models make better predictions of future performance relative to models that only use accuracy and response latency. Our results have theoretical relevance, as they show how memory strength is reflected in a specific speech signature. They also have important practical implications as they contribute to the development of memory models for spoken retrieval that have numerous real-world applications.

Files

978_3_031_36272_9_21.pdf

(pdf | 0.563 Mb)

- Embargo expired in 01-01-2024

License info not available