Zero-shot learning for (dis)agreement detection in meeting trancripts

Comparing latent topic models and large language models

Bachelor Thesis (2023)
Author(s)

D.F.P. de Weerd (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Morita Tarvirdians – Mentor (TU Delft - Interactive Intelligence)

CM Jonker – Mentor (TU Delft - Interactive Intelligence)

M. Molenaar – Graduation committee member (TU Delft - Computer Graphics and Visualisation)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2023 Daniël de Weerd
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 Daniël de Weerd
Graduation Date
03-07-2023
Awarding Institution
Delft University of Technology
Project
CSE3000 Research Project
Programme
Computer Science and Engineering
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This paper presents a novel approach to detect agreement and disagreement moments between participants in meeting transcripts without relying on labeled data. We propose a model in which disagreement detection is defined as the process of first identifying argumentative theses relevant to a given corpus of text and then classifying all phrases in the text as being either in favor of, against or expressing no opinion on a given thesis. To identify relevant theses, we compare the performance of a latent Dirichlet allocation-based topic model against that of a diverse set of large language models. To classify the stance of a phrase with respect to a thesis, only large language models are used. We find that, while state-of-the-art large language models do not outperform topic modeling-based approaches in extracting semantically relevant content, they are capable of presenting such content in a more concise and grammatically correct manner. We also find that state-of-the-art large language models are not capable of accurately performing stance classification as described above.

Files

License info not available