Zero-shot learning for (dis)agreement detection in meeting trancripts

None, None

Zero-shot learning for (dis)agreement detection in meeting trancripts

Comparing latent topic models and large language models

Bachelor Thesis (2023)

Author(s)

D.F.P. de Weerd (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Morita Tarvirdians – Mentor (TU Delft - Interactive Intelligence)

CM Jonker – Mentor (TU Delft - Interactive Intelligence)

M. Molenaar – Graduation committee member (TU Delft - Computer Graphics and Visualisation)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Meeting transcript Summarization Large language models

To reference this document use:

https://resolver.tudelft.nl/uuid:01189347-da5f-45ac-b4d9-7e3fc18d7802

More Info

expand_more

Publication Year

2023

Language

English

Copyright

Graduation Date

03-07-2023

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This paper presents a novel approach to detect agreement and disagreement moments between participants in meeting transcripts without relying on labeled data. We propose a model in which disagreement detection is defined as the process of first identifying argumentative theses relevant to a given corpus of text and then classifying all phrases in the text as being either in favor of, against or expressing no opinion on a given thesis. To identify relevant theses, we compare the performance of a latent Dirichlet allocation-based topic model against that of a diverse set of large language models. To classify the stance of a phrase with respect to a thesis, only large language models are used. We find that, while state-of-the-art large language models do not outperform topic modeling-based approaches in extracting semantically relevant content, they are capable of presenting such content in a more concise and grammatically correct manner. We also find that state-of-the-art large language models are not capable of accurately performing stance classification as described above.

Files

D.F.P._de_Weerd_Zero_shot_lear... (pdf)

(pdf | 0.25 Mb)

License info not available