This paper presents a deception-detection module for a diabetes support system, addressing the challenge of unreliable patient self-reporting and ultimately attempting to improve diabetes care. The research is for a system called CHIP developed by the Hybrid Intelligence project
...
This paper presents a deception-detection module for a diabetes support system, addressing the challenge of unreliable patient self-reporting and ultimately attempting to improve diabetes care. The research is for a system called CHIP developed by the Hybrid Intelligence project group and TNO. Linguistic cues, such as motion verbs, negation terms, and exclusive terms were identified through a literature study and encoded using custom dictionaries. Cue detection was implemented using the SpaCy NLP library, which identifies and counts cue occurrences. A stylometric machine learning approach was favored over LLMs for explainability and scientific substantiation. In this research, an SVM model, selected for its alignment with prior research (the Mafiascum experiment), was trained on annotated Mafia game data, using normalized cue frequencies as features for the model. Although the SVM achieved high accuracy on truthful messages (F1 between 0.78–0.84), it performed poorly in detecting deception (F1 between 0.21–0.22), likely because of the high frequency of truthful input compared to deceptive input. The low accuracy, along with the model’s domain transferability and performance limitations, suggest further work is needed, particularly with context-specific data and possible integration with LLM-based approaches.