Medical Concept Normalization in User-Generated Text

Master Thesis (2019)
Author(s)

E. Manousogiannis (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

A Bozzon – Mentor (TU Delft - Industrial Design Engineering)

RJ Sips – Mentor

S. Mesbah – Mentor (TU Delft - Web Information Systems)

G. J. Houben – Graduation committee member (TU Delft - Web Information Systems)

Jan Rellermeyer – Graduation committee member (TU Delft - Data-Intensive Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2019 Manolis Manousogiannis
More Info
expand_more
Publication Year
2019
Language
English
Copyright
© 2019 Manolis Manousogiannis
Graduation Date
13-09-2019
Awarding Institution
Delft University of Technology
Programme
['Computer Science | Data Science and Technology']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Online social networks have revolutionized the way people interact with each other nowadays. Users often share their experiences in various health - related topics like disease symptoms, drug treatments and other medical related issues in order to discuss with other patients dealing with similar conditions. During the production of a new drug, important drug properties like possible Adverse Drug Reactions (ADRs) are monitored through a phase of clinical trials. However, due to various factors that can not be easily measured in those trials, patients can potentially experience adverse events that were not related to their treatment before, or were related to it in a much smaller frequency. Therefore, the automatic detection of Adverse Events in online networks is gaining an increasing popularity among researchers in the biomedical community, as it can offer a valuable complementary source of information, next to the traditional approaches of reporting those events to the corresponding Food and Drug Association. From an NLP perspective, this task poses a significant challenge as there is a large gap between the informal language used in social media and the formal medical language used to officially describe a medical concept. Moreover, there is an absence of large annotated datasets, as the manual labeling of an adverse effect mentions is a time-consuming and often ambiguous procedure which also requires some sort of medical expertise. In this work we propose a novel machine learning approach to normalize Adverse Drug Effect mentions in user-generated text to a standard vocabulary from a medical Ontology. The evaluation results of the proposed model demonstrates a competitive performance among the current state of the art techniques, posing the potential feasibility of our model in the medical concept normalization domain.

Files

License info not available