Print Email Facebook Twitter Coderivative Document Recognition Title Coderivative Document Recognition Author De Reede, S.J. Contributor Rothkrantz, L. (mentor) Wiggers, P. (mentor) Sodoyer, B.R. (mentor) Stegeman, O. (mentor) Faculty Electrical Engineering, Mathematics and Computer Science Date 2008-06-18 Abstract Knowledge management in large enterprises currently depends very much on the active participation of the employees. If an employee is unable or unwilling to share his knowledge,the knowledge management fails. To counter this situation, the documents of all employees can be collected automatically and stored in a knowledge management application. However, since every document is subject to change during its life cycle, many versions of a document will be created, and thus collected and stored. A query put to a knowledge management application would produce a result list that is polluted by these many versions of all documents, which reduces the accessibility and usability of the knowledge management application. We provide a solution to this problem by introducing the Cayman system. This software system consists of the implementation of a new algorithm that is able to recognize different versions of documents, so-called coderivative documents. Additionally, it consists of a prototype application that collects documents, and provides a way to fully integrate our algorithm within an existing well-known knowledge management application. We compare our algorithm with six other state-of-the-art algorithms using a real life dataset. The algorithms are evaluated with several useful graphical methods, and, most importantly, with one quantitative method. This enables us to make a solid comparison of their performance. Our experiment shows that the newly introduced algorithm surpasses every other algorithm,except for one. Surprisingly, this is the most simple baseline algorithm, which all more recent algorithms are expected to outperform. Concluding, we can say that the coderivative document recognition system shows a satisfactory performance for being of practical use. The Cayman system is put to a practical test in a professional environment, and succeeds in collecting documents, recognizing coderivatives, and making them accessible and reusable for employees. Subject knowledgeversiondocumentrecognitioninformation retrieval To reference this document use: http://resolver.tudelft.nl/uuid:b03299b3-745c-43a2-ba88-2860b40c5ef8 Publisher TU Delft, Electrical Engineering, Mathematics and Computer Science, Media and Knowledge Engineering Part of collection Student theses Document type master thesis Rights (c) 2008 De Reede, S.J. Files PDF ewi_reede_2008.pdf 15.75 MB Close viewer /islandora/object/uuid:b03299b3-745c-43a2-ba88-2860b40c5ef8/datastream/OBJ/view