DK
D.D. Kaak
info
Please Note
<p>This page displays the records of the person named above and is not linked to a unique person identifier. This record may need to be merged to a profile.</p>
2 records found
1
During a digital fraud investigation the search for relevant information in mailboxes of custodians is like finding a needle in a haystack. This time consuming task can, on various levels, be improved and made more efficient. Technology Assisted Review (TAR) is already one of the available machine learning algorithms that helps speeding up the process of finding relevant information. In Technology Assisted Review a model is trained based on the classification of e-mails by expert review. During the review process TAR continuously gives back the (potentially) most relevant e-mails that still need to be given a classification. The downside of this algorithm is that a manual expert review is still needed before TAR can give recommendations. This thesis will focus on introductory research on models that give an initial sorting before the expert review is done. The hypothesis that will be used is that this sorting (or classification) can be done in a similar manner as spam e-mails are removed to the junk folder in a mailbox. Three different features have been used (word frequencies, word occurrences and length of an e-mail) on four different models for each feature (A generative and discriminative model, each with maximum likelihood estimation or Bayesian estimation). Each of these 12 different implementations have been tested on three different datasets (TREC, ENRON and a confidential dataset). Based on 5-fold cross validation the Bayesian generative model based on word frequencies has been shown to perform best on the confidential dataset. This model shows that a classification at the start of a digital fraud investigation can be helpful. Combining different models, and finding the best parameters for practical usage of the model is left for further research.
...
During a digital fraud investigation the search for relevant information in mailboxes of custodians is like finding a needle in a haystack. This time consuming task can, on various levels, be improved and made more efficient. Technology Assisted Review (TAR) is already one of the available machine learning algorithms that helps speeding up the process of finding relevant information. In Technology Assisted Review a model is trained based on the classification of e-mails by expert review. During the review process TAR continuously gives back the (potentially) most relevant e-mails that still need to be given a classification. The downside of this algorithm is that a manual expert review is still needed before TAR can give recommendations. This thesis will focus on introductory research on models that give an initial sorting before the expert review is done. The hypothesis that will be used is that this sorting (or classification) can be done in a similar manner as spam e-mails are removed to the junk folder in a mailbox. Three different features have been used (word frequencies, word occurrences and length of an e-mail) on four different models for each feature (A generative and discriminative model, each with maximum likelihood estimation or Bayesian estimation). Each of these 12 different implementations have been tested on three different datasets (TREC, ENRON and a confidential dataset). Based on 5-fold cross validation the Bayesian generative model based on word frequencies has been shown to perform best on the confidential dataset. This model shows that a classification at the start of a digital fraud investigation can be helpful. Combining different models, and finding the best parameters for practical usage of the model is left for further research.
Understanding Terrorist Activity
Is Agent-Based Modelling a viable solution?
In this report an Agent-Based Model created by Bulleit and Drewek used for analysing terrorist attacks will be implemented and compared with reality.
First the implementation of the model is explained based on different articles written by Bulleit and Drewek. After that the data of the simulations are analysed statistically. In this analysis the model is compared with the Global Terrorism Database for consistency with historical data and the model is compared for consistency in distributions of the data based on research reports. Finally, a few extensions will be implemented and again analysed statistically.
An Agent-Based Model for terrorist activity can give us insights in the behaviour of terrorists. It can especially be used for security policies or when we want to investigate what happens with an implementation of a new type of security. ...
First the implementation of the model is explained based on different articles written by Bulleit and Drewek. After that the data of the simulations are analysed statistically. In this analysis the model is compared with the Global Terrorism Database for consistency with historical data and the model is compared for consistency in distributions of the data based on research reports. Finally, a few extensions will be implemented and again analysed statistically.
An Agent-Based Model for terrorist activity can give us insights in the behaviour of terrorists. It can especially be used for security policies or when we want to investigate what happens with an implementation of a new type of security. ...
In this report an Agent-Based Model created by Bulleit and Drewek used for analysing terrorist attacks will be implemented and compared with reality.
First the implementation of the model is explained based on different articles written by Bulleit and Drewek. After that the data of the simulations are analysed statistically. In this analysis the model is compared with the Global Terrorism Database for consistency with historical data and the model is compared for consistency in distributions of the data based on research reports. Finally, a few extensions will be implemented and again analysed statistically.
An Agent-Based Model for terrorist activity can give us insights in the behaviour of terrorists. It can especially be used for security policies or when we want to investigate what happens with an implementation of a new type of security.
First the implementation of the model is explained based on different articles written by Bulleit and Drewek. After that the data of the simulations are analysed statistically. In this analysis the model is compared with the Global Terrorism Database for consistency with historical data and the model is compared for consistency in distributions of the data based on research reports. Finally, a few extensions will be implemented and again analysed statistically.
An Agent-Based Model for terrorist activity can give us insights in the behaviour of terrorists. It can especially be used for security policies or when we want to investigate what happens with an implementation of a new type of security.