TW
T.D. Westerborg
info
Please Note
<p>This page displays the records of the person named above and is not linked to a unique person identifier. This record may need to be merged to a profile.</p>
2 records found
1
Learning to Rank is the application of Machine Learning in order to create and optimize ranking functions. Most Learning to Rank methods follow a listwise approach and optimize a listwise loss function which closely resembles the same metric used in the evaluation. Popular listwise loss functions such as nDCG, AP and nRBP do not have consistent bounds across topics and do not account for instance-difficulty. As a result, the loss score does not solely reflect the performance of the model but also depends on the instance properties. During training, each instance is assumed to be equally informative, while in reality, this informativeness might depend on the difficulty of the instance. In this thesis, we propose four bounding methods which utilize some notion of instance-difficulty to produce difficulty-aware losses. Experimental results showed that, in most cases, optimizing a bounded variant of nDCG, AP or nRBP results in a consistent but marginal increase in the overall performance. More interestingly, we found that optimizing a bounded variant of nRBP and AP may increase the nDCG@k score, increasing the recommendation utility. Overall, our results show promising results for user/query-wise metric bounding in Learning to Rank, especially when applied to nRBP.
...
Learning to Rank is the application of Machine Learning in order to create and optimize ranking functions. Most Learning to Rank methods follow a listwise approach and optimize a listwise loss function which closely resembles the same metric used in the evaluation. Popular listwise loss functions such as nDCG, AP and nRBP do not have consistent bounds across topics and do not account for instance-difficulty. As a result, the loss score does not solely reflect the performance of the model but also depends on the instance properties. During training, each instance is assumed to be equally informative, while in reality, this informativeness might depend on the difficulty of the instance. In this thesis, we propose four bounding methods which utilize some notion of instance-difficulty to produce difficulty-aware losses. Experimental results showed that, in most cases, optimizing a bounded variant of nDCG, AP or nRBP results in a consistent but marginal increase in the overall performance. More interestingly, we found that optimizing a bounded variant of nRBP and AP may increase the nDCG@k score, increasing the recommendation utility. Overall, our results show promising results for user/query-wise metric bounding in Learning to Rank, especially when applied to nRBP.
BachelorEnd Project
Real-time anomaly detection in critical Rabobank Processes
Bachelor thesis
(2019)
-
Arthur Breurkes, Maikel Kerkhof, Ricardo Jongerius, Ties Westerborg, Sicco Verwer
Real-time processing of log data can give valuable insights in the behaviour of systems and processes. The Rabobank is a large bank and has several of these systems and processes, one of such is the QR device registration process. In order to monitor this process, an application was built which detects abnormal behaviour. The application reads in log data, parses it and then analyses it. Analysing is done by fitting distributions of the duration of each step in the process, and checking for anomalies in new incoming data. Finally, a dashboard was made in which the analysis and the distributions are visualised. More information is also available in the dashboard, like a Markov chain of the process and several key performance indicators.
...
Real-time processing of log data can give valuable insights in the behaviour of systems and processes. The Rabobank is a large bank and has several of these systems and processes, one of such is the QR device registration process. In order to monitor this process, an application was built which detects abnormal behaviour. The application reads in log data, parses it and then analyses it. Analysing is done by fitting distributions of the duration of each step in the process, and checking for anomalies in new incoming data. Finally, a dashboard was made in which the analysis and the distributions are visualised. More information is also available in the dashboard, like a Markov chain of the process and several key performance indicators.