Search results | TU Delft Repositories

Searched for: author%3A%22Smit%2C+Jordi%22

(1 - 2 of 2)

document: PEBL: Pessimistic Ensembles for Offline Deep Reinforcement Learning
Smit, Jordi (author), Ponnambalam, C.T. (author), Spaan, M.T.J. (author), Oliehoek, F.A. (author)
Offline reinforcement learning (RL), or learning from a fixed data set, is an attractive alternative to online RL. Offline RL promises to address the cost and safety implications of tak- ing numerous random or bad actions online, a crucial aspect of traditional RL that makes it difficult to apply in real-world problems. However, when RL is na...
conference paper 2021

document: OffSide: Learning to Identify Mistakes in Boundary Conditions
Arnar Briem, Jón (author), Smit, Jordi (author), Sellik, Hendrig (author), Rapoport, Pavel (author), Gousios, G. (author), Aniche, Maurício (author)
Mistakes in boundary conditions are the cause of many bugs in software. These mistakes happen when, e.g., developers make use of '<' or '>' in cases where they should have used '<=' or '>='. Mistakes in boundary conditions are often hard to find and manually detecting them might be very time-consuming for developers. While...
conference paper 2020