Circular Image

D. Kurowicka

19 records found

High-dimensional Pearson's chi-squared test

Hoge dimensionale Pearson chi-sqaured toets

This paper revisits Pearson's chi-square test and studies its properties, highlighting the behavior of the test when applied to large supports, i.e., the number of cells versus the sample size. First, we explore the general behavior through a controlled simulation, wherein we fin ...
Statistical inference of low-frequency time series is a challenge present in various fields, such as financial risk management and weather forecasting. Practical difficulties arise due to the scarcity of non-overlapping observations. The “direct method”, which directly uses the a ...
This thesis concerns modeling residential real estate selling prices in a hedonic price model framework on a small spatial-temporal granularity. The research addresses the challenge of sparse spatial-temporal real estate data, i.e. many combinations of location and time with few ...
Prisons serve as amplifiers of Tuberculosis (TB) transmission due to overpopulation, lack of hygiene, and bad ventilation [Mabud et al., 2019] [Baussano et al., 2010]. The risk of TB is elevated during and after incarceration, only returning to the general population’s risk seven ...
In this thesis, we explore the structure of consistent bootstrap statistics in hypothesis testing. Bootstrap, as a very useful technique when theoretical distributions are not available or when the sample size is small, enjoys a lot of interest from applied statisticians. Histori ...

Dutch electricity spot price forecasting

Two study cases using structured expert judgement

Electricity is a quite unique commodity. Due to the economically non-storable nature of the commodity that electricity is, the constant balance between consumption and production, weather effects, such as temperature, wind speed, solar intensity etc, and the intensity of everyday ...
In this thesis, we present a study to obtain a clear and accurate overview of the progress and behaviour of COVID-19 in the Netherlands. We distinguish two parts for this study. The first part is to estimate the total number of infected people as a function of time by combining ...
Measuring variable importance is often a difficult task: among others models can be complex and covariates can interact with each other and can be correlated. This study focuses on two questions: First, what should be the theoretical measure of variable importance under a given d ...

Aligning AI with Human Norms

Multi-Objective Deep Reinforcement Learning with Active Preference Elicitation

The field of deep reinforcement learning has seen major successes recently, achieving superhuman performance in discrete games such as Go and the Atari domain, as well as astounding results in continuous robot locomotion tasks. However, the correct specification of human intentio ...

Sensitivity analysis for hydrodynamic model of the North Sea

Considering the correlations and dependencies between parameters

In this thesis, sensitivity analysis is used to study the influences of parameters on specific outputs in the hydrodynamic model 3D DCSM-FM of the North Sea. The sensitivity analysis is the study of how uncertainty in the outputs of a model can be divided and allocated to differe ...
Safe hypothesis tests are tests that are robust under accumulation bias, namely when there are dependencies between the results of previous studies and the decision whether to conduct further studies. We construct two types of safe test for the 2 × 2 contingency table, the condit ...
In this thesis we shall consider sample covariance matrices Sn in the case when the dimension of the data increases with the sample size to infinity ,while the ratio approaches a fixed constant. We will derive a new statistic based on the general linear shrinkage estimator by Bod ...
In this modern age, data is being generated constantly and data is being saved for analysis everywhere. In the maritime industry, interest in the analysis of ship data has grown over the years. In this thesis, we will take a look at AIS data coupled with sea state data. AIS data ...
In de statistiek zijn er verschillende methodes voor het uitvoeren van model selectie. Het verschil in deze methodes komt voort uit het verschil in stromingen. Voor niet-geneste model selectie zijn de meest ganbare stromingen de Bayes Factor en de likelihood ratio. D. M. Ommen en ...
In dit onderzoek zijn twee manieren van A/B-testen met elkaar vergeleken. A/B-testen is het vergelijken van verschillende website versies om te achterhalen welke versie voor een hogere opbrengt zorgt. Consumenten krijgen afzonderlijk meerdere versies van een website te zien: vers ...

Efficient Inference with Panel Data

On the pass-through of the Dutch 2001 and 2012 VAT increases to consumer prices

This thesis evaluates the pass-through of the 2001 and 2012 Dutch Value Added Tax (VAT) increases to customer prices using a difference-in-differences model. To this end, the first difference and feasible generalised least squares estimators are introduced. Contrary to the conven ...
In this report we present an interactive multi-objective optimization tool that was developed as part of this graduation project. This appliance is meant to be used as a decision support tool for transportation planners working on a synchromodal transportation network on the cont ...
ASML produces TwinScan NXT machines that are used for the production of microchips. The machines ensure that an accurate pattern of DUV-light passes a lens and that it is projected as accurate as possible on the wafer. To ensure that the focal point of the converged DUV-light fal ...
This bachelor thesis is about binary error-correcting codes. A binary code is a collection words with the same length n that consists only of zeroes and ones. The error-correcting quality of a code is determined by the Hamming distance d of a code. A classical question in coding ...