Estimators for the population mean and variance for stratified sampling

The search for unbiased estimators in a suboptimal sample

Bachelor Thesis (2024)
Author(s)

L.J. Verbeeke (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Alexis Derumigny – Mentor (TU Delft - Statistics)

G. F. Nane – Graduation committee member (TU Delft - Applied Probability)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2024
Language
English
Graduation Date
03-07-2024
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Dividing a population into subgroups and conducting research on this population including the subgroups comes with a challenge. This stratified sampling relies on information about the share of the subgroups in the population. Sometimes the proportions in the sample are not taken equal to the true proportions of the population. This can be corrected through the use of particular estimators taking these proportions into account.
In this thesis, different estimators for the true population mean and variance are defined and examined in terms of bias and variance in the case of two subgroups. Weighing the measurements according to the true proportions creates unbiased estimators for both the mean and the variance. These unbiased estimators are compared with other, biased, estimators, including naive ones in which the influence of different subgroups is not taken into account. The naive estimators are not only biased, they also have a variance of the same order as the unbiased ones. When the true proportions are not available, one can only take a guess. A guess lying close to the true proportions leads to a smaller bias and therefore a better estimator. This underlines the importance of obtaining sufficient knowledge about the population.

Files

License info not available