Estimators for the population mean and variance for stratified sampling

The search for unbiased estimators in a suboptimal sample

Bachelor Thesis (2024)

Author(s)

L.J. Verbeeke (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Alexis Derumigny – Mentor (TU Delft - Statistics)

G.F. Nane – Graduation committee member (TU Delft - Applied Probability)

Faculty

Electrical Engineering, Mathematics and Computer Science

Estimator, Stratified sampling, Population mean, Population variance, Unbiased

To reference this document use:

https://resolver.tudelft.nl/uuid:2094be5d-63cf-48bf-abb8-e3cda69436cd

Publication Year

2024

Language

English

Graduation Date

03-07-2024

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Dividing a population into subgroups and studying both the population and its subgroups poses a challenge. Such stratified sampling relies on knowing the share of each subgroup in the population. Sometimes the subgroup proportions in the sample do not equal the true proportions in the population. This can be corrected by using estimators that take the true proportions into account.
In this thesis, different estimators for the true population mean and variance are defined and examined in terms of bias and variance in the case of two subgroups. Weighting the measurements according to the true proportions yields unbiased estimators for both the mean and the variance. These unbiased estimators are compared with other, biased estimators, including naive ones that ignore the influence of the different subgroups. The naive estimators are not only biased; their variance is also of the same order as that of the unbiased ones. When the true proportions are not available, one can only guess them. A guess close to the true proportions leads to a smaller bias and therefore a better estimator. This underlines the importance of obtaining sufficient knowledge about the population.
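
As an illustration of the comparison described in the abstract, the following is a minimal sketch for the mean only, not the thesis's own code. It assumes two normally distributed subgroups, and every name and parameter value (`naive_mean`, `weighted_mean`, `simulate`, `p_a`, the subgroup means and sample sizes) is hypothetical. The sample deliberately contains the two subgroups in equal numbers while the assumed true share of subgroup A is 0.8.

```python
import random
from statistics import fmean


def naive_mean(sample_a, sample_b):
    """Pooled sample mean that ignores subgroup membership."""
    return fmean(list(sample_a) + list(sample_b))


def weighted_mean(sample_a, sample_b, p_a):
    """Subgroup means weighted by p_a, the assumed population share of subgroup A."""
    return p_a * fmean(sample_a) + (1.0 - p_a) * fmean(sample_b)


def simulate(p_a=0.8, n_a=50, n_b=50, mu_a=0.0, mu_b=10.0,
             sigma=1.0, reps=2000, seed=0):
    """Average both estimators over many samples (all parameters are illustrative)."""
    rng = random.Random(seed)
    naive, weighted = [], []
    for _ in range(reps):
        # The sample proportions (n_a : n_b) differ from the true share p_a.
        a = [rng.gauss(mu_a, sigma) for _ in range(n_a)]
        b = [rng.gauss(mu_b, sigma) for _ in range(n_b)]
        naive.append(naive_mean(a, b))
        weighted.append(weighted_mean(a, b, p_a))
    true_mean = p_a * mu_a + (1.0 - p_a) * mu_b
    return true_mean, fmean(naive), fmean(weighted)


if __name__ == "__main__":
    true_mean, naive_avg, weighted_avg = simulate()
    print(f"true mean: {true_mean:.3f}")
    print(f"naive estimator, averaged over samples: {naive_avg:.3f}")
    print(f"weighted estimator, averaged over samples: {weighted_avg:.3f}")
```

With these illustrative values the population mean is 2.0. The weighted estimator averages out near that value, while the naive one settles near 5.0, the midpoint of the two subgroup means. Passing a guessed share `p_guess` instead of `p_a` to `weighted_mean` gives an expected value that is off by `(p_guess - p_a) * (mu_a - mu_b)`, so in this sketch the bias shrinks as the guess approaches the true share.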

Files

BSc_Thesis_Lucas_Verbeeke.pdf

(pdf | 0.469 MB)

License info not available