Clustered K nearest neighbor algorithm for daily inflow forecasting

Journal article (2010)

Authors

M. Akbari

P.J.A.T.M. Van Overloop

A. Afshar

Department

Watermanagement () (TU Delft)

DOI: https://doi.org/doi:10.1007/s11269-010-9748-z

Inconsistent data Inflow forecasting K nearest neighbor Subtractive clustering Noisy data

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:5406516b-92f0-4f27-819d-1ccd94000908

Published Date

07-12-2010

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Source:

Water Resources Management, 25 (5), 2011

ISSN:

0920-4741

Source:

http://www.springerlink.com/content/f278883180480831/

Faculty

Civil Engineering and Geosciences

Department

Watermanagement

Abstract

Instance based learning (IBL) algorithms are a common choice among data driven algorithms for inflow forecasting. They are based on the similarity principle and prediction is made by the finite number of similar neighbors. In this sense, the similarity of a query instance is estimated according to the closeness of its feature vector with those of data available in calibration data. As the selected attributes in the feature vector are determined overall on calibration data, there may be some data points whose outputs do not follow the considered attributes. In fact, output values of these inconsistent data points may be a function of some other attributes which were not considered. Therefore, for some query instances, the inconsistent points may be appeared as the neighbors while they may not really be neighbor to the query instance. They can deteriorate forecasting results especially if they are very close to the query instance with the current similarity definition. In this study a clustered K nearest neighbor (CKNN) algorithm is introduced which can capture these inconsistent data points. Similar to the inconsistent data points, CKNN can be also robust against noisy data. The proposed algorithm was shown to be effective for a synthetic linear data set corrupted by noise. In addition, the utility of the algorithm was demonstrated for daily inflow forecasting of the Karoon1 reservoir located in Iran.

Files

Akbari.pdf

(pdf | 0.423 Mb)