Deep learning in standard least-squares theory of linear models
Perspective, development and vision
A.R. Amiri-Simkooei (TU Delft - Aircraft Noise and Climate Effects)
Christian C.J.M. Tiberius (TU Delft - Mathematical Geodesy and Positioning)
R.C. Lindenbergh (TU Delft - Optical and Laser Remote Sensing)
Abstract
Inspired by the attractive features of least-squares theory in many practical applications, this contribution introduces least-squares-based deep learning (LSBDL). Least-squares theory connects explanatory variables to predicted variables, called observations, through a linear(ized) model whose unknown parameters are estimated using the principle of least squares. Deep learning (DL) methods, in contrast, establish nonlinear relationships for applications in which the predicted variables are unknown (nonlinear) functions of the explanatory variables. This contribution presents a DL formulation based on least-squares theory in linear models. As a data-driven method, LSBDL trains a network to construct an appropriate design matrix whose entries are estimated using two descent optimization methods: steepest descent and Gauss–Newton. In conjunction with interpretable and explainable artificial intelligence, LSBDL leverages well-established least-squares theory for DL applications through the following three-fold objectives: (i) quality-control measures, such as the covariance matrix of the predicted outcome, can be determined directly; (ii) existing least-squares reliability theory and hypothesis testing can be employed to identify model misspecification and outlying observations; (iii) the covariance matrix of the observations can be exploited to train a network with inconsistent, heterogeneous and statistically correlated data. Three examples are presented to demonstrate the theory. The first example uses LSBDL to train coordinate basis functions for a surface fitting problem. The second example applies LSBDL to time series forecasting. The third example showcases a real-world application of LSBDL to downscaling groundwater storage anomaly data. LSBDL offers opportunities in many fields, including geoscience, aviation, time series analysis, data assimilation and multi-sensor data fusion.
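For reference, objective (i) builds on the standard weighted least-squares machinery for the linear model of observation equations. The following is a minimal sketch in common geodetic notation; the symbols (design matrix A, observation vector y, covariance matrix Q_yy) are assumed here and need not match the paper's own notation:

\begin{align*}
  \mathrm{E}\{y\} &= A\,x, \qquad \mathrm{D}\{y\} = Q_{yy}, \\
  \hat{x} &= \left(A^{\mathsf{T}} Q_{yy}^{-1} A\right)^{-1} A^{\mathsf{T}} Q_{yy}^{-1}\, y, \qquad
  Q_{\hat{x}\hat{x}} = \left(A^{\mathsf{T}} Q_{yy}^{-1} A\right)^{-1}, \\
  \hat{y} &= A\,\hat{x}, \qquad Q_{\hat{y}\hat{y}} = A\, Q_{\hat{x}\hat{x}}\, A^{\mathsf{T}}.
\end{align*}

Since, as stated in the abstract, the design matrix in LSBDL is constructed by a trained network, these standard propagation laws can be applied to obtain quality measures, such as the covariance matrix of the predicted outcome, directly.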