Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers

None, None; None, None; None, None; None, None

Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers

Conference Paper (2022)

Author(s)

Saikiran Bulusu (Syracuse University)

G. Joseph (TU Delft - Electrical Engineering, Mathematics and Computer Science)

M. Cenk Gursoy (Syracuse University)

Pramod K. Varshney (Syracuse University)

Research Group

Signal Processing Systems

To reference this document use

https://resolver.tudelft.nl/uuid:bec9f025-e74f-4854-879a-1062979a3e01

More Info

expand_more

Publication Year

2022

Language

English

Research Group

Signal Processing Systems

ISBN (electronic)

9781713871088

Event

36th Conference on Neural Information Processing Systems (2022-11-28 - 2022-12-09), Hybrid Conference, New Orleans, United States

Downloads counter

232

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

We consider a set of data samples such that a fraction of the samples are arbitrary outliers, and the rest are the output samples of a single-layer neural network with rectified linear unit (ReLU) activation. Our goal is to estimate the parameters (weight matrix and bias vector) of the neural network, assuming the bias vector to be non-negative. We estimate the network parameters using the gradient descent algorithm combined with either the median- or trimmed mean-based filters to mitigate the effect of the arbitrary outliers. We then prove that $\tilde{O}( \frac{1}{p^2}+\frac{1}{\epsilon^2p})$ samples and $\tilde{O} ( \frac{d^2}{p^2}+ \frac{d^2}{\epsilon^2p})$ time are sufficient for our algorithm to estimate the neural network parameters within an error of $\epsilon$ when the outlier probability is $1-p$, {where $2/3< p \leq 1$} and the problem dimension is $d$ (with log factors being ignored here). Our theoretical and simulation results provide insights into the training complexity of ReLU neural networks in terms of the probability of outliers and problem dimension.

Files

NeurIPS_2022_learning_distribu... (pdf)

(pdf | 0.513 Mb)

License info not available