Searched for: subject%3A%22A%252A%255C%252Balgorithm%22
(1 - 10 of 10)
document
Dai, Pengcheng (author), Yu, Wenwu (author), Wang, He (author), Baldi, S. (author)
Actor-critic (AC) cooperative multiagent reinforcement learning (MARL) over directed graphs is studied in this article. The goal of the agents in MARL is to maximize the globally averaged return in a distributed way, i.e., each agent can only exchange information with its neighboring agents. AC methods proposed in the literature require the...
journal article 2023
document
Cremers, Sho (author)
With the emergence of energy communities, where a number of prosumers (consumers with their own energy generation) invest in shared renewable generation capacity and battery storage, the issue of fair allocation of benefits and costs has become increasingly important. The Shapley value, a solution concept in cooperative game theory initially...
master thesis 2022
document
Fabiani, Filippo (author), Tajeddini, Mohammad Amin (author), Kebriaei, Hamed (author), Grammatico, S. (author)
We propose a two-layer, semi-decentralized algorithm to compute a local solution to the Stackelberg equilibrium problem in aggregative games with coupling constraints. Specifically, we focus on a single-leader, multiple follower problem, and after equivalently recasting the Stackelberg game as a mathematical program with complementarity...
journal article 2022
document
Ananduta, W. (author), Grammatico, S. (author)
We formulate the optimal flow problem in a multi-area integrated electrical and gas system as a mixed-integer optimization problem by approximating the non-linear gas flows with piece-wise affine functions, thus resulting in a set of mixed-integer linear constraints. For its solution, we propose a novel algorithm that consists in one stage for...
conference paper 2022
document
Dutta, Shamak (author), Wilde, N. (author), Smith, Stephen L. (author)
We present a new mixed integer formulation for the discrete informative path planning problem in random fields. The objective is to compute a budget constrained path while collecting measurements whose linear estimate results in minimum error over a finite set of prediction locations. The problem is known to be NP-hard. However, we strive to...
conference paper 2022
document
te Wierik, Ruben (author)
Real-time sequence clustering is the problem of clustering an infinite stream of sequences in real time with limited memory. A variant of the k-medoids algorithm called <i>SeqClu </i>is the suggested approach, representing a cluster with <i>p </i>most representative sequences of the cluster, called prototypes, to solve the problem of maintaining...
bachelor thesis 2021
document
Franci, B. (author), Grammatico, S. (author)
We consider the stochastic generalized Nash equilibrium problem (SGNEP) with expected-value cost functions. Inspired by Yi and Pavel (2019), we propose a distributed generalized Nash equilibrium seeking algorithm based on the preconditioned forward-backward operator splitting for SGNEPs, where, at each iteration, the expected value of the...
journal article 2021
document
Delimpaltadakis, Giannis (author), Mazo, M. (author)
In this article, we propose a region-based self-triggered control (STC) scheme for nonlinear systems. The state space is partitioned into a finite number of regions, each of which is associated to a uniform interevent time. The controller, at each sampling time instant, checks to which region does the current state belong, and correspondingly...
journal article 2021
document
Munk, J. (author), Kober, J. (author), Babuska, R. (author)
Deep Neural Networks (DNNs) can be used as function approximators in Reinforcement Learning (RL). One advantage of DNNs is that they can cope with large input dimensions. Instead of relying on feature engineering to lower the input dimension, DNNs can extract the features from raw observations. The drawback of this end-to-end learning is that it...
conference paper 2016
document
Byrka, Jaroslaw (author), Li, S. (author), Rybicki, Bartosz (author)
We study the k-level uncapacitated facility location problem (k-level UFL) in which clients need to be connected with paths crossing open facilities of k types (levels). In this paper we first propose an approximation algorithm that for any constant k, in polynomial time, delivers solutions of cost at most α<sub>k</sub> times OPT, where α<sub...
journal article 2014
Searched for: subject%3A%22A%252A%255C%252Balgorithm%22
(1 - 10 of 10)