Yingqian Zhang | TU Delft Repository

Deep multi-objective reinforcement learning for utility-based infrastructural maintenance optimization

Journal article (2025) - Jesse van Remmerden (author), Maurice Kenter (author), Diederik M. Roijers (author), C. Andriotis (author), Yingqian Zhang (author), Zaharah Bukhsh (author)

In this paper, we introduce multi-objective deep centralized multi-agent actor-critic (MO-DCMAC), a multi-objective reinforcement learning method for infrastructural maintenance optimization, an area traditionally dominated by single-objective reinforcement learning (RL) approach ...

The first AI4TSP competition

Learning to solve stochastic routing problems

Journal article (2023) - Yingqian Zhang (author), Laurens Bliek (author), Paulo da Costa (author), Reza Refaei Afshar (author), Robbert Reijnen (author), T. Catshoek (author), D.A. Vos (author), SE Verwer (author), Fynn Schmitt-Ulms (author), and more authors

This paper reports on the first international competition on AI for the traveling salesman problem (TSP) at the International Joint Conference on Artificial Intelligence 2021 (IJCAI-21). The TSP is one of the classical combinatorial optimization problems, with many variants inspi ...

Data driven design for online industrial auctions

Journal article (2021) - Qing Chuan Ye (author), Jason S. Rhuggenaath (author), Yingqian Zhang (author), S.E. Verwer (author), Michiel Jurgen Hilgeman (author)

Designing auction parameters for online industrial auctions is a complex problem due to highly heterogeneous items. Currently, online auctioneers rely heavily on their experts in auction design. The ability of predicting how well an auction will perform prior to the start comes i ...

Solving bin-packing problems under privacy preservation

Possibilities and trade-offs

Journal article (2019) - Rowan Hoogervorst (author), Yingqian Zhang (author), Gamze Tillem (author), Z. Erkin (author), Sicco Verwer (author)

We investigate the trade-off between privacy and solution quality that occurs when a k-anonymized database is used as input to the bin-packing optimization problem. To investigate the impact of the chosen anonymization method on this trade-off, we consider two recoding methods fo ...

Learning optimal classification trees using a binary linear program formulation

Conference paper (2019) - SE Verwer (author), Yingqian Zhang (author)

Learning fuzzy decision trees using integer programming

Conference paper (2018) - Jason S. Rhuggenaath (author), Yingqian Zhang (author), Alp Akcay (author), U Kaymak (author), S.E. Verwer (author)

A popular method in machine learning for supervised classification is a decision tree. In this work we propose a new framework to learn fuzzy decision trees using mathematical programming. More specifically, we encode the problem of constructing fuzzy decision trees using a Mixe ...

Learning Decision Trees with Flexible Constraints and Objectives Using Integer Optimization

Conference paper (2017) - SE Verwer (author), Yingqian Zhang (author)

We encode the problem of learning the optimal decision tree of a given depth as an integer optimization problem. We show experimentally that our method (DTIP) can be used to learn good trees up to depth 5 from data sets of size up to 1000. In addition to being efficient, our new ...

Auction optimization using regression trees and linear models as integer programs

Journal article (2017) - Yingqian Zhang (author), S.E. Verwer (author), Qing Chuan Ye (author)

In a sequential auction with multiple bidding agents, the problem of determining the ordering of the items to sell in order to maximize the expected revenue is highly challenging. The challenge is largely due to the fact that the autonomy and private information of the agents hea ...