Shi Yuan Tang

Conference paper (1)

Journal article (1)

2 records found

Teacher-apprentices RL (TARL)

Leveraging complex policy distribution through generative adversarial hypernetwork in reinforcement learning

Journal article (2023) - Shi Yuan Tang (author) , Athirai A. Irissappane (author) , F.A. Oliehoek (author) , Jie Zhang (author)

Typically, a Reinforcement Learning (RL) algorithm focuses in learning a single deployable policy as the end product. Depending on the initialization methods and seed randomization, learning a single policy could possibly leads to convergence to different local optima across diff ...

Learning Complex Policy Distribution with CEM Guided Adversarial Hypernetwork

Conference paper (2021) - Shi Yuan Tang (author) , F.A. Oliehoek (author) , Athirai A. Irissappane (author) , Jie Zhang (author)

Cross-Entropy Method (CEM) is a gradient-free direct policy search method, which has greater stability and is insensitive to hyperparameter tuning. CEM bears similarity to population-based evolutionary methods, but, rather than using a population it uses a distribution over candi ...