Using Decision Trees produced by Generative Adversarial Imitation Learning to give insight into black box Reinforcement Learning models

Meijer, Caspar

Using Decision Trees produced by Generative Adversarial Imitation Learning to give insight into black box Reinforcement Learning models

Title

Using Decision Trees produced by Generative Adversarial Imitation Learning to give insight into black box Reinforcement Learning models

Author

Meijer, Caspar (TU Delft Electrical Engineering, Mathematics and Computer Science; TU Delft Algorithmics)

Contributor

Lukina, A. (mentor)
Murukannaiah, P.K. (graduation committee)

Degree granting institution

Delft University of Technology

Programme

Computer Science and Engineering

Project

CSE3000 Research Project

Date

2022-01-28

Abstract

Machine learning models are increasingly being used in fields that have a direct impact on the lives of humans. Often these machine learning models are black-box models and they lack transparency and trust which is holding back the implementation. To increase transparency and trust this research investigates whether imitation learning, specifically Generative Adversarial ImitationLearning (GAIL), can be used to give insights into the black-box models by extracting decision trees. To achieve this, an extension of GAIL was made allowing it to extract decision trees. The decision trees were then measured in terms of performance, fidelity, behavior, and interpretability in three different environments. We find that GAIL is able to extract decision trees with high fidelity and can give insightful information into the expert models. Moreover, further research can be done on more complex environments and black-box models, other surrogate models, and possibilities for more specific local insights.

Subject

Generative Adversarial Network
Generative Adversarial Imitation Learning
Reinforcement Learning (RL)
Imitation Learning
Interpretability
Explainable Reinforcement Learning
Python
Decision Trees
Markov Decision Process
Neural network
black box
Machine learning

To reference this document use:

http://resolver.tudelft.nl/uuid:dbc92b47-7fae-473a-ad53-e906ad7b2008

Part of collection

Student theses

Document type

bachelor thesis

Rights

Files

PDF

FINAL_FINAL_PAPER_CASPAR_MEIJER.pdf

985.67 KB

Close viewer