End-to-End learning of decision trees and forests

Journal Article (2019)

Author(s)

Thomas M. Hehn (TU Delft - Intelligent Vehicles)

Julian F.P. Kooij (TU Delft - Intelligent Vehicles)

Fred A. Hamprecht (University of Heidelberg)

Research Group

Intelligent Vehicles

DOI related publication

https://doi.org/10.1007/s11263-019-01237-6

Keywords

Interpretability, Decision forests, Efficient inference, End-to-end learning

To reference this document use:

https://resolver.tudelft.nl/uuid:b8b380f9-c5ac-4112-9206-d39bb60f9483

Publication Year

2019

Language

English

Volume number

128 (2020)

Pages (from-to)

997-1011

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Conventional decision trees have a number of favorable properties, including a small computational footprint, interpretability, and the ability to learn from little training data. However, they lack a key quality that has helped fuel the deep learning revolution: that of being end-to-end trainable. Kontschieder et al. (ICCV, 2015) have addressed this deficit, but at the cost of losing a main attractive trait of decision trees: the fact that each sample is routed along only a small subset of tree nodes. Here we present an end-to-end learning scheme for deterministic decision trees and decision forests. Thanks to a new model and expectation–maximization training scheme, the trees are fully probabilistic at train time, but after an annealing process become deterministic at test time. In experiments we explore the effect of annealing visually and quantitatively, and find that our method performs on par with or better than standard learning algorithms for oblique decision trees and forests. We further demonstrate on image datasets that our approach can learn more complex split functions than common oblique ones, and that it facilitates interpretability through spatial regularization.
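
The idea of a split that is probabilistic during training and deterministic at test time can be illustrated with a minimal sketch. It assumes an oblique split parameterized by a weight vector `w` and bias `b`, a sigmoid routing function, and a steepness parameter `gamma` that is increased during annealing. None of these names or choices are given in the abstract; they are hypothetical and serve only to illustrate the concept, not to reproduce the authors' implementation.

```python
import numpy as np


def route_right_probability(x, w, b, gamma):
    """Soft routing: probability of sending sample x to the right child.

    Assumption: a sigmoid over the oblique split value, scaled by a
    steepness parameter gamma (hypothetical name).
    """
    z = gamma * (np.dot(w, x) + b)
    return 1.0 / (1.0 + np.exp(-z))


def route_deterministic(x, w, b):
    """Hard routing at test time: go right iff the split value is positive."""
    return bool(np.dot(w, x) + b > 0)


# Illustrative values only (not taken from the paper).
x = np.array([0.5, -1.0])
w = np.array([1.0, 0.5])
b = 0.1

# As gamma grows, the soft decision approaches the hard decision.
probs = [route_right_probability(x, w, b, g) for g in (1.0, 10.0, 100.0)]
hard = route_deterministic(x, w, b)
```

With increasing `gamma`, the routing probability for this sample moves from close to 0.5 toward 1, matching the hard decision, so at test time only one child per split needs to be evaluated.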
