MD
M.S. Damyanov
1 records found
1
Adaptive Feature Selection For Sparse Linear Bandits
Experimental study on strategies for Online Feature Selection in High-Dimensional Bandit Settings
The Multi-armed Bandit (MAB) is a classic problem in reinforcement learning that exemplifies the exploration-exploitation dilemma - deciding when to gather more information and when to act on current knowledge. In its sparse variant, the feature vectors often contain many irrelev
...