Aurélien Garivier
professional web page
Reinforcement Learning and Self-Normalized Deviation Bounds
Context:
Numec Seminar, USP, São Paulo
Slides:
saopaulo1010RL.pdf
Date:
October 2010
Keywords:
Bandit Problems
Self-Normalized
UCB