This page contains information about my actual and old research projects.
I started my PhD in September 2020. I am currently working on Best Arm Identification, one important topic of bandit problems.
The context of bandit problems is the following: consider $K$ distincts probability distributions $\nu_1, \dots, \nu_K$. Those distributions are unknown but at each step you are able to select an arm $1 \leq k \leq K$ and obtain the value of an independent realization of $\nu_k$. You can define the strategy you want (that is to say choose the next arm to observe by using all the previous observations).
There are several mathematical objectives. For instance, in Best Arm Identification, the goal is to identify the best arm, which is the arm with highest associated expectation. There are two settings:
I am working on both settings. For more information about bandit problems the book of Tor Lattimore and Csaba Szepesvári is a good introduction.