• xmlui.mirage2.page-structure.header.title
    • français
    • English
  • Help
  • Login
  • Language 
    • Français
    • English
View Item 
  •   BIRD Home
  • LAMSADE (UMR CNRS 7243)
  • LAMSADE : Publications
  • View Item
  •   BIRD Home
  • LAMSADE (UMR CNRS 7243)
  • LAMSADE : Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse

BIRDResearch centres & CollectionsBy Issue DateAuthorsTitlesTypeThis CollectionBy Issue DateAuthorsTitlesType

My Account

LoginRegister

Statistics

Most Popular ItemsStatistics by CountryMost Popular Authors
Thumbnail

Best Arm Identification in Graphical Bilinear Bandits

View/Open
rizk21a.pdf (1.055Mb)
Type
Communication / Conférence
Date
2021
Conference date
2021
Conference country
UNITED STATES
Publisher
Proceedings of the 38th International Conference on Machine Learning
Pages
139:9010-9019
Metadata
Show full item record
Author(s)
Rizk, Geovani
Thomas , A.
Colin, Igor
Laraki, Rida cc
Chevaleyre, Yann
Abstract (EN)
We introduce a new graphical bilinear bandit problem where a learner (or a \emph{central entity}) allocates arms to the nodes of a graph and observes for each edge a noisy bilinear reward representing the interaction between the two end nodes. We study the best arm identification problem in which the learner wants to find the graph allocation maximizing the sum of the bilinear rewards. By efficiently exploiting the geometry of this bandit problem, we propose a \emph{decentralized} allocation strategy based on random sampling with theoretical guarantees. In particular, we characterize the influence of the graph structure (e.g. star, complete or circle) on the convergence rate and propose empirical experiments that confirm this dependency.
Subjects / Keywords
graphical bilinear bandit

Related items

Showing items related by title and author.

  • Thumbnail
    On Averaging the Best Samples in Evolutionary Computation 
    Meunier, Laurent; Chevaleyre, Yann; Rapin, J.; Royer, Clément; Teytaud, O. (2020) Communication / Conférence
  • Thumbnail
    On Averaging the Best Samples in Evolutionary Computation 
    Meunier, Laurent; Chevaleyre, Yann; Rapin, J.; Royer, Clément; Teytaud, O. (2020) Communication / Conférence
  • Thumbnail
    NGO-GM: Natural Gradient Optimization for Graphical Models 
    Benhamou, Éric; Atif, Jamal; Laraki, Rida; Saltiel, David (2020) Document de travail / Working paper
  • Thumbnail
    Identification de dynamique pour les systèmes bilinéaires et non-linéaires en présence d'incertitudes 
    Fu, Ying (2016-12-09) Thèse
  • Thumbnail
    On the Existence of Approximate Equilibria and Sharing Rule Solutions in Discontinuous Games 
    Bich, Philippe; Laraki, Rida (2017) Article accepté pour publication ou publié
Dauphine PSL Bibliothèque logo
Place du Maréchal de Lattre de Tassigny 75775 Paris Cedex 16
Phone: 01 44 05 40 94
Contact
Dauphine PSL logoEQUIS logoCreative Commons logo