Discussion about this post

User's avatar
Hao Hoang's avatar

📚 Related Papers:

- Improved Best-of-Both-Worlds Regret for Bandits with Delayed Feedback. Available at: https://arxiv.org/abs/2505.24193

- Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits. Available at: https://arxiv.org/abs/2509.15073

- Statistical Inference on Multi-armed Bandits with Delayed Feedback. https://arxiv.org/abs/2307.00752

- Delayed Feedback in Kernel Bandits. Available at: https://arxiv.org/abs/2302.00392

No posts

Ready for more?