Bo Xue (薛波)

alt text 

Ph.D. Student
City University of Hong Kong
Hong Kong, China

Supervisor: Prof. Qingfu Zhang
E-mail: boxue4-c@my.cityu.edu.hk

About Me

Hello! I am Bo Xue (薛波), a final-year PhD student in Department of Computer Science at the City University of Hong Kong. I am very fortunate to be advised by Prof. Qingfu Zhang.

I got my M.S. degree from Department of Computer Science and Technology in June 2021 from Nanjing University, where I was very fortunate to be advised by Prof. Lijun Zhang. I was also a member of LAMDA group, led by Prof. Zhi-Hua Zhou.

I got my B.S. degree from School of Mathematics in June 2018 from Nanjing University. In the same year, I was admitted to study for a M.S. degree in Nanjing University without entrance examination.

I am interested in bandits, stochastic optimization and multi-objective optimization.

Selected Papers (Full publications)

  1. Beyond the Lower Bound: Bridging Regret Minimization and Best Arm Identification in Lexicographic Bandits [PDF]
    Bo Xue, Yuanyu Wan, Zhichao Lu and Qingfu Zhang
    In Proceedings of the 40th AAAI Conference on Artificial Intelligence (AAAI 2026, Oral), pages: to appear, 2026.

  2. Lexicographic Lipschitz Bandits: New Algorithms and a Lower Bound [PDF]
    Bo Xue, Ji Cheng, Fei Liu, Yimu Wang, Lijun Zhang and Qingfu Zhang
    Journal of Machine Learning Research (JMLR), pages: to appear, 2025.

  3. Multi-objective Linear Reinforcement Learning with Lexicographic Rewards [PDF]
    Bo Xue, Dake Bu, Ji Cheng, Yuanyu Wan and Qingfu Zhang
    In Proceedings of the 42nd International Conference on Machine Learning (ICML 2025), pages: to appear, 2025.

  4. Problem-dependent Regret for Lexicographic Multi-Armed Bandits with Adversarial Corruptions [PDF]
    Bo Xue, Xi Lin, Yuanyu Wan and Qingfu Zhang
    In Proceedings of the 34th International Joint Conference on Artificial Intelligence (IJCAI 2025), pages 6776 - 6784, 2025.

  5. Multiple Trade-offs: An Improved Approach for Lexicographic Linear Bandits [PDF]
    Bo Xue, Xi Lin, Xiaoyuan Zhang and Qingfu Zhang
    In Proceedings of the 39th AAAI Conference on Artificial Intelligence (AAAI 2025, Oral), pages 21850 - 21858, 2025.

  6. Multiobjective Lipschitz Bandits under Lexicographic Ordering [PDF]
    Bo Xue, Ji Cheng, Fei Liu, Yimu Wang, and Qingfu Zhang
    In Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), pages 16238 - 16246, 2024.

  7. Efficient Algorithms for Generalized Linear Bandits with Heavy-tailed Rewards [PDF]
    Bo Xue, Yimu Wang, Yuanyu Wan, Jinfeng Yi, and Lijun Zhang
    In Advances in Neural Information Processing Systems 36 (NeurIPS 2023), pages 70880 - 70891, 2023.

  8. Nearly Optimal Regret for Stochastic Linear Bandits with Heavy-Tailed Payoffs [PDF]
    Bo Xue, Guanghui Wang, Yimu Wang, and Lijun Zhang
    In Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI 2020), pages 2936 - 2942, 2020.

Academic Service

Reviewer for Conferences: NeurIPS, ICML, ICLR, AAAI, IJCAI, AISTATS, WWW.

Reviewer for Journals: EAAI, AI.