Yikun Ban

Associate Professor
School of Computer Science and Engineering
Beihang University

I am a tenure-track associate professor in the School of Computer Science and Engineering at Beihang University and a member of the State Key Laboratory of Software Development Environment. Previously, I was a postdoc and obtained my Ph.D. degree at Computer Science, University of Illinois at Urbana-Champaign, where I was advised by Prof. Jingrui He, Prof. Hanghang Tong, and Prof. Arindam Banerjee. Prior to this, I obtained my Master's degree from EECS, Peking University and bachelor's degree from Wuhan University.
I am interested in principled algorithms in the space of reinforcement learning and deep learning, to solve real-world sequential decision-making problems. Current research topics:
  • Reinforcement Learning with Human Feedback
  • Multi-Agent Reinforcement Learning
  • Ensemble Learning of LLMs

yikunb[at]buaa.edu.cn, yikunb2[at]illinois.edu
Google Scholar

News
  • 2025.5   Welcome to check our survey! A Survey on LLM Ensemble.
  • 2025.5   NeurIPS 2025 Spotlight! Transformer Copilot: Introduces the new concept of a “Mistake Log” and a novel paradigm for LLM fine-tuning.
  • 2025.5   NeurIPS 2025! SamS: Proposes the new problem of dynamic sample scheduling in preference optimization for LLMs, together with an RL-based solution.
  • 2025.5   "LLM-Forest" is accepted by ACL 2025 Findings, a new prompt-based LLM Ensemble Learning approach.
  • 2024.12   "PageRank Bandit" is accepted by NeurIPS 2024, in which we first use bandit perspective to solve link prediction.
  • 2024.12   "Robust Neural Contextual Bandit" is accepted by NeurIPS 2024, in which we remove the Positive Definite assumption for NTK Matrix.

Preprint(*Equal Contribution, #Corresponding)

  1. Zhijun Chen, Jingzheng Li, Pengpeng Chen, Zhuoran Li, Kai Sun, Yuankai Luo, Qianren Mao, Ming Li, Likang Xiao, Dingqi Yang, Yikun Ban, Hailong Sun, Philip S. Yu
    Harnessing Multiple Large Language Models: A Survey on LLM Ensemble

Publications(*Equal Contribution, #Corresponding)

  1. Jiaru Zou, Yikun Ban#, Zihao Li, Yunzhe Qi, Ruizhong Qiu, Ling Yang, Jingrui He#
    Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

    Conference on Neural Information Processing Systems ( NeurIPS'25, Spotlight)
  2. Zixuan Huang, Yikun Ban#, Lean Fu, Xiaojie Li, Zhongxiang Dai, Jianxin Li, Deqing Wang#
    Adaptive Sample Scheduling for Direct Preference Optimization

    Conference on Neural Information Processing Systems ( NeurIPS'25)
  3. Xiaodong Lu, Mingzhe Liu, Tongyu Zhu, Leilei Sun, Jibin Wang, Weifeng Lv, Yikun Ban, Deqing Wang
    Adaptive Sampling-based Dynamic Graph Learning for Information Diffusion Prediction
    ACM Transactions on Information Systems (2025) ( TOIS )
  4. Xinrui He, Yikun Ban#, Jiaru Zou, Tianxin Wei, Curtiss Cook, Jingrui He#
    LLM-Forest: Ensemble Learning of LLMs with Graph-Augmented Prompts for Data Imputation
    The 63rd Annual Meeting of the Association for Computational Linguistics, Findings ( ACL'25 )
  5. Zihao Li, Lecheng Zheng, Bowen Jin, Dongqi Fu, Baoyu Jing, Yikun Ban, Jingrui He, Jiawei Han
    Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?
    The 63rd Annual Meeting of the Association for Computational Linguistics, Main( ACL'25 )
  6. Yunzhe Qi, Yikun Ban, Arindam Banerjee, Jingrui He
    Robust Neural Contextual Bandit against Adversarial Corruptions
    Thirty-eighth Conference on Neural Information Processing Systems ( NeurIPS'24 )
  7. Yikun Ban*, Jiaru Zou*, Zihao Li, Yunzhe Qi, Dongqi Fu, Jian Kang, Hanghang Tong, Jingrui He
    PageRank Bandits for Link Prediction
    Thirty-eighth Conference on Neural Information Processing Systems ( NeurIPS'24 )
  8. Yikun Ban*, Yunzhe Qi*, Tianxin Wei, Lihui Liu, Jingrui He
    Meta Clustering of Neural Bandits
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining ( KDD'24 )
  9. Yikun Ban, Yuchen Yan, Arindam Banerjee, Jingrui He
    Neural Exploitation and Exploration of Contextual Bandits
    Accepted by JMLR
    To Appear
  10. Yikun Ban, Yunzhe Qi, Jingrui He
    Neural Contextual Bandits for Personalized Recommendation
    The Web Conference, Tutorial ( WWW'24)
    [ ]   []
  11. Yikun Ban, Ishika Agarwal, Ziwei Wu, Yada Zhu, Kommy Weldemariam, Hanghang Tong, and Jingrui He
    Neural Active Learning Beyond Bandits
    International Conference on Learning Representations ( ICLR'24 )
  12. Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee
    Contextual Bandits with Online Neural Regression
    International Conference on Learning Representations ( ICLR'24 )
  13. Yunzhe Qi*, Yikun Ban*, Tianxin Wei, Jiaru Zou, Huaxiu Yao, and Jingrui He
    Meta-Learning with Neural Bandit Scheduler
    Thirty-seventh Conference on Neural Information Processing Systems ( NeurIPS'23 )
  14. Yunzhe Qi*, Yikun Ban*, and Jingrui He
    Graph Neural Bandits
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining ( KDD'23 )
  15. Yikun Ban*, Yuheng Zhang*, Hanghang Tong, Arindam Banerjee, and Jingrui He
    Improved Algorithms for Neural Active Learning
    Thirty-sixth Conference on Neural Information Processing Systems ( NeurIPS'22 )
  16. Dongqi Fu, Yikun Ban, Hanghang Tong, Ross Maciejewski, and Jingrui He
    DISCO: Comprehensive and Explainable Disinformation Detection
    ACM International Conference on Information and Knowledge Management (Demo Track) ( CIKM'22 )
  17. Yunzhe Qi, Yikun Ban, and Jingrui He
    Neural Bandit with Arm Group Graph
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining ( KDD'22 )
  18. Yikun Ban, Yuchen Yan, Arindam Banerjee, and Jingrui He
    EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
    International Conference on Learning Representations ( ICLR'22, Spotlight)
  19. Yikun Ban and Jingrui He
    Convolutional Neural Bandit for Visual-aware Recommendation
    Preprint: ArXiv:2107.07438
  20. Yikun Ban, Jingrui He, and Curtiss B. Cook
    Multi-Facet Contextual Bandits: A Neural Network Perspective
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining ( KDD'21 )
  21. Yikun Ban and Jingrui He
    Local Clustering in Contextual Multi-Armed Bandits
    The Web Conference ( WWW'21 )
  22. Yuchen Yan, Lihui Liu, Yikun Ban, Baoyu Jing, and Hanghang Tong
    Dynamic Knowledge Graph Alignment
    AAAI Conference on Artificial Intelligence ( AAAI'21 )
  23. Yikun Ban and Jingrui He
    Generic Outlier Detection in Multi-Armed Bandit
    ACM SIGKDD Conference on Knowledge Discovery and Data Mining ( KDD'20 )
  24. Yikun Ban, Xin liu, Ling Huang, Yitao Duan, Xue Liu, and Wei Xu
    No Place to Hide: Catching Fraudulent Entities in Tensors
    The Web Conference ( WWW ’19 )

Education

    2019 - 2024
    Ph.D., Computer Science, University of Illinois at Urbana-Champaign, Illinois, US
    2016 - 2019
    M.S., Computer Science, Peking University, Beijing, China
    2012 - 2016
    B.S., School of Software Engineering, Wuhan University, Wuhan, China