Yikun Ban 班义琨

I am a tenure-track associate professor in the School of Computer Science and Engineering at Beihang University and a member of the State Key Laboratory of Software Development Environment. Previously, I was a postdoc and obtained my Ph.D. degree at Computer Science, University of Illinois at Urbana-Champaign. Prior to this, I obtained my Master's degree from EECS, Peking University and bachelor's degree from Wuhan University.

I am interested in principled algorithms in the space of reinforcement learning and deep learning, to solve real-world sequential decision-making problems. Current research topics:

Reinforcement Learning with Human Feedback
Multi-Agent Reinforcement Learning
Ensemble Learning of LLMs

News

[2025.5] Welcome to check our survey! A Survey on LLM Ensemble.
[2025.5] NeurIPS 2025 Spotlight! Transformer Copilot: Introduces the new concept of a "Mistake Log" and a novel paradigm for LLM fine-tuning.
[2025.5] NeurIPS 2025! SamS: Proposes the new problem of dynamic sample scheduling in preference optimization for LLMs, together with an RL-based solution.
[2025.5] "LLM-Forest" is accepted by ACL 2025 Findings, a new prompt-based LLM Ensemble Learning approach.
[2024.12] "PageRank Bandit" is accepted by NeurIPS 2024, in which we first use bandit perspective to solve link prediction.
[2024.12] "Robust Neural Contextual Bandit" is accepted by NeurIPS 2024, in which we remove the Positive Definite assumption for NTK Matrix.

Preprint (* Equal Contribution, # Corresponding)

Harnessing Multiple Large Language Models: A Survey on LLM Ensemble

Zhijun Chen, Jingzheng Li, Pengpeng Chen, Zhuoran Li, Kai Sun, Yuankai Luo, Qianren Mao, Ming Li, Likang Xiao, Dingqi Yang, Yikun Ban, Hailong Sun, Philip S. Yu

[paper] [github]

Selected Publications (* Equal Contribution, # Corresponding)

Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

Jiaru Zou, Yikun Ban#, Zihao Li, Yunzhe Qi, Ruizhong Qiu, Ling Yang, Jingrui He#

Thirty-ninth Conference on Neural Information Processing Systems (NeurIPS'25, Spotlight)

[paper] [github] [小红书]

Adaptive Sample Scheduling for Direct Preference Optimization

Zixuan Huang, Yikun Ban#, Lean Fu, Xiaojie Li, Zhongxiang Dai, Jianxin Li, Deqing Wang#

Thirty-ninth Conference on Neural Information Processing Systems (NeurIPS'25)

[github] [paper]

LLM-Forest: Ensemble Learning of LLMs with Graph-Augmented Prompts for Data Imputation

Xinrui He, Yikun Ban#, Jiaru Zou, Tianxin Wei, Curtiss Cook, Jingrui He#

The 63rd Annual Meeting of the Association for Computational Linguistics, Findings (ACL'25)

[paper]

Adaptive Sampling-based Dynamic Graph Learning for Information Diffusion Prediction

Xiaodong Lu, Mingzhe Liu, Tongyu Zhu, Leilei Sun, Jibin Wang, Weifeng Lv, Yikun Ban, Deqing Wang

ACM Transactions on Information Systems (TOIS, 2025)

[paper]

GCL-OT: Graph Contrastive Learning with Optimal Transport for Heterophilic Text-Attributed Graphs

Yating Ren, Yikun Ban, Huobin Tan

AAAI Conference on Artificial Intelligence (AAAI'26)

[To Appear]

Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?

Zihao Li, Lecheng Zheng, Bowen Jin, Dongqi Fu, Baoyu Jing, Yikun Ban, Jingrui He, Jiawei Han

The 63rd Annual Meeting of the Association for Computational Linguistics, Main (ACL'25)

[paper]

Robust Neural Contextual Bandit against Adversarial Corruptions

Yunzhe Qi, Yikun Ban, Arindam Banerjee, Jingrui He

Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS'24)

[paper]

PageRank Bandits for Link Prediction

Yikun Ban*, Jiaru Zou*, Zihao Li, Yunzhe Qi, Dongqi Fu, Jian Kang, Hanghang Tong, Jingrui He

Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS'24)

[paper]

Meta Clustering of Neural Bandits

Yikun Ban*, Yunzhe Qi*, Tianxin Wei, Lihui Liu, Jingrui He

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'24)

[paper] [github]

Neural Exploitation and Exploration of Contextual Bandits

Yikun Ban, Yuchen Yan, Arindam Banerjee, Jingrui He

Journal of Machine Learning Research (JMLR, To Appear)

Neural Contextual Bandits for Personalized Recommendation

Yikun Ban, Yunzhe Qi, Jingrui He

The Web Conference, Tutorial (WWW'24)

[website] [paper]

Neural Active Learning Beyond Bandits

Yikun Ban, Ishika Agarwal, Ziwei Wu, Yada Zhu, Kommy Weldemariam, Hanghang Tong, Jingrui He

International Conference on Learning Representations (ICLR'24)

[paper] [github]

Contextual Bandits with Online Neural Regression

Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee

International Conference on Learning Representations (ICLR'24)

[paper]

Meta-Learning with Neural Bandit Scheduler

Yunzhe Qi*, Yikun Ban*, Tianxin Wei, Jiaru Zou, Huaxiu Yao, Jingrui He

Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS'23)

[paper]

Graph Neural Bandits

Yunzhe Qi*, Yikun Ban*, Jingrui He

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'23)

[paper] [github]

Improved Algorithms for Neural Active Learning

Yikun Ban*, Yuheng Zhang*, Hanghang Tong, Arindam Banerjee, Jingrui He

Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS'22)

[paper] [github]

DISCO: Comprehensive and Explainable Disinformation Detection

Dongqi Fu, Yikun Ban, Hanghang Tong, Ross Maciejewski, Jingrui He

ACM International Conference on Information and Knowledge Management (CIKM'22, Demo Track)

[paper]

Neural Bandit with Arm Group Graph

Yunzhe Qi, Yikun Ban, Jingrui He

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'22)

[paper]

EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits

Yikun Ban, Yuchen Yan, Arindam Banerjee, Jingrui He

International Conference on Learning Representations (ICLR'22, Spotlight)

[paper] [github]

Convolutional Neural Bandit for Visual-aware Recommendation

Yikun Ban, Jingrui He

Preprint: ArXiv:2107.07438

[paper] [github]

Multi-Facet Contextual Bandits: A Neural Network Perspective

Yikun Ban, Jingrui He, Curtiss B. Cook

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'21)

[paper] [slides] [github]

Local Clustering in Contextual Multi-Armed Bandits

Yikun Ban, Jingrui He

The Web Conference (WWW'21)

[paper] [slides] [github]

Dynamic Knowledge Graph Alignment

Yuchen Yan, Lihui Liu, Yikun Ban, Baoyu Jing, Hanghang Tong

AAAI Conference on Artificial Intelligence (AAAI'21)

[paper]

Generic Outlier Detection in Multi-Armed Bandit

Yikun Ban, Jingrui He

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'20)

[paper] [slides] [github]

No Place to Hide: Catching Fraudulent Entities in Tensors

Yikun Ban, Xin liu, Ling Huang, Yitao Duan, Xue Liu, Wei Xu

The Web Conference (WWW'19)

[paper] [poster]

Education

Aug 2019 – Jun 2024

Ph.D., Computer Science, Advised by Jingrui He and Hanghang Tong

University of Illinois at Urbana-Champaign, Illinois, US

Aug 2016 – Jul 2019

M.S., Computer Science

Peking University, Beijing, China

Aug 2012 – Jul 2016

B.S., School of Software Engineering

Wuhan University, Wuhan, China

Yikun Ban | 班义琨

Yikun Ban 班义琨

News

Preprint (* Equal Contribution, # Corresponding)

Selected Publications (* Equal Contribution, # Corresponding)

Education