KABB: Knowledge-Aware Bayesian Bandits for Multi-Agent Coordination

Official code for the ICML 2025 paper “KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems”.

[ English | 中文 ]

Introduction

KABB (Knowledge-Aware Bayesian Bandits) is a dynamic expert coordination framework for multi-agent systems, featuring:

Knowledge Distance Model: Semantic matching between experts and tasks.
Dual Adaptation Mechanism: Continuous optimization of expert representation and selection.
Knowledge-Aware Thompson Sampling: Efficient expert selection in a Bayesian multi-armed bandit (MAB) guided by knowledge distance.

See the paper for theoretical details and full experiments.
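
To make the Thompson sampling component more concrete, here is a minimal Python sketch of sampling over experts with a distance penalty. It is an illustration only, not the paper's formulation: the Beta posteriors, the exponential penalty `exp(-lam * distance)`, and all names (`Expert`, `select_expert`, `update`, `lam`) are assumptions introduced here.

```python
import math
import random
from dataclasses import dataclass


@dataclass
class Expert:
    name: str
    alpha: float = 1.0  # Beta posterior: pseudo-count of successes
    beta: float = 1.0   # Beta posterior: pseudo-count of failures


def select_expert(experts, distances, lam=1.0, rng=random):
    """Pick the expert with the highest distance-penalized Thompson sample.

    distances maps expert name -> non-negative knowledge distance to the task
    (hypothetical input; the paper defines how the distance is computed).
    """
    def score(expert):
        sample = rng.betavariate(expert.alpha, expert.beta)  # posterior draw
        return sample * math.exp(-lam * distances[expert.name])  # assumed penalty form
    return max(experts, key=score)


def update(expert, success):
    """Standard Beta-Bernoulli posterior update after observing an outcome."""
    if success:
        expert.alpha += 1.0
    else:
        expert.beta += 1.0
```

Experts that are far from the task in knowledge space are down-weighted before the argmax, so they are chosen only when their sampled success rate is high enough to compensate.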

Installation

Clone the repo:

git clone https://github.com/your_org/KABB.git
cd KABB

(Optional) Create a virtual environment:

python3 -m venv .venv && source .venv/bin/activate

Install dependencies:
```
pip install -r requirements.txt
```

Configure environment variables:

cp .env.example .env
# Edit .env and fill in your API keys

Quick Start

Run a sample math task with KABB:

python scripts/run_kabb.py \
  --config configs/config_math_template.yaml \
  --question "What is the value of (7/8)^3 * (7/8)^-3?"

Configuration

configs/config_math_template.yaml provides a minimal working config, including:

system_prompts: System prompts for different scenarios
domain_inference_settings: Domain priors, samples, and key symbols
experts_pool: List of experts (model, temperature, max tokens, etc.)
llm: API key placeholder (use environment variable)
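
As a rough illustration of that layout, the Python sketch below checks that a loaded config contains the four top-level sections listed above. The section names come from this README; the nested values in `example_config` and the helper name `missing_sections` are placeholders invented here, not the template's actual contents.

```python
REQUIRED_SECTIONS = (
    "system_prompts",
    "domain_inference_settings",
    "experts_pool",
    "llm",
)


def missing_sections(config):
    """Return the required top-level sections absent from a config mapping."""
    return [name for name in REQUIRED_SECTIONS if name not in config]


# Placeholder values only; see configs/config_math_template.yaml for the real schema.
example_config = {
    "system_prompts": {},
    "domain_inference_settings": {},
    "experts_pool": [],
}

print(missing_sections(example_config))  # ['llm']
```

If PyYAML is available in your environment, the mapping could come from `yaml.safe_load` applied to the config file instead of a hand-written dict.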

To extend to other domains:

Add a new domain entry with prior and typical_samples
Prepare a set of expert models for the domain
Pass the new config file to the script via --config
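
Adding a domain entry could look like the following Python sketch, which works on an in-memory config. Only the key names `prior` and `typical_samples` come from this README; placing domains directly under `domain_inference_settings`, treating `prior` as a probability, and the helper `add_domain` are all assumptions, so adapt them to the template's actual structure.

```python
import copy


def add_domain(config, name, prior, typical_samples):
    """Return a copy of config with a new domain entry (assumed layout)."""
    if not 0.0 <= prior <= 1.0:
        raise ValueError("prior is assumed to be a probability in [0, 1]")
    updated = copy.deepcopy(config)  # leave the caller's config untouched
    domains = updated.setdefault("domain_inference_settings", {})
    if name in domains:
        raise KeyError(f"domain {name!r} already exists")
    domains[name] = {"prior": prior, "typical_samples": list(typical_samples)}
    return updated


base = {"domain_inference_settings": {"math": {"prior": 0.5, "typical_samples": []}}}
extended = add_domain(base, "chemistry", 0.2, ["Balance the equation H2 + O2 -> H2O."])
print(sorted(extended["domain_inference_settings"]))  # ['chemistry', 'math']
```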

Reproducibility

Reproduce main results with:

python scripts/run_kabb.py --config configs/config_math_template.yaml --question "..."

For full benchmarks, see the run_*.py scripts and refer to the paper appendix.
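
For running several questions in one go, a small Python wrapper around the command above might help. This sketch is not part of the repository: `build_command` and `run_batch` are hypothetical helpers, and they rely only on the `--config` and `--question` flags shown in this README.

```python
import subprocess
import sys


def build_command(config_path, question):
    """Assemble the argument list for one run_kabb.py invocation."""
    return [
        sys.executable, "scripts/run_kabb.py",
        "--config", config_path,
        "--question", question,
    ]


def run_batch(config_path, questions):
    """Run each question in sequence from the repository root; stop on the first failure."""
    for question in questions:
        subprocess.run(build_command(config_path, question), check=True)
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when a question contains characters such as `*` or `^`.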

Citation

If you use this project, please cite:

@misc{zhang2025kabbknowledgeawarebayesianbandits,
  title={KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems},
  author={Jusheng Zhang and Zimeng Huang and Yijia Fan and Ningyuan Liu and Mingyan Li and Zhuojie Yang and Jiawei Yao and Jian Wang and Keze Wang},
  year={2025},
  eprint={2502.07350},
  archivePrefix={arXiv},
  primaryClass={cs.AI},
  url={https://arxiv.org/abs/2502.07350},
}

Contributing

Fork the repository and create a new branch for your changes
Link related issues in your pull requests

License

MIT. See LICENSE.
