cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents
•
6
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).