Skip to content

BlankCheng/wildcodebench-annotation

 
 

Repository files navigation

Benchmarking Programming Agents for Realistic Function Calling

WIP. Coming soon...

About

A Rigorous Benchmark for Code Generation with Realistic Constraints in the Wild

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 90.8%
  • Jupyter Notebook 9.2%