OpenEval: Benchmarking Programming Agents for Open-Domain Tasks

imamnurby/open-eval

 
 

Benchmarking Programming Agents for Realistic Function Calling

WIP. Coming soon...

Languages

  • Jupyter Notebook 98.2%
  • Python 1.7%
  • Shell 0.1%