Pull requests: bigcode-project/bigcode-evaluation-harness
Fix UnboundLocalError in APPS task evaluation
#310 opened Apr 20, 2025 by sad-mathematician
Updated bigcode-evaluation-harness/leaderboard/README.md
#305 opened Mar 13, 2025 by zoya-hammad
use tokenizer.chat_template by default for instruction type tasks
#301 opened Jan 25, 2025 by TK-21st
[Pytest] Fix bad import to use relative instead @ module_test
#298 opened Jan 11, 2025 by ggcr
Fix the bugs in the ds1000 sample bash script; Fix typos
#295 opened Dec 10, 2024 by gameofby
Support multiple datasets from MBPP; Fix missing commas in python list; Fix doc typos;
#291 opened Dec 4, 2024 by gameofby
Add a new benchmark ENAMEL for evaluating the efficiency of LLM-generated code
#260 opened Jul 22, 2024 by q-rz
remove pad tokens added by the accelerator.pad_across_processes
#216 opened Apr 13, 2024 by IQ17
fix apps evaluate error: local variable 'level' referenced before assignment
#206 opened Mar 10, 2024 by koking0