- Notifications
You must be signed in to change notification settings - Fork 57
Pull requests: bigcode-project/bigcodebench
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: pass_k-iterable in evaluate.py
#104 by KedarnathKC was merged Sep 2, 2025 Loading… updated Sep 2, 2025
Reintroduce progress checker from #48
#86 by hvaara was merged Feb 20, 2025 Loading… updated Feb 20, 2025
Add support for Hugging Face Serverless Inference
#85 by hvaara was merged Feb 20, 2025 Loading… updated Feb 20, 2025
Await futures in progress checker
#48 by hvaara was merged Oct 3, 2024 Loading… updated Feb 20, 2025
Specify a unique cache directory before each code execution
#77 by shwinshaker was merged Feb 11, 2025 Loading… updated Feb 11, 2025
fix make_raw_chat_prompt when prefill is disabled
#75 by zhangchen-xu was merged Feb 8, 2025 Loading… updated Feb 8, 2025
Remove extra period in task BigCodeBench/16
#38 by hvaara was merged Sep 10, 2024 Loading… updated Sep 10, 2024
4 tasks done
add multiprocessing support for sanitization step
#37 by sk-g was merged Aug 15, 2024 Loading… updated Aug 15, 2024
Save pass@k result & use custom tokenizer
#20 by marianna13 was merged Jul 9, 2024 Loading… updated Jul 9, 2024
Update evaluate.py - Calculating gt_pass_rate befre evaluating the expression "gt_pass_rate > 0.99"
#12 by lapidshay was closed Jul 1, 2024 Loading… updated Jul 1, 2024
fix: update the generations to v0.1.5
#4 by terryyz was merged Jun 19, 2024 Loading… updated Jun 19, 2024
Previous Next
ProTip! no:milestone will show everything without a milestone.