- Notifications
You must be signed in to change notification settings - Fork 696
Pull requests: open-compass/opencompass
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Fix] Rename smolinstruct_pp_acc_0_shot_instruct dataset list as {}_datasets
#2340 opened Dec 2, 2025 by cnlnpjhsy Loading…
[Doc]Update evaluation configuration for Qwen3 model
#2332 opened Nov 26, 2025 by LukeLIN-web Loading…
6 tasks
feat(config): add Meta-Llama-3.1-8B-Instruct for MMLU benchmark
#2325 opened Nov 25, 2025 by 6taco Loading…
6 tasks
Add ProcessBench dataset and evaluation configuration
#2274 opened Sep 16, 2025 by sudanl Loading…
6 tasks done
[fix] Handle None value for max_out_len parameter in HuggingFace model
#2271 opened Sep 15, 2025 by Nexround Loading…
[Feature] Support pass@1 evaluation for multi predictions in MathEvaluator
#2253 opened Aug 28, 2025 by DELEnomore Loading…
feat: Add Zebra Grid dataset support with ZeroEval alignment
#2234 opened Aug 11, 2025 by max-yue Loading…
[Fix] Deprecate unused and error formated math500 gen file
#2206 opened Jul 17, 2025 by liushz Loading…
6 tasks
[Fix] livecodebench serialization and timeout errors
#2204 opened Jul 15, 2025 by f14-bertolotti Loading…
6 tasks
Previous Next
ProTip! Updated in the last three days: updated:>2025-11-30.