
Pull requests: EleutherAI/lm-evaluation-harness

Pull requests list

fix(gsm8k): align README to yaml file
#3388 opened Nov 6, 2025 by neoheartbeats
feat: refine Chain-of-Thought removal logic
#3386 opened Nov 6, 2025 by Co-Cl2
[feat] Add Countdown Task
#3384 opened Nov 4, 2025 by StephenXie
Math 500
#3381 opened Nov 1, 2025 by seldereyy
[fix] crows_pairs dataset
#3378 opened Oct 31, 2025 by jannalulu
[feat] add graphwalks
#3377 opened Oct 31, 2025 by jannalulu
fix trust_remote_code=True for longbench
#3361 opened Oct 22, 2025 by jannalulu
Longbench group fix
#3359 opened Oct 22, 2025 by jannalulu
Fix issue 3355 assertion error
#3356 opened Oct 20, 2025 by marksverdhei
Add gsm_symbolic and gsm_symbolic_cot tasks
#3354 opened Oct 19, 2025 by MengAiDev
fix(tasks):pin correct MMLUSR version
#3350 opened Oct 16, 2025 by christinaexyou
added azure openai support
#3349 opened Oct 16, 2025 by zinccat
Added ULQA benchmark
#3340 opened Oct 13, 2025 by keramjan
Add MATH500
#3311 opened Sep 26, 2025 by jannalulu
Support torchrun vllm DP
#3304 opened Sep 19, 2025 by luccafong