Pull requests: open-compass/VLMEvalKit
[Benchmark] Add support for MMSafetyBench, XSTest, MMSBench, Flames, SIUO and M3oralBench.
#1488 opened Mar 20, 2026 by Gugugugugutian
[Feature] Support sequential inference across all datasets and parallel evaluation.
#1487 opened Mar 19, 2026 by TianhaoLiang2000
[Benchmark] Add support for MMOral-OPG-Open benchmark
#1484 opened Mar 16, 2026 by isjinghao
[Benchmark] Add support for MMOral-OPG-Closed benchmark
#1483 opened Mar 16, 2026 by isjinghao
Fix LLaVA model output issues by using official conversation templates WIP
#1424 opened Feb 2, 2026 by cdllI
[Benchmark] Support ScienceOlympiad Galaxy10DECaLS VRSBench
#1410 opened Jan 21, 2026 by zhouyujin
add support for internvideo, videollama, keye and llava-onevision-1.5 WIP
#1244 opened Sep 22, 2025 by FangXinyu-0913
[major] add tsv, xlsx, json for PRED_FORMAT and csv, json for EVAL_FORMAT
#1172 opened Jul 20, 2025 by OliverLeeXZ
[Model] Add new model provider: AI/ML API
#1093 opened Jun 19, 2025 by D1m7asis
4 tasks done
[Minor] Align MMMU evaluation method with official Pending
#966 opened Apr 28, 2025 by FangXinyu-0913
[Benchmark] Support Video MCQ with TaskMeAnything-v1-video-random as an example
#359 opened Aug 5, 2024 by weikaih04
[Support] multiple process parallel inference large model on multi-gpu WIP
#298 opened Jul 19, 2024 by junming-yang