open-compass / VLMEvalKit Public

Fork 655
Star 3.9k

Pull requests: open-compass/VLMEvalKit

Labels 17 Milestones 1

23 Open 896 Closed

Pull requests list

[Benchmark] Add support for MMSafetyBench, XSTest, MMSBench, Flames, SIUO and M3oralBench.

#1488 opened Mar 20, 2026 by Gugugugugutian

[Feature] Support sequential inference across all datasets and parallel evaluation.

#1487 opened Mar 19, 2026 by TianhaoLiang2000

[Benchmark] Add support for MMOral-OPG-Open benchmark

#1484 opened Mar 16, 2026 by isjinghao

[Benchmark] Add support for MMOral-OPG-Closed benchmark

#1483 opened Mar 16, 2026 by isjinghao

[Benchmark] Add support for MedQ-DEG-Bench

#1482 opened Mar 16, 2026 by liujiyaoFDU

[Model] Support for llava hf

#1479 opened Mar 11, 2026 by smgjch

[API] Add DeepOCR pipeline API provider

#1473 opened Mar 4, 2026 by leejooan

jt video chat v260227

#1460 opened Feb 28, 2026 by jiutiancv

Fix LLaVA model output issues by using official conversation templates [label: WIP]

#1424 opened Feb 2, 2026 by cdllI

[Benchmark] Support ScienceOlympiad Galaxy10DECaLS VRSBench

#1410 opened Jan 21, 2026 by zhouyujin

[Benchmark] Support SArena Benchmark

#1371 opened Dec 23, 2025 by JoeLeelyf

[Benchmark] Support OmniBench

#1327 opened Nov 26, 2025 by jmlee4967

[Benchmark] Support VP-Bench in Image MCQ [label: WIP]

#1322 opened Nov 23, 2025 by Endlinc

Request to add SIBench evaluation code

#1310 opened Nov 9, 2025 by song2yu

add support for internvideo, videollama, keye and llava-onevision-1.5 [label: WIP]

#1244 opened Sep 22, 2025 by FangXinyu-0913

[major] add tsv, xlsx, json for PRED_FORMAT and csv, json for EVAL_FORMAT

#1172 opened Jul 20, 2025 by OliverLeeXZ

[Model] Add new model provider: AI/ML API

#1093 opened Jun 19, 2025 by D1m7asis

4 tasks done

WIP: Add Capture Dataset [label: WIP]

#974 opened May 2, 2025 by bodsul

fix(image_mcp.py): use consistent argument (#929)

#968 opened Apr 28, 2025 by MaoSong2022

[Minor] Align MMMU evaluation method with official [label: Pending]

#966 opened Apr 28, 2025 by FangXinyu-0913

[Model] Add new model: Prism

#622 opened Nov 22, 2024 by Myhs-phz

[Benchmark] Support Video MCQ with TaskMeAnything-v1-video-random as an example

#359 opened Aug 5, 2024 by weikaih04

[Support] multiple process parallel inference large model on multi-gpu [label: WIP]

#298 opened Jul 19, 2024 by junming-yang
