
minbeomkim/VLind-Bench

 
 


VLind-Bench

Paper link: https://arxiv.org/abs/2406.08702

Usage

  1. Download the dataset from https://huggingface.co/datasets/klee972/VLind-Bench
  2. Arrange the directory structure as follows.

     ├── data
     │   ├── data.json
     │   ├── counterfactual
     │   ├── factual
     └── evel
         ├── ctx_cfq
         ├── gpt4o_eval.py
         ├── instructblip_eval.py
         ├── score_pipeline.py
         └── score.sh

  3. Run gpt4o_eval.py or instructblip_eval.py to generate model predictions.
  4. Run score.sh to compute pipeline scores and accuracies.
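Before running the evaluation scripts, it can help to confirm that the layout matches the listing above. The following is a minimal sketch and not part of the repository: the function name `missing_paths` and the `EXPECTED` list are hypothetical, and the paths are copied from the directory listing on the assumption that the scripts sit directly under `evel`.

```python
from pathlib import Path

# Entries copied from the directory listing in the usage steps.
# Assumption: the scripts and ctx_cfq sit directly under "evel".
EXPECTED = [
    "data/data.json",
    "data/counterfactual",
    "data/factual",
    "evel/ctx_cfq",
    "evel/gpt4o_eval.py",
    "evel/instructblip_eval.py",
    "evel/score_pipeline.py",
    "evel/score.sh",
]


def missing_paths(root="."):
    """Return the expected entries that do not exist under root."""
    base = Path(root)
    return [rel for rel in EXPECTED if not (base / rel).exists()]


if __name__ == "__main__":
    missing = missing_paths()
    if missing:
        print("Missing entries:")
        for rel in missing:
            print("  " + rel)
    else:
        print("Directory layout matches the expected structure.")
```

Run it from the repository root; any path it reports should be fixed before generating predictions.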

About

This is the official repository for the paper "VLind-Bench: Measuring Language Priors in Large Vision-Language Models".
