Christoph Timmermann, Hyunse Lee & Woojin Lee
Graduate School of Computer Science and Artificial Intelligence
Dongguk University
This code is built on top of the code for CoOp. It uses the Dassl.pytorch toolbox, so you need to install the dassl environment first. Use only the Dassl fork below, as it contains a fix for compatibility with PyTorch 2.7.0.
Clone the Dassl fork:
```shell
git clone git@github.com:christti98/Dassl.pytorch.git
```

Follow this to install the dassl conda environment:

```shell
cd Dassl.pytorch/

# Create a conda environment
conda create -y -n dassl python=3.12.8

# Activate the environment
conda activate dassl

# Install torch (requires version >= 2.7.0) and torchvision
# Please refer to https://pytorch.org/ if you need a different CUDA version
pip install torch==2.7.0 torchvision --index-url https://download.pytorch.org/whl/cu128

# Install dependencies
pip install -r requirements.txt

# Install this library (no need to re-build if the source code is modified)
python setup.py develop
```

After that, run `pip install -r requirements.txt` under `SeMoBridge/` to install a few more required packages (do this while the dassl environment is activated). Then you are ready to go.
Follow DATASETS.md to install the datasets.
We include four shell (.sh) scripts to make running SeMoBridge or SeMoBridge-T easier.
scripts/semobridge/run_all_datasets_notrain.sh will run SeMoBridge on the standard 11 datasets for all shots (1, 2, 4, 8, 16) on three seeds (1, 2, 3).
You may have to change the DATA path in the script to point to the datasets.
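For example, the dataset root might be set at the top of the script like this (the path below is illustrative; use your own dataset location):

```shell
# Point the scripts at your dataset root (illustrative path)
DATA=/path/to/datasets
```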
To run SeMoBridge ViT-B/16 on everything, use the command:

```shell
bash scripts/semobridge/run_all_datasets_notrain.sh vit_b16 clip_ensemble,cupl_full True
```
The results will be saved in a .csv file, for example `OUTPUT/SeMoBridge_vit_b16_clip_ensemble,cupl_full_cbTrue.csv`.
SeMoBridge-T training is done in the same way:

```shell
bash scripts/semobridge/run_all_datasets.sh vit_b16 clip_ensemble,cupl_full True
```
The scripts take the following arguments:

```shell
run_all_datasets_notrain.sh [CONFIG FILE] [TEXT PROMPTS] [CSB False/True] [DATASET CONFIG NAME] [NUMBER OF SHOTS]
```
For example, to run SeMoBridge-T on 16-shot ImageNet with ViT-B/16 without CSB, use:

```shell
bash scripts/semobridge/run_all_datasets.sh vit_b16 clip_ensemble,cupl_full False imagenet 16
```
To run the out-of-distribution (OOD) datasets, use scripts/semobridge/ood_notrain.sh and scripts/semobridge/ood.sh.
They support the arguments:

```shell
ood_notrain.sh [CONFIG FILE] [TEXT PROMPTS] [CSB False/True] [NUMBER OF SHOTS]
```
To run SeMoBridge-T on the OOD datasets, it must be trained on ImageNet first, as the OOD script loads the trained model from the OUTPUT directory.
To generate the plots in Figure 2 (right) and Figure 7, manually run train.py with the --vis argument.
Other figures are generated with plot_training_time.py, bias_norms_plot.py, draw_curves.py (from a custom csv file), lambda_plot.py, and text_prompts_plot.py.
Preprocessed features are cached in preprocessed/. If you modify the code, you may need to delete this cache so the features are regenerated.
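To clear the cache, something like the following from the repository root works (note that `rm -rf` is destructive, so double-check the path before running):

```shell
# Remove the cached features; they will be regenerated on the next run
rm -rf preprocessed/
```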
We based our implementation on several excellent repositories: CLIP, SD-IPC, CoOp, TIP-Adapter, SuS-X, LDC, APE, and CuPL. We thank the respective authors for providing open access to their work.
