CPUControlNet - Model Preserving Fine-Tuning with Adaptations

A modifed fork of ControlNet that works in CPU with no NVIDIA GPU dependency. Useful for hacking, testing, experimenting on laptop with tools complementary to a GPU implementation. ControlNet is a Domain Adaptation method applicable to Model-Preserving Fine-Tuning.

Motivation

Why? Sometimes experiments are more convenient executed on a laptop, but for many laptops the available GPU VRAM is only 4GB or less and the model might require 8GB GPU VRAM. This CPU port allows using host memory instead, and load the weights fully in a laptop with modest GPU.

Setup & Weights

Follow ControlNet install instruction.

Download the control_sd15_canny.pth checkpoint and move checkopint file in the ./model folder.

https://huggingface.co/lllyasviel/ControlNet/blob/main/models/control_sd15_canny.pth

Test Usage

Test harcoded to load an image from /data/test.jpg,

python ./cpu_canny2image.py

Model-Preserving Domain Adaptation

Conditional Control: Preserve Primary Model through Locked Layer Versions and Trainable Layer Copies

ControlNet adds conditional control to existing models without compromising the integrity of the original model. By utilizing a dual structure of "locked" (original) and "trainable" copies, it ensures that the primary model remains unaffected while allowing for domain-specific or conditional adaptations via the trainable copy.

Zero Convolution: Neutral State Start

The "zero convolution" is a 1×1 convolution with both weight and bias initialized as zeros

serves as a mechanism to start from a neutral state
initially ControlNet doesn't introduce any modifications to the outputs
As training progresses, these zero convolutions adapt and learn to introduce the desired changes

Approach advantages:

Preservation of Original Model: The "locked" version ensures that the pre-trained, production-ready model is not tampered with.
Adaptability: The "trainable" copy allows for domain-specific or conditional modifications.
Efficiency: Since no layer is trained from scratch and only fine-tuning is performed, training can be faster and feasible even on small-scale devices.
Flexibility: The architecture supports merging, replacing, or offsetting of models, weights, or layers, providing flexibility in model management and deployment.

Applications

Domain adaptation through conditional modeling
Applications where maintaining the integrity of the original model is crucial.
Fine-tuning and adaptation without the risk of forgetting or deteriorating the original model's capabilities.

Citation

@misc{zhang2023adding, title={Adding Conditional Control to Text-to-Image Diffusion Models}, author={Lvmin Zhang and Maneesh Agrawala}, year={2023}, eprint={2302.05543}, archivePrefix={arXiv}, primaryClass={cs.CV} }

Arxiv Link

Name		Name	Last commit message	Last commit date
Latest commit History 199 Commits
annotator		annotator
cldm		cldm
docs		docs
font		font
github_page		github_page
ldm		ldm
models		models
test_imgs		test_imgs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.py		config.py
cpu_canny2image.py		cpu_canny2image.py
environment.yaml		environment.yaml
gradio_annotator.py		gradio_annotator.py
gradio_canny2image.py		gradio_canny2image.py
gradio_depth2image.py		gradio_depth2image.py
gradio_fake_scribble2image.py		gradio_fake_scribble2image.py
gradio_hed2image.py		gradio_hed2image.py
gradio_hough2image.py		gradio_hough2image.py
gradio_normal2image.py		gradio_normal2image.py
gradio_pose2image.py		gradio_pose2image.py
gradio_scribble2image.py		gradio_scribble2image.py
gradio_scribble2image_interactive.py		gradio_scribble2image_interactive.py
gradio_seg2image.py		gradio_seg2image.py
share.py		share.py
tool_add_control.py		tool_add_control.py
tool_add_control_sd21.py		tool_add_control_sd21.py
tool_transfer_control.py		tool_transfer_control.py
tutorial_dataset.py		tutorial_dataset.py
tutorial_dataset_test.py		tutorial_dataset_test.py
tutorial_train.py		tutorial_train.py
tutorial_train_sd21.py		tutorial_train_sd21.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CPUControlNet - Model Preserving Fine-Tuning with Adaptations

Motivation

Setup & Weights

Test Usage

Model-Preserving Domain Adaptation

Conditional Control: Preserve Primary Model through Locked Layer Versions and Trainable Layer Copies

Zero Convolution: Neutral State Start

Approach advantages:

Applications

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CPUControlNet - Model Preserving Fine-Tuning with Adaptations

Motivation

Setup & Weights

Test Usage

Model-Preserving Domain Adaptation

Conditional Control: Preserve Primary Model through Locked Layer Versions and Trainable Layer Copies

Zero Convolution: Neutral State Start

Approach advantages:

Applications

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages