Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation (CVPR 2023)

[Project Page]

To run Plug-and-Play (PnP) diffusion features on your own images, follow these steps:

Setup

Create the environment and install the dependencies by running:

conda create -n pnp-diffusers python=3.9
conda activate pnp-diffusers
pip install -r requirements.txt

Latent Extraction

We first compute the intermediate noisy latents of the structure guidance image. To do that, run:

python preprocess.py --data_path <path_to_guidance_image> --inversion_prompt <inversion_prompt>

where <inversion_prompt> should describe the content of the guidance image. The intermediate noisy latents will be saved under the path latents_forward/<image_name>, where <image_name> is the filename of the provided guidance image.
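The extraction step can also be driven from a script. The following is a minimal sketch and not part of the repository: the helper names `build_preprocess_cmd` and `expected_latents_dir`, the example image path, and the example prompt are all hypothetical. It also assumes that <image_name> is the filename without its extension; check preprocess.py if the output directory differs.

```python
import shlex
import sys
from pathlib import Path


def build_preprocess_cmd(data_path, inversion_prompt):
    """Return the argv list for the latent-extraction step (hypothetical helper)."""
    return [
        sys.executable, "preprocess.py",
        "--data_path", str(data_path),
        "--inversion_prompt", inversion_prompt,
    ]


def expected_latents_dir(data_path, root="latents_forward"):
    """Guess where the noisy latents will be written (hypothetical helper)."""
    # Assumption: <image_name> is the guidance image filename without extension.
    return Path(root) / Path(data_path).stem


if __name__ == "__main__":
    # Placeholder image path and prompt, for illustration only.
    cmd = build_preprocess_cmd("data/horse.jpg", "a photo of a horse")
    print(shlex.join(cmd))
    print(expected_latents_dir("data/horse.jpg"))
    # To actually run the step from the repository root:
    # import subprocess; subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell-quoting problems when the inversion prompt contains spaces or punctuation.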

Running PnP

To apply PnP to the structure guidance image, run:

python pnp.py --config_path <pnp_config_path>

where <pnp_config_path> is a path to a yaml config file. The config includes fields for providing the guidance image path, the PnP output path, translation prompt, guidance scale, PnP feature and self-attention injection thresholds, and additional hyperparameters. See an example config in config_pnp.yaml.
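To make the config description concrete, here is a minimal sketch that generates such a file from Python. It is not the repository's code: the key names (`image_path`, `output_path`, `prompt`, `guidance_scale`, `pnp_f_t`, `pnp_attn_t`) are assumptions chosen to mirror the fields listed above, the values are placeholders rather than recommendations, and the range check assumes the injection thresholds are fractions between 0 and 1. Treat config_pnp.yaml as the authoritative list of keys.

```python
import json


def make_pnp_config(image_path, output_path, prompt,
                    guidance_scale, pnp_f_t, pnp_attn_t):
    """Build a flat config dict. Key names are illustrative assumptions."""
    for name, value in (("pnp_f_t", pnp_f_t), ("pnp_attn_t", pnp_attn_t)):
        # Assumption: each threshold is a fraction of the sampling steps.
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be in [0, 1], got {value}")
    return {
        "image_path": image_path,          # guidance image
        "output_path": output_path,        # where PnP results are written
        "prompt": prompt,                  # translation prompt
        "guidance_scale": guidance_scale,
        "pnp_f_t": pnp_f_t,                # feature injection threshold
        "pnp_attn_t": pnp_attn_t,          # self-attention injection threshold
    }


def to_yaml(config):
    """Serialize a flat mapping; JSON scalars are also valid YAML scalars."""
    return "".join(f"{key}: {json.dumps(value)}\n" for key, value in config.items())


if __name__ == "__main__":
    # Placeholder values for illustration only.
    config = make_pnp_config(
        image_path="data/horse.jpg",
        output_path="PNP-results/horse",
        prompt="a photo of a pink toy horse on the beach",
        guidance_scale=7.5,
        pnp_f_t=0.8,
        pnp_attn_t=0.5,
    )
    with open("my_pnp_config.yaml", "w") as f:
        f.write(to_yaml(config))
    # Then: python pnp.py --config_path my_pnp_config.yaml
```

Writing values with `json.dumps` keeps the sketch free of third-party dependencies; with PyYAML installed, `yaml.safe_dump` would serve the same purpose.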

Citation

@InProceedings{Tumanyan_2023_CVPR,
    author    = {Tumanyan, Narek and Geyer, Michal and Bagon, Shai and Dekel, Tali},
    title     = {Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
    pages     = {1921-1930}
}
