Create butter-smooth transitions between prompts, powered by Stable Diffusion

# What is latent blending?

Latent blending allows you to generate smooth video transitions between two prompts. It is based on [Stable Diffusion 2.0](https://stability.ai/blog/stable-diffusion-v2-release) and remixes the latent representations using spherical linear interpolation. This results in seamless transitions, where one image slowly turns into the other.
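
To make the mechanism concrete, below is a minimal sketch of spherical linear interpolation (slerp) between two latent tensors. The helper name and exact numerics are illustrative, not the repository's code; see latent_blending.py for the actual implementation.

```python
import torch

def slerp(latent0: torch.Tensor, latent1: torch.Tensor, t: float) -> torch.Tensor:
    """Spherical linear interpolation between two latents, with t in [0, 1]."""
    v0, v1 = latent0.flatten().double(), latent1.flatten().double()
    # Angle between the two latents, treated as points on a hypersphere
    dot = torch.clamp((v0 / v0.norm()) @ (v1 / v1.norm()), -1.0, 1.0)
    theta = torch.acos(dot)
    if theta.abs() < 1e-6:
        # Nearly parallel latents: fall back to plain linear interpolation
        mixed = (1 - t) * v0 + t * v1
    else:
        mixed = (torch.sin((1 - t) * theta) * v0 + torch.sin(t * theta) * v1) / torch.sin(theta)
    return mixed.reshape(latent0.shape).to(latent0.dtype)
```

Interpolating on the sphere, rather than cross-fading pixels or lerping latents directly, keeps the intermediate latents at a plausible norm, so the in-between frames still decode to credible images.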

## Example 1: simple transition

(mp4), [code](example1_standard.py)
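
In outline, a standard transition run looks roughly like the sketch below. The class and method names are modeled on latent_blending.py and stable_diffusion_holder.py in this repository but should be treated as assumptions about the API; example1_standard.py is the authoritative reference.

```python
# Hypothetical outline of a standard transition -- names and signatures are
# assumptions modeled on this repo's modules, not the exact API.
from stable_diffusion_holder import StableDiffusionHolder
from latent_blending import LatentBlending

sdh = StableDiffusionHolder("768-v-ema.ckpt")  # SD 2.0 standard checkpoint
lb = LatentBlending(sdh)
lb.set_prompt1("photo of a beach house at sunset")
lb.set_prompt2("photo of a beach house in a thunderstorm")
imgs_transition = lb.run_transition()  # list of intermediate frames
```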

## Example 2: inpainting transition

(mp4), [code](example2_inpaint.py)

## Example 3: concatenated transition

(mp4), [code](example3_multitrans.py)

# Relevant parameters

# Installation

## Packages

```bash
pip install -r requirements.txt
```

## Models

- Download the Stable Diffusion 2.0 Standard Model
- Download the Stable Diffusion 2.0 Inpainting Model (optional)
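
The original download links did not survive in this copy. Assuming the checkpoints are the ones Stability AI published on Hugging Face (the repo ids and filenames below are an assumption; verify them against the official model cards), they can be fetched with huggingface_hub:

```python
# Assumed Hugging Face repo ids and checkpoint filenames -- verify against the
# official Stability AI model cards before relying on them.
from huggingface_hub import hf_hub_download

fp_ckpt = hf_hub_download(
    repo_id="stabilityai/stable-diffusion-2",
    filename="768-v-ema.ckpt",
)
fp_ckpt_inpaint = hf_hub_download(  # optional, only for inpainting transitions
    repo_id="stabilityai/stable-diffusion-2-inpainting",
    filename="512-inpainting-ema.ckpt",
)
```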

## xformers efficient attention (copied from Stability AI)

For more efficiency and speed on GPUs, we highly recommend installing the [xformers](https://github.com/facebookresearch/xformers) library.

Tested on an A100 with CUDA 11.4. Installation needs a somewhat recent version of nvcc and gcc/g++; obtain those, e.g., via

```bash
export CUDA_HOME=/usr/local/cuda-11.4
conda install -c nvidia/label/cuda-11.4.0 cuda-nvcc
conda install -c conda-forge gcc
conda install -c conda-forge gxx_linux-64=9.5.0
```

Then, run the following (compilation can take up to 30 minutes):

```bash
cd ..
git clone https://github.com/facebookresearch/xformers.git
cd xformers
git submodule update --init --recursive
pip install -r requirements.txt
pip install -e .
cd ../stable-diffusion
```

Upon successful installation, the code automatically defaults to memory-efficient attention for the self- and cross-attention layers in the U-Net and autoencoder.

# How does it work?

## What makes a transition a good transition?

- absence of movement
- every frame looks like a credible photo