Create butter-smooth transitions between prompts, powered by stable diffusion

Go to file

lunarring b7a90020cc Update README.md		2022-12-02 11:42:09 +00:00
.gitignore	mp4 files	2022-11-28 08:45:40 +01:00
LICENSE	Initial commit	2022-11-19 19:40:58 +01:00
README.md	Update README.md	2022-12-02 11:42:09 +00:00
animation.gif	calmer animation	2022-12-01 09:23:31 +00:00
cherry_picknick.py	cleanup	2022-11-28 18:14:20 +01:00
example1_standard.py	cleanup	2022-11-28 18:14:20 +01:00
example2_inpaint.py	small fix	2022-12-01 08:47:58 +00:00
example3_multitrans.py	multi trans with new branching setup	2022-11-28 16:16:19 +01:00
latent_blending.py	latent injection for inpainting	2022-11-29 18:03:08 +01:00
movie_util.py	directory bug	2022-11-28 08:46:39 +01:00
stable_diffusion_holder.py	stable diffusion holder for version 2.0	2022-11-25 15:34:12 +01:00

README.md

What is latent blending?

Latent blending allows you to generate smooth video transitions between two prompts. It is based on (stable diffusion 2.0)[https://stability.ai/blog/stable-diffusion-v2-release] and remixes the latent reprensetation using spherical linear interpolations. This results in imperceptible transitions, where one image slowly turns into another one.

Example 1: simple transition

(mp4), code

Example 2: inpainting transition

(mp4), code

Example 3: concatenated transition

(mp4), code

Relevant parameters

Installation

Packages

 pip install -r requirements.txt

Models

Download the Stable Diffusion 2.0 Standard Model

Download the Stable Diffusion 2.0 Inpainting Model (optional)

xformers efficient attention (copied from stability)

For more efficiency and speed on GPUs, we highly recommended installing the xformers library.

Tested on A100 with CUDA 11.4. Installation needs a somewhat recent version of nvcc and gcc/g++, obtain those, e.g., via

export CUDA_HOME=/usr/local/cuda-11.4
conda install -c nvidia/label/cuda-11.4.0 cuda-nvcc
conda install -c conda-forge gcc
conda install -c conda-forge gxx_linux-64=9.5.0

Then, run the following (compiling takes up to 30 min).

cd ..
git clone https://github.com/facebookresearch/xformers.git
cd xformers
git submodule update --init --recursive
pip install -r requirements.txt
pip install -e .
cd ../stable-diffusion

Upon successful installation, the code will automatically default to memory efficient attention for the self- and cross-attention layers in the U-Net and autoencoder.

How does it work

what makes a transition a good transition?

absence of movement
every frame looks like a credible photo