Text-2-video // the future of video-content creation

sbagency
Nov 24, 2023

--

https://stability.ai/research/stable-video-diffusion-scaling-latent-video-diffusion-models-to-large-datasets
https://github.com/Stability-AI/generative-models

November 21, 2023

We are releasing Stable Video Diffusion, an image-to-video model, for research purposes:

SVD: This model was trained to generate 14 frames at resolution 576x1024 given a context frame of the same size. We use the standard image encoder from SD 2.1, but replace the decoder with a temporally-aware deflickering decoder.

SVD-XT: Same architecture as SVD but finetuned for 25 frame generation.

We provide a streamlit demo scripts/demo/video_sampling.py and a standalone python script scripts/sampling/simple_video_sample.py for inference of both models.

Alongside the model, we release a technical report.

--

--

sbagency
sbagency

Written by sbagency

Tech/biz consulting, analytics, research for founders, startups, corps and govs.

No responses yet