lumiere pytorch herunterladen - lumiere pytorch Quellcode herunterladen

lumiere pytorch

AI-Quellcode

0.0.24

Herunterladen

Lumiere - Pytorch

Implementierung von Lumiere, SOTA-Text-zu-Video-Generierung von Google Deepmind, in Pytorch

Yannics Papierrezension

Da es sich in diesem Artikel größtenteils nur um einige Schlüsselideen zusätzlich zum Text-zu-Bild-Modell handelt, werde ich noch einen Schritt weiter gehen und das neue Karras U-Net in diesem Repository auf Video erweitern.

Anerkennung

A16Z Open Source AI Grant Program und ? Huggingface für die großzügigen Sponsoren und auch für meine anderen Sponsoren, die mir die Unabhängigkeit gegeben haben, aktuelle Forschung im Bereich der künstlichen Intelligenz als Open-Source-Quelle zu nutzen

Installieren

$ pip install lumiere-pytorch

Verwendung

 import torch
from lumiere_pytorch import MPLumiere

from denoising_diffusion_pytorch import KarrasUnet

karras_unet = KarrasUnet (
    image_size = 256 ,
    dim = 8 ,
    channels = 3 ,
    dim_max = 768 ,
)

lumiere = MPLumiere (
    karras_unet ,
    image_size = 256 ,
    unet_time_kwarg = 'time' ,
    conv_module_names = [
        'downs.1' ,
        'ups.1' ,
        'downs.2' ,
        'ups.2' ,
    ],
    attn_module_names = [
        'mids.0'
    ],
    upsample_module_names = [
        'ups.2' ,
        'ups.1' ,
    ],
    downsample_module_names = [
        'downs.1' ,
        'downs.2'
    ]
)

noised_video = torch . randn ( 2 , 3 , 8 , 256 , 256 )
time = torch . ones ( 2 ,)

denoised_video = lumiere ( noised_video , time = time )

assert noised_video . shape == denoised_video . shape

Todo

Zitate

 @inproceedings { BarTal2024LumiereAS ,
    title   = { Lumiere: A Space-Time Diffusion Model for Video Generation } ,
    author  = { Omer Bar-Tal and Hila Chefer and Omer Tov and Charles Herrmann and Roni Paiss and Shiran Zada and Ariel Ephrat and Junhwa Hur and Yuanzhen Li and Tomer Michaeli and Oliver Wang and Deqing Sun and Tali Dekel and Inbar Mosseri } ,
    year    = { 2024 } ,
    url     = { https://api.semanticscholar.org/CorpusID:267095113 }
}

 @article { Karras2023AnalyzingAI ,
    title   = { Analyzing and Improving the Training Dynamics of Diffusion Models } ,
    author  = { Tero Karras and Miika Aittala and Jaakko Lehtinen and Janne Hellsten and Timo Aila and Samuli Laine } ,
    journal = { ArXiv } ,
    year    = { 2023 } ,
    volume  = { abs/2312.02696 } ,
    url     = { https://api.semanticscholar.org/CorpusID:265659032 }
}