Official repository for *Combining Audio Control and Style Transfer Using Latent Diffusion* by Nils Demerlé, Philippe Esling, Guillaume Doras and David Genova, accepted at ISMIR 2024 (paper link).
Training the model requires three steps: processing the dataset, training an autoencoder, and finally training the diffusion model.
To process an audio dataset into an lmdb database:

python dataset/split_to_lmdb.py --input_path /path/to/audio_dataset --output_path /path/to/audio_dataset/out_lmdb
Or, to use Slakh with MIDI processing (after downloading Slakh2100 here):
python dataset/split_to_lmdb_midi.py --input_path /path/to/slakh --output_path /path/to/slakh/out_lmdb_midi --slakh True
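Both scripts write the processed examples into an lmdb database. As a quick sanity check, a sketch like the following (assuming the `lmdb` Python package; the key/value layout itself is defined by the repo's dataset code) can count the entries that were written:

```python
import lmdb

# Open the database produced by split_to_lmdb.py (path is an example).
env = lmdb.open("/path/to/audio_dataset/out_lmdb", readonly=True, lock=False)

with env.begin() as txn:
    # stat() reports metadata about the main database,
    # including the total number of stored entries.
    print("entries:", txn.stat()["entries"])

env.close()
```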
Next, train the autoencoder:

python train_autoencoder.py --name my_autoencoder --db_path /path/to/lmdb --gpu #
Once the autoencoder is trained, it must be exported to a TorchScript .pt file:
python export_autoencoder.py --name my_autoencoder --step ##
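The exported file can be loaded back with `torch.jit.load` for a quick round-trip check. A minimal sketch, assuming the exported module exposes `encode` and `decode` methods and that the file name follows the `--name` argument (both assumptions, not guaranteed by the repo):

```python
import torch

# Load the exported TorchScript autoencoder (file name is an assumption).
autoencoder = torch.jit.load("my_autoencoder.pt")

# Round-trip one second of dummy mono audio; the input shape
# (batch, channels, samples) is an assumed convention.
x = torch.randn(1, 1, 44100)
z = autoencoder.encode(x)
x_hat = autoencoder.decode(z)
print(z.shape, x_hat.shape)
```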
This whole phase can be skipped by using a pretrained autoencoder such as EnCodec, wrapped in an nn.Module that exposes encode and decode methods.
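As an illustration, a wrapper along these lines could adapt a pretrained model to the expected interface (the class and method signatures here are hypothetical; only the `encode`/`decode` contract comes from this README):

```python
import torch
import torch.nn as nn

class PretrainedAutoencoder(nn.Module):
    """Hypothetical adapter exposing encode/decode around a pretrained codec."""

    def __init__(self, codec: nn.Module):
        super().__init__()
        self.codec = codec

    @torch.no_grad()
    def encode(self, x: torch.Tensor) -> torch.Tensor:
        # waveform (batch, channels, samples) -> latent representation
        return self.codec.encode(x)

    @torch.no_grad()
    def decode(self, z: torch.Tensor) -> torch.Tensor:
        # latent representation -> reconstructed waveform
        return self.codec.decode(z)
```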
The model training is configured with gin config files. To train the audio-to-audio model:

python train_diffusion.py --db_path /path/to/lmdb --config main --dataset_type waveform --gpu #

To train the MIDI-to-audio model:

python train_diffusion.py --db_path /path/to/slakh/out_lmdb_midi --config midi --dataset_type midi --gpu #
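The `--config` flag selects one of the repo's gin configuration files. For readers unfamiliar with gin, a file binds parameters of decorated functions via plain `configurable.parameter = value` lines; the parameter names below are purely hypothetical and not taken from this repository:

```python
import gin

# Purely illustrative gin usage; names are hypothetical.
@gin.configurable
def train(batch_size=16, learning_rate=1e-4):
    print(batch_size, learning_rate)

# A .gin file selected via --config would contain bindings such as:
#   train.batch_size = 32
gin.parse_config("train.batch_size = 32")
train()  # -> 32 0.0001
```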
TBA