LiteFocus下載 - LiteFocus原始碼下載

LiteFocus

其他源碼

1.0.0

下載

萊特焦點

LiteFocus：用於長音頻合成的加速擴散推理
譚振雄、馬欣銀、方功凡、王新超
新加坡國立大學學習與視覺實驗室

TL;DR（太長；沒讀）

LiteFocus 是一款旨在加速基於擴散的 TTA 模型的工具，現已透過基礎模型 AudioLDM2 實現。它使處理速度加倍並提高音訊品質。

設定

準備環境（可選）

conda create -n litefocus python=3.10
conda activate litefocus

安裝基礎模型

pip3 install git+https://github.com/haoheliu/AudioLDM2.git

用法

基本用法

from audioldm2 import text_to_audio, build_model
import scipy

+ from litefocus import inject_lite_focus, disable_lite_focus

model = build_model(model_name='audioldm2-full')

+ inject_lite_focus(model)

waveform = text_to_audio(
    latent_diffusion=model,
    duration=40,
    text='Musical constellations twinkling in the night sky, forming a cosmic melody.',
)

scipy.io.wavfile.write("out.wav", rate=16000, data=waveform)

停用 LiteFocus

 disable_lite_focus ( model )

配置

 config = {
    'same_frequency' : True ,
    'cross_frequency' : True ,
    'sparse_ratio' : 0.1
}

inject_lite_focus ( model , config )

範圍	描述	預設值
`same_frequency`	使人們能夠關注共享相同頻率的代幣。	`True`
`cross_frequency`	能夠關注跨頻補償中的令牌。	`True`
`sparse_ratio`	指定`cross_frequency`的稀疏率。	0.1

待辦事項

音頻LDM2集成
擴散器管道集成

引文

 @article{
  tan2024lite,
  title={LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis},
  author={Zhenxiong Tan, Xinyin Ma, Gongfan Fang, and Xinchao Wang},
  journal={arXiv preprint arXiv:2407.10468},
  year={2024}
}

展開

附加信息

版本 1.0.0
類型其他源碼
更新時間 2024-11-30
大小 820.11KB
來自於 Github

相關應用

waymo open dataset

2024-11-18
SmartTube

2024-12-14
Sunamu

2024-12-14
MySchedule.py

2024-12-15
viptools for eslam

2024-12-15
VITAident

2024-12-15

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
waymo open dataset

其他源碼

December 2023 Update
SmartTube

其他源碼

24.71 Stable
Sunamu

其他源碼

Release 2.2.0
waymo open dataset

其他源碼

December 2023 Update
wp functions

其他類別

1.0.0
termwind

其他類別

v2.3.0

相關資訊全部