data_driven_ai_voice_cloning Download - data_driven_ai_voice


data_driven_ai_voice_cloning


Data driven AI voice cloning

This repository implements the main part of my master's thesis in Data Science & Engineering. It is divided into two parts:

Speaker Encoder

models: ECAPA-TDNN, WavLM series

data: VoxCeleb1, private dataset

Text-to-speech

model: FastSpeech2 (Microsoft implementation)

data: LibriTTS

These two parts are then integrated into ZeroShotFastSpeech2, a multi-speaker text-to-speech model that can clone unseen voices from about 5 seconds of reference audio.
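The description above does not say how the speaker encoder output is fed into the text-to-speech model. One common way to condition a FastSpeech2-style model on a speaker is to project the speaker embedding to the hidden size and add it to every phoneme encoder state. The sketch below illustrates that idea only; it is an assumption, not the method confirmed by this repository, and the names `l2_normalize`, `project`, `condition_on_speaker`, and the toy `weights` matrix are hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def project(embedding, weights):
    """Linear projection; `weights` has one row per output (hidden) dimension."""
    return [sum(w * e for w, e in zip(row, embedding)) for row in weights]


def condition_on_speaker(encoder_states, speaker_embedding, weights):
    """Add the normalized, projected speaker embedding to every encoder state."""
    bias = project(l2_normalize(speaker_embedding), weights)
    return [[h + b for h, b in zip(state, bias)] for state in encoder_states]


# Toy data: 3 phoneme states with hidden size 2, speaker embedding of size 3.
encoder_states = [[0.0, 0.0], [1.0, 1.0], [2.0, -1.0]]
# In a real system this vector would come from the speaker encoder,
# computed on a few seconds of reference audio (assumption).
speaker_embedding = [3.0, 0.0, 4.0]
# Hypothetical learned projection from embedding size 3 to hidden size 2.
weights = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]

conditioned = condition_on_speaker(encoder_states, speaker_embedding, weights)
print(conditioned)
```

Because the same vector is added at every position, the sequence length is unchanged and an unseen speaker only requires a new embedding, not retraining, which is what makes a zero-shot setup of this kind possible.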


Additional Information

Version 1.0.0
Type Other source code
Update Time 2024-12-05
Size 262.66 MB
From Github
