Awesome ChatTTS Download - Awesome ChatTTS Source code download

Awesome ChatTTS

Other source code

1.0.0

Download

English | Simplified Chinese

Awesome-ChatTTS is an officially recommended ChatTTS resource summary project. You are welcome to recommend it or recommend it in issues.

If you think this project is helpful for you to understand and use ChatTTS, please give me some rewards and support.

Note

The following projects are community resources. For the official information, please go to the source warehouse 2noise/ChatTTS.

Official introduction
Quick experience
Popular branches
Interface description
Tone control
Getting started tutorial
Frequently Asked Questions
Quick check for errors

Official introduction

ChatTTS.-.001.-.ChatTTS.mp4

Quick experience

Website	type
Original Web	Original web version experience
Forge Web	Forge Enhanced Edition Experience
Linux	Python installation package
Samples	Tone seed example
Cloning	Tone cloning experience

Popular branches

Functional enhancement

project	Star	Highlights
jianchang512/ChatTTS-ui		Provides API interface that can be called in third-party applications
6drf21e/ChatTTS_colab		Provide streaming output, support long audio generation and part-character reading
lenML/ChatTTS-Forge		Provides vocal enhancement and background noise reduction, with additional prompt words available
CCmahua/ChatTTS-Enhanced		Supports batch processing of files and exports of SRT files
HKoon/ChatTTS-OpenVoice		Sound cloning with OpenVoice

Functional extension

project	Star	Highlights
6drf21e/ChatTTS_Speaker		Tone character marking and stability evaluation
AIFSH/ComfyUI-ChatTTS		ComfyUi version, which can be introduced as a workflow node
MaterialShadow/ChatTTS-manager		Provides a tone management system and WebUI interface

Interface description

Configuration item description

Text control

1. Input Text : Text that needs to be converted, supports mixed Chinese and English
2. Refine text : Whether to use colloquial processing of text
3. Text Seed : Configure text seed values, different seeds correspond to different colloquial styles
4. ? : Randomly generate text seed values
5. Output Text : Text generated after colloquial processing

Tone control

6. Timbre : Preset tone seed value
7. Audio Seed : Configure the tone seed value, different seeds correspond to different tones
8. ? : Randomly generates timbre seed values
9. Speaker Embedding : Tone code, see Tone Control for details

Emotional control

10. temperature : controls audio emotional volatility, with a range of 0-1. The larger the number, the greater the volatility.
11. top_P : controls the emotional correlation of audio, with a range of 0.1-0.9. The larger the number, the higher the correlation.
12. top_K : controls the emotional similarity of audio, with a range of 1-20. The smaller the number, the higher the similarity

Coefficient control

13. DVAE Coefficient : Model Coefficient Code
14. Reload : Reload model coefficients

Playback control

15. Auto Play : Whether to automatically play audio after it is generated
16. Stream Mode : Whether to enable streaming output
17. Generate : Click to generate audio file
18. Output Audio : Audio generation results
19. ↓ : Click to download the audio file
20. ▶️ : Click to play the audio file

Sample Control

21. Example : Click to switch the example configuration

Tone control

After actual testing, there is a significant difference in the effect of generating spk_emb each time the specified tone seed value is generated and reusing pre-generated spk_emb . It is recommended to use .pt tone files or tone codes (string representations).

The tone seeds were initially marked and stable evaluation in the ChatTTS_Speaker project, and the right tone can be quickly selected through examples.

WebUI

When used in the official WebUI, you can directly copy the tone code and replace the value in 9. Speaker Embedding to achieve tone control.

Python

When used in Python scripts, refer to the compression scheme in issue#07 to achieve tone control.

 spk = torch . load ( "asset/seed_1332_restored_emb.pt" , map_location = torch . device ( 'cpu' )). detach ()
spk_emb_str = compress_and_encode ( spk )

params_infer_code = ChatTTS . Chat . InferCodeParams (
    spk_emb = spk_emb_str ,  # add sampled speaker
    temperature = .0003 ,  # using custom temperature
    top_P = 0.7 ,  # top P decode
    top_K = 20 ,  # top K decode
)

Getting started tutorial

Chinese tutorial

video	Highlights
Brother Tongji Zihao	Detailed deployment tutorial from entry to advanced
ZTFS	Mac M1 deployment tutorial
King - Bao Bao	Windows Deployment Tutorial

English tutorial

video	Highlights
Sam Witteveen	Introduction to the English version

Frequently Asked Questions

After recent iterations, the problems in the source repository code have been basically solved. If you encounter problems, it is recommended to check the Chinese version of the official description document in detail first. If you have any questions, you can continue to view this document.

The model cannot be downloaded

The original project needs to download the corresponding model from HuggingFace. If you cannot access the Internet smoothly and scientifically, you will not be able to complete this step. As an alternative, you can download the model and configuration from modelscope and configure the local path.

Important

The model library on the Magic Tower is maintained by volunteers and does not guarantee that all models are up to date. Please verify it yourself if necessary.

Install modelscope dependencies in terminal

pip install modelscope

Modify the code in webui.py

 # 在开头导入依赖，并下载模型和配置
from modelscope import snapshot_download
model_dir = snapshot_download ( 'zlj2546/ChatTTS' )

# 第 118 行修改模型路径
ret = chat . load_models ( 'custom' , custom_path = model_dir )

Cannot run in IDE

When running in the IDE, the script cannot run smoothly due to the relative path of the file.

It is recommended to refer to the instructions in the quick startup of the official documentation and run it directly in the terminal.

Make sure that you are in the project root directory when executing the following command.

1. WebUI visual interface

python examples/web/webui.py

2. Command line interaction

The generated audio will be saved to ./output_audio_n.mp3

python examples/cmd/run.py " Your text 1. " " Your text 2. "

Tone tag read

This problem occurs because the official code does not cover all the time when dealing with Chinese punctuation, for example ？ Symbols such as , … are not processed, resulting in an error during model generation.

You can manually delete similar Chinese punctuation marks, or modify the code in ChatTTS/utils/infer_utils.py to add missing punctuation marks to the dictionary of character_map on lines 103.

 character_map = {
    '…' : '' ,
    '—' : ',' ,
    '＿' : ',' ,
    '？' : ',' ,
    }

GPU not available

The GPU requires at least 4G video memory, otherwise the CPU will be used. For related issues, please refer to the instructions in the ChatTTS-ui project.

Quick check for errors

1. load_models() got an unexpected keyword argument 'source'

See FAQs for details - Model cannot be downloaded

2. cannot import name 'CommitOperationAdd' from 'huggingface_hub'

See FAQs for details - Model cannot be downloaded

3. FileNotFoundError：［Erzno 2］ No such file or directory： 'C：\Users\xxx\.cache\huggingface\hub\models--2Noise--ChatTTS\snapshots

See FAQs for details - Model cannot be downloaded

4. local variable 'Normalizer' referenced before assignment

You need to install pynini and WeTextProcessing dependencies after completing the environment configuration.

conda install -c conda-forge pynini=2.1.5 && pip install WeTextProcessing

5. download to Local path D：pythonlprojectChatTTSChatTTS failed.

Execute scripts directly in the IDE, and an error will be reported due to file path problems. See FAQ for details - Cannot run in the IDE

6. ModuleNotFoundError : No module named'Cython'

The Python execution path is not found, Windows devices need to configure the environment path according to the tutorial

Project Trends

Expand

Additional Information

Version 1.0.0
Type Other source code
Update Time 2025-02-27
size 7.95MB
From Github

Related Applications

awesome citygml

2024-11-13
awesome generative ai guide

2024-11-05
GitHub sgrebnov/cordova plugin background download

2024-11-05
awesome swift

2024-11-03
Awesome Devil Game

2023-04-16
The Awesome Ad

2022-08-08

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
waymo open dataset

Other source code

December 2023 Update
Sunamu

Other source code

Release 2.2.0
MySchedule.py

Other source code

Updates to the fetching of week codes
waymo open dataset

Other source code

December 2023 Update
termwind

Other categories

v2.3.0
wp functions

Other categories

1.0.0

Related Information All