Amazon releases BASE TTS, the largest text-to-speech model ever

Author：Eve Cole Update Time：2025-02-02 06:16:01

Amazon's AGI team recently released BASE TTS, a text-to-speech model with 98 billion parameters that was trained using 100,000 hours of recording data. It is currently the largest model of its kind. The release of this model marks significant progress in text-to-speech technology. Its large number of parameters and massive training data are expected to significantly improve the naturalness and anthropomorphism of speech synthesis and bring users a better voice experience. The team's goal is to apply this model to learning applications to further improve the quality of human voices in text-to-speech applications.

The Amazon AGI team released BASE TTS, the largest text-to-speech model ever, with 98 billion parameters and trained using 100,000 hours of recording data. The team plans to use this model in learning applications to improve the quality of human voices in text-to-speech applications.

The release of the BASE TTS model demonstrates Amazon's strong strength in the field of artificial intelligence and its vision for future voice technology. It heralds the coming of more natural and realistic artificial voices, bringing richer possibilities to various application scenarios. In the future, we can expect BASE TTS to play a role in more fields and provide users with more convenient and better services.