Cartesia releases a new model of voice conversion: it can retain the tone characteristics of the speaker - AI article

Author：Eve Cole Update Time：2025-02-15 16:32:01

Artificial intelligence technology is constantly breaking through the boundaries of innovation, and the field of voice conversion has ushered in major progress. The Voice Changer model launched by Cartesia brings new possibilities to the industry with its unique retention ability of voice features.

Artificial intelligence company Cartesia recently launched a voice conversion model called "Voice Changer". Unlike traditional voice conversion, this model can not only convert input voice into target sound, but also maintain the expression characteristics of the tone, stress and other expressions in the original sound.

According to Cartesia's official introduction, users can try this feature on the play.cartesia.ai website. The company has released relevant API documents, and developers can view detailed instructions through docs.cartesia.ai.

The reporter noticed that this type of conversion technology that retains voice characteristics is not common in the market. Most existing tools tend to lose the speaker's tone changes when converting sounds, resulting in the converted sounds sounds more mechanical.

Cartesia details the specific implementation of the technology in its blog. However, the company has not yet responded to ethical issues that may be brought about by this technology, such as imitating other people's voices without authorization.

This innovative technology has opened up new directions for the field of voice conversion, but it has also triggered people's thinking about technological ethics. How to find a balance between innovation and norm will become an important topic in future development.