Seed-TTS
Seed-TTS: High-Quality Autoregressive Text-to-Speech Model by ByteDance
Seed-TTS by ByteDance is a high-quality, versatile text-to-speech model that generates speech nearly indistinguishable from human speech. It excels in in-context learning, speaker similarity, and speech naturalness. Offering superior controllability over various speech attributes like emotion, Seed-TTS is capable of creating highly expressive and diverse speech. The model includes a non-autoregressive variant, Seed-TTS DiT, which uses a diffusion-based architecture for enhanced performance. Ideal for a variety of applications, Seed-TTS is revolutionizing speech technology.
2024-06-22
15.5K