Microsoft’s VALL-E AI Can Simulate Any Voice with Just 3 Seconds of Audio
Microsoft has announced a new artificial intelligence (AI) model called VALL-E that can synthesize speech closely simulating a person’s voice from an audio sample as short as three seconds. The model builds on EnCodec, a neural audio codec that breaks audio into discrete components, or “tokens,” and it uses training data to match these tokens with the relevant sounds. To create VALL-E, Microsoft …