![]() Samples of speech already synthesized by the software can also be found on the Valle-E site. Not surprisingly, unlike what happened with ChatGPT, Microsoft did not provide the code for Vall-E for others to experiment with. The applications in audio editing are many, as are the criticisms of the software’s potential for manipulation and misuse. The sophistication of the new tool launched by OpenAI and Microsoft lies in Valle-E’s ability to recognize the timbre, inflection, and emotional tone of the person who is speaking and replay it after only three seconds of listening. Speaker Prompt, Ground Truth, Baseline and Vall-E. Together with the GPT-3 software for text and Dall-E/Stable Diffusion for images, the Valle-E audio system completes the ChatGPT triptych and aims to revolutionize the field of generative AI. Hence, just as ChatGPT is able to generate codes autonomously, Valle-E is also designed to create discrete audio codecs from listening to an audio sample. The startup, funded by Elon Musk and Sam Altman, among others, also boasts the creation of ChatGPT, a chatbot that can sustain an interactive conversation with users by remembering and learning from previous actions and precedents. OpenAI defines Valle-E as a “natural codec language model,” as its operation is based on a technology called EnCodec. Valle-E was developed to be used with high-quality speech synthesis tools and to create original audio from an example sample. In recent years, AI chatbots have not performed as well as its creators expected because they were limited to small tasks and had a high error rate. The latter is able to respond with preset actions to specific tasks, but not to react to an unplanned action. Thus, as opposed to what we have known so far, which is “narrow” or “weak” AI. ![]() Valle-E is a tool of AGI, Artificial General Intelligence, that is, a “general” or “strong” artificial intelligence that can simulate human intelligence. Vall-E: all the details about the new chatbot from OpenAI and Microsoft In other words, this is the latest piece of the generative artificial intelligence system developed by Microsoft and OpenAI, with which since 2019 the colossus of Bill Gates is linked by a multi-year, multibillion-dollar partnership. This is a speech synthesis software that can simulate the human voice after just three seconds of listening. OpenAI and Microsoft continue the battle with Google in artificial intelligence by implementing Vall-E, the new voice chatbot. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |