Amazon ha diffuso questo post
Thrilled to introduce a new addition to the Amazon Nova family of foundation models – Amazon Nova Sonic – a speech-to-speech model that makes it dramatically easier for developers to build voice-powered applications and AI agents that are more useful, natural, and engaging. Since launching Nova, we have been moving at lightning speed, guided by our customers’ needs. One thing we’ve heard loud and clear: building voice applications is far too complex. It’s like playing telephone between three different models – speech recognition to convert voice to text, a language model to understand and generate a text response, and text-to-speech to generate voice again. Even if it works, nuance is often lost, making the conversation with the AI feel unnatural and disconnected. Nova Sonic solves this by unifying all three steps – speech recognition, understanding, and generation – into a single architecture. This allows it to model not just what is spoken, but how it’s spoken, enabling more context-aware and dynamic responses. Nova Sonic also produces a text transcript so that developers can easily integrate their tools, APIs, and proprietary knowledge into their voice application. With Nova Sonic, developers can reimagine building AI agents that truly understand context, maintain natural conversation flow, and take action on user’s behalf. End users can simply say, “I need to cancel my flight this weekend,” and have natural back-and-forth conversation about their options. Available in American and British English in Amazon Bedrock today, Nova Sonic is the most accurate, fastest, and least expensive model in its class (and there are only a few in this space today). Customers like ASAPP, Education First, Stats Perform, and our own internal teams with early access to Nova Sonic, are ready to move from prototypes to production in a wide variety of use cases, including customer service automation, education, healthcare, and sports. Can't wait to see what developers will build with Nova Sonic to make AI truly accessible to everyone! https://amzn.to/3Gd2Mj5