At its re:Invent 2024 conference, Amazon Web Services (AWS), Amazon’s cloud computing division, announced a new family of multimodal generative AI models called Nova.
There are four text-focused models in total: Micro, Lite, Pro, and Premier. Micro, Lite, and Pro are available today for AWS customers, while Premier will launch in Q1 2025, Amazon CEO Andy Jassy said onstage.
In addition to those, there’s an image generation model, Nova Canvas, and a video-generating model, Nova Reel. Both are publicly available today.
“We’ve continued to work on our own frontier models,” Jassy said, “and those frontier models have made a tremendous amount of progress over the last four to five months. And we figured, if we were finding value out of them, you would probably find value out of them.”
The text-based Nova models differ mainly in their capabilities and sizes.
Micro delivers the lowest latency of the bunch, processing text and generating answers the fastest. Lite can process image, video, and text inputs reasonably quickly.
Jassy said that Amazon’s working on a speech-to-speech model and an “any-to-any” model that should arrive around mid-2025.
“You’ll be able to input text, speech, images, or video and output text, speech, images, and video,” Jassy said of the any-to-any model.
Kyle Wiggers is a senior reporter at TechCrunch with a special interest in artificial intelligence. His writing has appeared in VentureBeat and Digital Trends, as well as a range of gadget blogs including Android Police, Android Authority, Droid-Life, and XDA-Developers. He lives in Brooklyn with his partner, a piano educator, and occasionally dabbles in piano himself, if mostly unsuccessfully.