The new service, a fully managed compute and storage rack that will allow customers to run AWS cloud services from within their own data centers on AWS-designed hardware, is an extension of the company's work with VMware through VMware Cloud on AWS.
Says it designed Inferentia because GPU makers have focused much attention on training but too little on inference