Tuesday, November 7, 2023

Microsoft runs inference processing in Oracle Cloud Infrastructure

Microsoft is using Oracle Cloud Infrastructure (OCI) AI infrastructure, along with Microsoft Azure AI infrastructure, for inferencing of AI models that are being optimized to power Microsoft Bing conversational searches daily.  Oracle confirmed that is has a multi-year with Microsoft supporting this application.

Leveraging the Oracle Interconnect for Microsoft Azure, Microsoft is able to use managed services like Azure Kubernetes Service (AKS) to orchestrate OCI Compute at massive scale to support increasing demand for Bing conversational search.

“Generative AI is a monumental technological leap and Oracle is enabling Microsoft and thousands of other businesses to build and run new products with our OCI AI capabilities,” said Karan Batta, senior vice president, Oracle Cloud Infrastructure. “By furthering our collaboration with Microsoft, we are able to help bring new experiences to more people around the world.”

“Microsoft Bing is leveraging the latest advancements in AI to provide a dramatically better search experience for people across the world,” said Divya Kumar, global head of marketing for Search & AI at Microsoft. “Our collaboration with Oracle and use of Oracle Cloud Infrastructure along with our Microsoft Azure AI infrastructure, will expand access to customers and improve the speed of many of our search results.”

OCI Superclusters include OCI Compute Bare Metal instances, ultra-low latency RDMA cluster networking, and a choice of HPC storage. OCI Superclusters can scale up to 4,096 OCI Compute Bare Metal instances with 32,768 A100 GPUs or 16,384 H100 GPUs, and petabytes of high-performance clustered file system storage to efficiently process massively parallel applications.

https://www.oracle.com/news/announcement/oracle-cloud-infrastructure-utilized-by-microsoft-for-bing-conversational-search-2023-11-07/