Tuesday, June 4, 2024

Cisco Nexus HyperFabric AI cluster brings NVIDIA acceleration

Cisco introduced a new Nexus HyperFabric AI cluster solution featuring NVIDIA accelerated computing and AI software. The on-premise, enterprise-ready infrastructure is designed for scaling generative AI workloads.

The Cisco Nexus HyperFabric AI cluster solution offers automated, cloud-managed operations across a unified compute and networking fabric combining Cisco's Ethernet switching based on its Cisco Silicon One chip, integrated with NVIDIA's accelerated computing and NVIDIA AI Enterprise software, and VAST’s data storage platform. This will include:

  • Cisco cloud management capabilities
  • Cisco Nexus 6000 series switches for spine and leaf that deliver 400G and 800G Ethernet fabric performance
  • Cisco Optics family of QSFP-DD modules
  • NVIDIA AI Enterprise software to streamline the development and deployment of production-grade generative AI workloads
  • NVIDIA NIM inference microservices that accelerate the deployment of foundation models while ensuring data security, and are available with NVIDIA AI Enterprise
  • NVIDIA Tensor Core GPUs starting with the NVIDIA H200 NVL, designed from the ground up to supercharge generative AI workloads
  • NVIDIA BlueField-3 data processing unit DPU processor and BlueField-3 SuperNIC for accelerating AI compute networking, data access and security workloads
  • Enterprise reference design for AI built on NVIDIA MGX, a modular and flexible server architecture
  • The VAST Data Platform, which offers unified storage, database and a data-driven function engine built for AI


“While the promise of AI is clear, the path forward for many just starting out is not. Customers often face economic and operational challenges to get an AI stack up and running.” said Jonathan Davidson, Executive Vice President and General Manager, Cisco Networking. “Cisco is committed to making the deployment and operation of AI infrastructure simpler. Together with NVIDIA, we are delivering a simple-to-deploy, cloud-operated AI-stack solution for on premise deployments that builds on our Cisco Networking Cloud platform vision for automation and simplicity.”
false