NVIDIA Network Operator v26.4.0
Enabling AI Networking in Kubernetes.
The NVIDIA Network Operator simplifies the provisioning and management of NVIDIA networking resources in a Kubernetes cluster. The operator automatically installs the required host networking software - bringing together all the needed components to provide high-speed network connectivity. These components include the NVIDIA networking driver, Kubernetes device plugin, CNI plugins, IP address management (IPAM) plugin and others. The NVIDIA Network Operator works in conjunction with the NVIDIA GPU Operator to deliver high-throughput, low-latency networking for scale-out, GPU computing clusters.
A Helm chart is provided for easily deploying the Network operator in a cluster to provision the host software on NVIDIA-enabled nodes.
Note
What’s new in 26.4.0
KubeVirt SR-IOV passthrough — accelerated East-West networking for VM workloads on vanilla Kubernetes. See KubeVirt SR-IOV Integration.
Spectrum-X RA 2.2 alignment — multi-plane fabric with topology-aware pod placement via DRA, plus simplified config generation through the K8s LaunchKit.
Dynamic Resource Allocation (DRA) SR-IOV driver — Kubernetes-native VF allocation for SR-IOV workloads. See DRA SR-IOV Driver.
For the full list of changes, see the Release Notes.
Networking Features
- RDMA Support
Remote Direct Memory Access (RDMA) for memory-to-memory data transfers that bypass the CPU and kernel networking stack. Supports InfiniBand on NVIDIA Quantum-X fabrics and RDMA over Converged Ethernet (RoCE) on NVIDIA Spectrum-X fabrics.
- SR-IOV Virtualization
Single Root I/O Virtualization technology that partitions network interface cards into multiple Virtual Functions (VFs) for hardware-level isolation and performance.
- Secondary Networks
Multiple network interface types including host device networks, MacVLAN networks, IP over InfiniBand networks, and SR-IOV networks for specialized networking requirements.
- Driver Management
Automated deployment and management of NVIDIA DOCA-OFED networking drivers across cluster nodes with version control and updates.
- Heterogeneous Cluster Support
Per-node-group NIC driver and device plugin configurations via NicNodePolicy, enabling different DOCA-OFED versions for different node roles.
Supported Hardware
See the Platform Support page for supported hardware and software.
Use Cases
AI Training at Scale: Distributed training across multi-GPU, multi-node clusters, with GPUDirect RDMA between GPUs and NICs for low-latency East-West traffic.
AI Inference and Generative AI: Multi-node serving of large language and multimodal models, where tensor and pipeline parallelism depend on accelerated GPU-to-GPU networking.
Telco CNFs and DPDK Workloads: Cloud-native network functions and DPDK-accelerated user-plane applications using SR-IOV for line-rate packet processing.
Media and Broadcast: Containerized live-broadcast pipelines on Kubernetes and OpenShift using NVIDIA Rivermax and uncompressed SMPTE ST 2110 over SR-IOV, with PTP-synchronized timing. Foundational to NVIDIA Holoscan for Media.
Edge AI and Sensor Streaming: Low-latency sensor ingest over SR-IOV and RDMA for real-time inference on NVIDIA IGX Orin and Grace ARM platforms.
High-Performance Computing (HPC): Scientific simulations, modeling, and distributed computing workloads.
Data Processing: Database systems, analytics platforms, and storage applications requiring high network throughput.
License Agreements
The NVIDIA Network Operator source code is licensed under Apache 2.0 and contributions are accepted with a DCO. See the contributing document for more information on how to contribute and the release artifacts.
NVIDIA Network Operator container images distributed via NVIDIA NGC are subject to the NVIDIA software license terms. Refer to the corresponding container listing on NGC for the applicable license; by pulling and using these containers you accept those terms.
Learn More
The Network Operator is open-source. For more information on contributions and release artifacts, see the GitHub repo.
For detailed deployment, operational, and release information:
Release Notes: What changed in this release, known issues, and upgrade considerations
Platform Support: Supported hardware, operating systems, and Kubernetes platforms
Quick Start Guide for Kubernetes: Quick deployment guide with common configurations
NVIDIA Network Operator Deployment Guide with Kubernetes: Detailed deployment scenarios
Life Cycle Management: Upgrade procedures and lifecycle guidance
Troubleshooting: Troubleshooting common issues