Search Results for "kubernetes"
NVIDIA Enhances AI Scalability with NIM Operator 3.0.0 Release
NVIDIA's NIM Operator 3.0.0 introduces advanced features for scalable AI inference, enhancing Kubernetes deployments with multi-LLM and multi-node capabilities, and efficient GPU utilization.
Ray Enhances Scheduling with New Label Selectors
Ray introduces label selectors, enhancing scheduling capabilities for developers, allowing more precise workload placement on nodes. The feature is a collaboration with Google Kubernetes Engine.
Kubernetes Embraces Multi-Node NVLink for Enhanced AI Workloads
NVIDIA's GB200 NVL72 introduces ComputeDomains for efficient AI workload management on Kubernetes, facilitating secure, high-bandwidth GPU connectivity across nodes.
NVIDIA Grove Simplifies AI Inference on Kubernetes
NVIDIA introduces Grove, a Kubernetes API that streamlines complex AI inference workloads, enhancing scalability and orchestration of multi-component systems.
NVIDIA Enhances AI Inference with Dynamo and Kubernetes Integration
NVIDIA's Dynamo platform now integrates with Kubernetes to streamline AI inference management, offering improved performance and reduced costs for data centers, according to NVIDIA's latest updates.
Enhancing Kubernetes AI Cluster Stability with NVSentinel
NVIDIA introduces NVSentinel, an open-source tool designed to automate health monitoring and issue remediation in Kubernetes AI clusters, ensuring GPU reliability and minimizing downtime.
NVIDIA Run:ai v2.24 Tackles GPU Scheduling Fairness for AI Workloads
NVIDIA's new time-based fairshare scheduling prevents GPU resource hogging in Kubernetes clusters, addressing critical bottleneck for enterprise AI deployments.
NVIDIA Launches AI Cluster Runtime to Standardize GPU Kubernetes Deployments
NVIDIA's new open-source AI Cluster Runtime project delivers validated, reproducible Kubernetes configurations for GPU clusters, targeting H100 and Blackwell accelerators.
NVIDIA Advances AI Infrastructure With Disaggregated LLM Inference on Kubernetes
NVIDIA details new Kubernetes deployment patterns for disaggregated LLM inference using Dynamo and Grove, promising better GPU utilization for AI workloads.
NVIDIA Donates GPU Resource Driver to Kubernetes Open Source Project
NVIDIA transfers critical GPU allocation software to CNCF at KubeCon Europe, marking major shift toward community-governed AI infrastructure.
NVIDIA MIG Boosts AI Infrastructure ROI by 33% Over Time-Slicing
New NVIDIA benchmarks show Multi-Instance GPU partitioning achieves 1.00 req/s per GPU versus 0.76 for time-slicing in production AI workloads.
Crypto.com Leverages Amazon's AWS Cloud To Enhance Scalability and Security
Crypto.com will now leverage Amazon Web Services, Inc (AWS) cloud provider to provide enhanced scalability and security for its 5 million users,