DEEPSEEK
NVIDIA vGPU 18.0 Expands AI Capabilities Across Virtual Platforms
NVIDIA's vGPU 18.0 release enhances AI capabilities on virtual platforms, supporting Microsoft Windows Server 2025 and Proxmox VE, and introduces new AI toolkits for developers.
NVIDIA's Project Aether Boosts Apache Spark Efficiency
NVIDIA introduces Project Aether, streamlining Apache Spark workloads with GPU acceleration, significantly reducing processing times and costs for enterprises globally.
Together AI Unveils Cost-Effective On-Demand Dedicated Endpoints
Together AI introduces Dedicated Endpoints with up to 43% lower pricing, offering enhanced GPU inference capabilities for scaling AI applications, providing high-performance and cost-efficiency.
Decoding PTX: The Core of NVIDIA CUDA GPU Computing
Explore PTX, the assembly language for NVIDIA CUDA GPUs, its role in enabling forward compatibility, and its significance in the GPU computing landscape.
Enhancing CUDA C++ Development with Optimized Compile Times
Learn how the new --fdevice-time-trace feature in CUDA 12.8 improves compile times for CUDA C++ developers, boosting productivity and efficiency.
NVIDIA Unveils Video Codec SDK 13.0 with Blackwell GPU Support
NVIDIA's Video Codec SDK 13.0 introduces significant upgrades with support for Blackwell GPUs, enhancing video encoding and decoding capabilities for modern video applications.
DeepSeek-R1 Enhances GPU Kernel Generation with Inference Time Scaling
NVIDIA's DeepSeek-R1 model uses inference-time scaling to improve GPU kernel generation, optimizing performance in AI models by efficiently managing computational resources during inference.
NVIDIA and Red Hat Enhance GPU Driver Support for RHEL9 with Signed Modules
NVIDIA and Red Hat collaborate to improve GPU driver support for Red Hat Enterprise Linux 9 by providing signed open GPU kernel modules, enhancing security and ease of deployment.
Optimizing Data Workflows with cudf.pandas Profiler for GPU Acceleration
Explore how cudf.pandas Profiler enhances data processing by leveraging GPU acceleration. Discover its benefits for optimizing Python data science workflows.
NVIDIA Unveils Enhanced Features in NCCL 2.23 for Improved GPU Communication
NVIDIA's NCCL 2.23 release introduces a new scaling algorithm, accelerated initialization, and a profiler plugin API, optimizing inter-GPU and multinode communication for AI and HPC applications.