Search Results for "ai inference"
NVIDIA's Rubin CPX GPU Revolutionizes Long-Context AI Inference
NVIDIA unveils Rubin CPX GPU, enhancing AI inference with unprecedented efficiency for 1M+ token workloads, transforming sectors like software development and video generation.
NVIDIA Enhances AI Scalability with NIM Operator 3.0.0 Release
NVIDIA's NIM Operator 3.0.0 introduces advanced features for scalable AI inference, enhancing Kubernetes deployments with multi-LLM and multi-node capabilities, and efficient GPU utilization.
Reducing AI Inference Latency with Speculative Decoding
Explore how speculative decoding techniques, including EAGLE-3, reduce latency and enhance efficiency in AI inference, optimizing large language model performance on NVIDIA GPUs.
NVIDIA Dynamo Tackles KV Cache Bottlenecks in AI Inference
NVIDIA Dynamo introduces KV Cache offloading to address memory bottlenecks in AI inference, enhancing efficiency and reducing costs for large language models.
NVIDIA Grove Simplifies AI Inference on Kubernetes
NVIDIA introduces Grove, a Kubernetes API that streamlines complex AI inference workloads, enhancing scalability and orchestration of multi-component systems.
NVIDIA Enhances AI Inference with Dynamo and Kubernetes Integration
NVIDIA's Dynamo platform now integrates with Kubernetes to streamline AI inference management, offering improved performance and reduced costs for data centers, according to NVIDIA's latest updates.
NVIDIA Achieves 10x AI Image Generation Speedup on Blackwell Data Center GPUs
NVIDIA's new NVFP4 optimizations deliver 10.2x faster FLUX.2 inference on Blackwell B200 GPUs versus H200, with near-linear multi-GPU scaling.
NVIDIA TensorRT for RTX Brings Self-Optimizing AI to Consumer GPUs
NVIDIA's TensorRT for RTX introduces adaptive inference that automatically optimizes AI workloads at runtime, delivering 1.32x performance gains on RTX 5090.
Dr. Ben Goertzel—Creating an AI Marketplace for Paypal’s 286 Million Users
In this second part, we discuss SingularityNET recent projects and collaborations with Toda Network, PICC Services and PayPal.
Blockchain and AI: The Delicate Balance Between Two Cyber Titans
Artificial Intelligence and Blockchain. A technological duality with immense potential to disrupt and create a new order. But is mankind playing with fire?
US Banking Giant Patents AI Fact Checker to Simplify Investing in Crypto
Capital One Services, a subsidiary of US banking giant Capital One has patented a new artificial intelligence system to guide human cryptocurrency traders through the complicated world of misinformation in the digital assets space.
Spanish Researchers Deploy AI and Blockchain-Powered App to Tame COVID-19
At least 100 Spanish researchers from the University of Salamanca, the Artificial Intelligent Research Institute, and the Institute of Biomedical Research of Salamanca have joined hands to design an AI and blockchain-based app to picture the evolution of the coronavirus (COVID-19) pandemic. Their objective is flattening this pandemic’s curve as it has wreaked havoc across the globe.