Search Results for "ai infrastructure"
NVIDIA Run:ai Delivers 2x GPU Utilization Gains for AI Inference Workloads
NVIDIA benchmarks show Run:ai platform doubles GPU utilization while cutting latency 61x for enterprise AI deployments running NIM inference microservices.
NVIDIA Commits $4B to Optics Partners Lumentum, Coherent for AI Data Centers
NVIDIA invests $2 billion each in Lumentum and Coherent to scale silicon photonics for AI infrastructure. LITE stock jumps 12% on the news.
NVIDIA Drops $2B on Coherent as Optical Tech Race Heats Up
NVIDIA invests $2 billion in Coherent to develop silicon photonics for AI data centers, part of a broader $4B push into optical interconnect technology.
NVIDIA GTC 2026 Set for March 16-19 as Jensen Huang Teases Full AI Stack Reveal
NVIDIA's GTC 2026 conference runs March 16-19 in San Jose with 30,000+ attendees expected. CEO Jensen Huang keynote may unveil rumored new inference chip.
Oracle Seeks Air Permits for $165B Project Jupiter AI Campus in New Mexico
Oracle submitted permit applications for natural gas microgrids at its massive AI data center tied to the $500B Stargate initiative with OpenAI.
NVIDIA Releases Flash Attention Optimization Guide for Blackwell GPUs
NVIDIA's new cuTile framework delivers 1.6x speedups for Flash Attention on B200 GPUs, enabling faster LLM inference critical for AI infrastructure.
OpenAI Partners With Tata Group to Build 1GW AI Infrastructure in India
OpenAI launches India initiative with Tata Group, starting with 100MW data centers scaling to 1GW. ChatGPT Enterprise deployment planned for hundreds of thousands of TCS employees.
FlashAttention-4 Hits 71% GPU Utilization on NVIDIA Blackwell B200
Together AI's FlashAttention-4 achieves 1,605 TFLOPs/s on B200 GPUs, up to 2.7x faster than Triton. New pipelining overcomes asymmetric hardware scaling bottlenecks.
Bitfarms (BITF) Hires Six Executives to Lead HPC/AI Pivot as Stock Drops 8%
Bitfarms adds six senior leaders from MARA, Brookfield, and Stronghold as it accelerates transition from Bitcoin mining to AI data center infrastructure.
NVIDIA AIConfigurator Slashes LLM Deployment Time With 38% Performance Gains
NVIDIA's open-source AIConfigurator tool optimizes LLM serving configurations in seconds, delivering 38% throughput improvements for disaggregated AI inference deployments.
NVIDIA Launches Open-Source NIXL Library to Speed AI Inference Data Transfers
NVIDIA releases Inference Transfer Library (NIXL), an open-source tool accelerating KV cache transfers for distributed AI inference across major cloud platforms.
NVIDIA Megatron Core Gets Falcon-H1 Hybrid AI Architecture Support
Technology Innovation Institute integrates Falcon-H1 hybrid architecture and BitNet ternary training into NVIDIA's Megatron Core, enabling efficient large language model development.
