DEEPSEEK

DeepSeek is an AI company and a family of large language models based in Hangzhou, China. It was founded in 2023 and funded by High-Flyer, a well - known quantitative asset management giant. DeepSeek is dedicated to developing advanced large language models and related technologies. It has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek - VL. The latest version, DeepSeek - V3, which was launched in December 2024, has 67.1 billion parameters and was trained on a dataset of 14.8 trillion tokens. It uses FP8 training and open - sources the native FP8 weights. Benchmark tests show that it outperforms Llama 3.1 and Qwen 2.5 while matching GPT - 4O and Claude 3.5 Sonnet. In addition, DeepSeek - R1, which was officially released on January 20, 2025, performs on a par with OpenAI O1 in terms of mathematics, code, and natural language reasoning tasks. DeepSeek's models have a wide range of applications, such as chat and coding scenarios, multilingual automatic translation, image generation, and AI painting. With their high performance and low cost, DeepSeek's models have quickly gained popularity. For example, on February 2, 2025, the DeepSeek app climbed to the top of the download charts in 140 countries on the Apple App Store and also topped the Android Play Store in the United States

deepseek

Enhancing Polars GPU Parquet Reader Performance with Chunked Reading and UVM

Explore how Polars GPU Parquet Reader boosts performance using chunked reading and Unified Virtual Memory, enhancing data processing capabilities for large datasets.

by Ted Hisokawa
Apr 11, 2025

deepseek

NVIDIA vGPU 18.0 Expands AI Capabilities Across Virtual Platforms

NVIDIA's vGPU 18.0 release enhances AI capabilities on virtual platforms, supporting Microsoft Windows Server 2025 and Proxmox VE, and introduces new AI toolkits for developers.

by Iris Coleman
Mar 20, 2025

deepseek

NVIDIA's Project Aether Boosts Apache Spark Efficiency

NVIDIA introduces Project Aether, streamlining Apache Spark workloads with GPU acceleration, significantly reducing processing times and costs for enterprises globally.

by Darius Baruo
Mar 19, 2025

deepseek

Together AI Unveils Cost-Effective On-Demand Dedicated Endpoints

Together AI introduces Dedicated Endpoints with up to 43% lower pricing, offering enhanced GPU inference capabilities for scaling AI applications, providing high-performance and cost-efficiency.

by James Ding
Mar 14, 2025

deepseek

Decoding PTX: The Core of NVIDIA CUDA GPU Computing

Explore PTX, the assembly language for NVIDIA CUDA GPUs, its role in enabling forward compatibility, and its significance in the GPU computing landscape.

by Rebeca Moen
Mar 13, 2025

deepseek

Enhancing CUDA C++ Development with Optimized Compile Times

Learn how the new --fdevice-time-trace feature in CUDA 12.8 improves compile times for CUDA C++ developers, boosting productivity and efficiency.

by Rebeca Moen
Mar 11, 2025

deepseek

NVIDIA Unveils Video Codec SDK 13.0 with Blackwell GPU Support

NVIDIA's Video Codec SDK 13.0 introduces significant upgrades with support for Blackwell GPUs, enhancing video encoding and decoding capabilities for modern video applications.

by Luisa Crawford
Feb 25, 2025

deepseek

DeepSeek-R1 Enhances GPU Kernel Generation with Inference Time Scaling

NVIDIA's DeepSeek-R1 model uses inference-time scaling to improve GPU kernel generation, optimizing performance in AI models by efficiently managing computational resources during inference.

by Felix Pinkston
Feb 14, 2025

deepseek

NVIDIA and Red Hat Enhance GPU Driver Support for RHEL9 with Signed Modules

NVIDIA and Red Hat collaborate to improve GPU driver support for Red Hat Enterprise Linux 9 by providing signed open GPU kernel modules, enhancing security and ease of deployment.

by Rongchai Wang
Feb 12, 2025

deepseek

Optimizing Data Workflows with cudf.pandas Profiler for GPU Acceleration

Explore how cudf.pandas Profiler enhances data processing by leveraging GPU acceleration. Discover its benefits for optimizing Python data science workflows.

by Ted Hisokawa
Feb 01, 2025

DEEPSEEK

Trending topics