LLM News - Blockchain.News

DEEPSEEK

DeepSeek is an AI company and a family of large language models based in Hangzhou, China. It was founded in 2023 and funded by High-Flyer, a well - known quantitative asset management giant. DeepSeek is dedicated to developing advanced large language models and related technologies. It has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek - VL. The latest version, DeepSeek - V3, which was launched in December 2024, has 67.1 billion parameters and was trained on a dataset of 14.8 trillion tokens. It uses FP8 training and open - sources the native FP8 weights. Benchmark tests show that it outperforms Llama 3.1 and Qwen 2.5 while matching GPT - 4O and Claude 3.5 Sonnet. In addition, DeepSeek - R1, which was officially released on January 20, 2025, performs on a par with OpenAI O1 in terms of mathematics, code, and natural language reasoning tasks. DeepSeek's models have a wide range of applications, such as chat and coding scenarios, multilingual automatic translation, image generation, and AI painting. With their high performance and low cost, DeepSeek's models have quickly gained popularity. For example, on February 2, 2025, the DeepSeek app climbed to the top of the download charts in 140 countries on the Apple App Store and also topped the Android Play Store in the United States

deepseek

AutoJudge Revolutionizes LLM Inference with Enhanced Token Processing

AutoJudge introduces a novel method to accelerate large language model inference by optimizing token processing, reducing human annotation needs, and improving processing speed with minimal accuracy loss.

by Caroline Bishop
Dec 05, 2025

deepseek

NVIDIA's ComputeEval 2025.2 Challenges LLMs with Advanced CUDA Tasks

NVIDIA expands ComputeEval with 232 new CUDA challenges, testing LLMs' capabilities in complex programming tasks. Discover the impact on AI-assisted coding.

by Peter Zhang
Nov 07, 2025

deepseek

Generative AI Revolutionizes Legal Services with Custom LLMs

Harvey's custom LLMs are transforming legal services by addressing complex legal challenges across various jurisdictions and practice areas, enhancing efficiency and accuracy.

by Felix Pinkston
Nov 07, 2025

deepseek

Unsloth Simplifies LLM Training on NVIDIA Blackwell GPUs

Unsloth's open-source framework enables efficient LLM training on NVIDIA Blackwell GPUs, democratizing AI development with faster throughput and reduced VRAM usage.

by Iris Coleman
Oct 24, 2025

deepseek

ATLAS: Revolutionizing LLM Inference with Adaptive Learning

Together.ai introduces ATLAS, a system enhancing LLM inference speed by adapting to workloads, achieving 500 TPS on DeepSeek-V3.1.

by Rongchai Wang
Oct 10, 2025

deepseek

NVIDIA AI Red Team Offers Critical Security Insights for LLM Applications

NVIDIA's AI Red Team has identified key vulnerabilities in AI systems, offering practical advice to enhance security in LLM applications, focusing on code execution, access control, and data exfiltration.

by Iris Coleman
Oct 04, 2025

deepseek

Enhancing LLM Inference with NVIDIA Run:ai and Dynamo Integration

NVIDIA's Run:ai v2.23 integrates with Dynamo to address large language model inference challenges, offering gang scheduling and topology-aware placement for efficient, scalable deployments.

by Lawrence Jengar
Sep 29, 2025

deepseek

NVIDIA's Run:ai Model Streamer Enhances LLM Inference Speed

NVIDIA introduces the Run:ai Model Streamer, significantly reducing cold start latency for large language models in GPU environments, enhancing user experience and scalability.

by Ted Hisokawa
Sep 17, 2025

deepseek

Enhancing LLM Inference with CPU-GPU Memory Sharing

NVIDIA introduces a unified memory architecture to optimize large language model inference, addressing memory constraints and improving performance.

by Felix Pinkston
Sep 06, 2025

deepseek

Solana (SOL) Bench: Evaluating LLMs' Competence in Crypto Transactions

Solana (SOL) introduces Solana Bench, a tool to assess the effectiveness of LLMs in executing complex crypto transactions on the Solana blockchain.

by Terrill Dicki
Aug 28, 2025

DEEPSEEK

Trending topics