
DEEPSEEK
DeepSeek is an AI company based in Hangzhou, China, and the name of the family of large language models it develops. It was founded in 2023 and is funded by High-Flyer, a quantitative asset management firm. The company has released several models, including DeepSeek LLM, DeepSeek Coder, DeepSeekMath, and DeepSeek-VL. DeepSeek-V3, launched in December 2024, is a mixture-of-experts model with 671 billion total parameters (about 37 billion activated per token), trained on a dataset of 14.8 trillion tokens. It was trained in FP8, and the native FP8 weights are open-sourced. According to reported benchmark results, it outperforms Llama 3.1 and Qwen 2.5 and is comparable to GPT-4o and Claude 3.5 Sonnet. DeepSeek-R1, officially released on January 20, 2025, performs on a par with OpenAI o1 on mathematics, code, and natural language reasoning tasks. DeepSeek's models are used for chat, coding, multilingual translation, and image generation. Their combination of high performance and low cost has made them popular quickly: on February 2, 2025, the DeepSeek app reached the top of the Apple App Store download charts in 140 countries and also topped the Google Play Store in the United States.

NVIDIA's GB200 NVL72 Revolutionizes AI with Enhanced MoE Performance

Lawrence Jengar     Dec 05, 2025

#NVIDIA #MIXTURE OF EXPERTS #AI MODELS

NVIDIA's Mistral 3 Models Boost AI Efficiency and Accuracy

Darius Baruo     Dec 03, 2025

#NVIDIA #AI MODELS #MISTRAL 3

NVIDIA and Mistral AI Unveil Advanced Open-Source AI Models

Timothy Morano     Dec 03, 2025

#NVIDIA #MISTRAL AI #AI MODELS #OPEN SOURCE

Enhancing Financial Data Workflows with AI Model Distillation

Terrill Dicki     Dec 02, 2025

#AI #FINANCE #MODEL DISTILLATION

Together AI Sets New Benchmark with Fastest Inference for Open-Source Models

Felix Pinkston     Dec 02, 2025

#AI #OPEN-SOURCE MODELS #GPU OPTIMIZATION #INFERENCE SPEED

Black Forest Labs Launches FLUX.2 Models Optimized for NVIDIA RTX GPUs

Alvin Lang     Nov 26, 2025

#NVIDIA #AI MODELS #FLUX.2 #RTX GPUS

Understanding Model Quantization and Its Impact on AI Efficiency

Peter Zhang     Nov 25, 2025

#AI #MODEL QUANTIZATION #NVIDIA

ElevenLabs Launches Image & Video Platform for Unified Content Creation

Joerg Hiller     Nov 18, 2025

#ELEVENLABS #IMAGE & VIDEO #CONTENT CREATION #AI MODELS

Character.AI's Kaiju: Scaling Conversational Models with Efficiency and Safety

Jessie A Ellis     Nov 07, 2025

#AI #MACHINE LEARNING #CONVERSATIONAL MODELS

NVIDIA Enhances PyTorch with NeMo Automodel for Efficient MoE Training

Caroline Bishop     Nov 07, 2025

#NVIDIA #PYTORCH #MOE #AI #NEMO AUTOMODEL
Copyright © 2020 Blockchain News. All Rights Reserved.