DeepSeek AI Models

This cluster centers on discussions about DeepSeek AI models, including their performance, comparisons to OpenAI and Llama, training methods like distillation, open-source status, cost-effectiveness, and specific implementations.

🚀 Rising 177.1x AI & Machine Learning

2,407

Comments

Years Active

Top Authors

#3585

Topic ID

Activity Over Time

2016

2019

2020

2021

2022

2023

2024

2025

2,278

2026

Top Contributors

blackeyeblitzar (32) buyucu (30) aurareturn (20) simonw (18) rfoo (14)

Keywords

AI IMO LLM ollama.com substack.com definite.app RL DeepSeek github.com R1 deepseek openai model models training o1 trading rl llama running

Sample Comments

scarface_74 • Jan 31, 2025 • View on HN

DeepSeek is only thing because they used OpenAI for refinement.

Animats • Jan 28, 2025 • View on HN

Does anyone know how Deepseek does it yet?

karolist • Jan 29, 2025 • View on HN

What's specific to deepseek here that other models do not use, or are you just riding the keyword wave?

billconan • Jan 26, 2025 • View on HN

DeepSeek is based on llama, right? that 3% is only the fine tuning money?

ted_bunny • Jun 25, 2025 • View on HN

Did deepseek have a notably dofferent tenor?

blobbers • Jul 1, 2025 • View on HN

Why is DeepSeek specifically called out?

xyzzy9563 • Jan 31, 2025 • View on HN

What is the comparison of this versus DeepSeek in terms of good results and cost?

_alex_ • Mar 5, 2025 • View on HN

what flavor of deepseek are you running? what kind of performance are you seeing?

cudgy • Jan 29, 2025 • View on HN

Interesting take on deepseek. I think deepseek may be a breath of fresh air slightly leveling the playing field for small startups and individual developers, Assuming the claimed reduced training cost are real.

cbracketdash • Apr 28, 2025 • View on HN

Wasn't it shown that DeepSeek was distilling OpenAI's models? Seems like a presumptuous claim by Liang Wenfeng