DeepSeek AI Models

This cluster covers discussions of DeepSeek AI models: their performance, comparisons with OpenAI and Llama models, training methods such as distillation, open-source status, cost-effectiveness, and specific implementations.

🚀 Rising 177.1x · AI & Machine Learning
Comments: 2,407
Years Active: 9
Top Authors: 5
Topic ID: #3585

Activity Over Time

2016: 1
2019: 1
2020: 3
2021: 1
2022: 4
2023: 9
2024: 86
2025: 2,278
2026: 24

Keywords

AI IMO LLM ollama.com substack.com definite.app RL DeepSeek github.com R1 deepseek openai model models training o1 trading rl llama running

Sample Comments

scarface_74 Jan 31, 2025

DeepSeek is only a thing because they used OpenAI for refinement.

Animats Jan 28, 2025

Does anyone know how Deepseek does it yet?

karolist Jan 29, 2025

What's specific to deepseek here that other models do not use, or are you just riding the keyword wave?

billconan Jan 26, 2025

DeepSeek is based on llama, right? that 3% is only the fine tuning money?

ted_bunny Jun 25, 2025

Did deepseek have a notably different tenor?

blobbers Jul 1, 2025

Why is DeepSeek specifically called out?

xyzzy9563 Jan 31, 2025

What is the comparison of this versus DeepSeek in terms of good results and cost?

_alex_ Mar 5, 2025

what flavor of deepseek are you running? what kind of performance are you seeing?

cudgy Jan 29, 2025

Interesting take on deepseek. I think deepseek may be a breath of fresh air, slightly leveling the playing field for small startups and individual developers, assuming the claimed reduced training costs are real.

cbracketdash Apr 28, 2025

Wasn't it shown that DeepSeek was distilling OpenAI's models? Seems like a presumptuous claim by Liang Wenfeng.