Name: DeepSeek: DeepSeek R1 Zero
Author: deepseek

Question 1

What is DeepSeek: DeepSeek R1 Zero?

Accepted Answer

DeepSeek-R1-Zero is a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step. It's 671B parameters in size, with 37B active in an inference pass.

It demonstrates remarkable performance on reasoning. With RL, DeepSeek-R1-Zero naturally emerged with numerous powerful and interesting reasoning behaviors.

DeepSeek-R1-Zero encounters challenges such as endless repetition, poor readability, and language mixing. See DeepSeek R1 for the SFT model.

Question 2

What is the context length of DeepSeek: DeepSeek R1 Zero?

Accepted Answer

DeepSeek: DeepSeek R1 Zero has a 163,840 token context window.

Question 3

What modalities does DeepSeek: DeepSeek R1 Zero support?

Accepted Answer

DeepSeek: DeepSeek R1 Zero accepts text input and produces text output.

Question 4

When was DeepSeek: DeepSeek R1 Zero released?

Accepted Answer

DeepSeek: DeepSeek R1 Zero was released on 2025-03-06. Its knowledge cutoff is 2024-07-31.

DeepSeek: DeepSeek R1 Zero

deepseek/deepseek-r1-zero

DeepSeek: DeepSeek R1 Zero

deepseek/deepseek-r1-zero

Activity

Frequently asked questions

Activity

Frequently asked questions