Overview

  • Founded Date May 9, 1947
  • Sectors test
  • Posted Jobs 0
  • Viewed 59

Company Description

Open-R1: a Completely Open Reproduction Of DeepSeek-R1

Hey there! This post is an intro to the task, not a claim that we have actually reproduced R1 yet. We’re integrating in the open, so as quickly as we have examination numbers, we’ll share them. You can follow our progress on Hugging Face and GitHub.

True, however it appears like there’s absolutely nothing to be examined since right now. I assume the ultimate goal is to train a brand-new thinking model and then use the very same assessment metrics as o1 and the DeepSeek-R1.

Well, there ought to be at least some peace of mind check and recognition to guarantee the model was trained properly.

Oh yes, if you are talking about the assessment number of deepseek’s model it’s coming soon!

As discussed in the post there is no model called Open-R1 to test at all … not yet anyway. This is a blog detailing that Hugging face will take the R1 Deepseek model, exercise how it was constructed as laid out in the paper and from what they launched, and then replicate that process.

in reality this is basically how science works … A develops a strategy, discovery or innovation and it is tested by B, C and D to see if it is reproduceable. Thats been the foundation of research study now for a few centuries.

This blog site is not saying they have actually already done so … Its a blog site outlining an intent to start training a design like R1 and calling it Open-R1.

Also DeepSeek-R1 was only launched last week, and even in their paper they described the compute hours required. While those are low calculate hours for a SOTA design this does not imply you can train stated model in a week. I ‘d personally like to be able to train a transformer model in a week, but we might need to wait a while for that level of compute technology.

So there are no criteria for a model that has not been built yet right? As detailed in the blog site, and again in reply to your question.

However fear not, there is a GitHub Repo already and contributors (hell I may join myself), some prelim work done, and a plan of attack. A good starting position.

n
@edbeeching
has actually examined the launched designs already

( src: https://x.com/edwardbeeching/status/1884273209136275742)

R1 just trained on o1 outputs, so jointly …/ s. This is what the brand-new AI czars are saying

Hi! This blog site post is an intro to the project, not a claim that we have actually replicated R1 yet. We will completely share the missing out on piece when we have them, you can expect the models and datasets to be upload in this Hugging Face org and the code to be in this GitHub repo

That’s good and essential to comprehend this tremendous buzz that lacks technical comprehension and description. Science has to do with recreation, and if they claim to be open, let them fullfill the open part.

Please do publish the training expense.

We will!

Excalidraw Hi n
@bojan2501
thanks, we will undoubtedly be striving to ensure this training recipe can work for small language models on consumer hardware since not everybody has a cluster of H100s in your home:-RRB- The tool we used for the images was Excalidraw! https://excalidraw.com

eagerly anticipating it! WTF are your speaking about?

must be a joke

It’s actually cool to see how the entire open source community comes together!

Ops …

5.5 M is number reporter in the deepseekv3 tech report (simply the training, not the experiment afaik), for R1 difficult to estimate tbh however much less than 5.5 M imo

Historically, they have actually never released code or datasets of their LLM training, so I would not expect this time to be various. If they would release it that would be fantastic of course!

Yes of course!

So essentially you’re asking to replace existing censorship with another flavour of censorship?

The code for the designs are inside the design repositories, e.g. for V3: https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/modeling_deepseek.py

Hello Team, I’m Ray Bernard, the author and creator of EQUATOR. My research team will be dealing with a paper focused on duplicating certain parts of DeepSeek R1. Our goal is to reproduce the cold start and supply your group with a dataset that consists of COT and other techniques to support these efforts. We like to contribute our work to help. Please let me understand if you find this beneficial. Best, Ray Bernard https://www.facebook.com/groups/1186310571520299/

Where is the assessment numbers? without it you can’t call it reproduction.

8 replies

True, however it looks like there’s nothing to be evaluated since today. I assume the supreme goal is to train a brand-new reasoning model and after that utilize the exact same assessment metrics as o1 and the DeepSeek-R1.

That’s quite interesting, I was asking myself why the concerns the author exposed here are not being asked by others? I think the work they have done is memorable but at the very same time I wonder why they wouldn’t put these missing pieces on if they are supposed to be totally open.
Why even without reproduction and comprehension of the innovation they could affect so much the marketplace in this method?

4 replies

Hi! This article is an intro to the task, not a claim that we have actually reproduced R1 yet. We will completely share the missing out on piece when we have them, you can anticipate the designs and datasets to be upload in this Hugging Face org and the code to be in this GitHub repo

Interesting read, and it is excellent that we see more effort into this direction: more optimization and less brute force.
Also question what tool did the author usage for producing step diagram.

2 replies

Excalidraw I’m so thankful that initiative like this already exist, I’m gon na attempt to contribute:-RRB- 1 reply

looking forward to it! So racist articel

2 replies

WTF are your speaking about?

Awesome to have this open recreation began!

For Step # 1 check out https://github.com/open-thoughts/open-thoughts!

https://x.com/ryanmart3n/status/1884284101265612856

Let’s do this thing!

1 reply

It’s truly cool to see how the entire open source neighborhood comes together!

Does anyone understand the real training expense of r1? I can’t discover it in the paper or the announcement post. Is the 6M cost reported by media simply the number drawn from v3’s training expense?

2 replies

Ops …

Has anybody asked the DeepSeek group to publish their training data and code, or a minimum of share them privately with an independent duplication task like this? Have they turned down such a demand?

A loyal replication depends upon using the exact same dataset and hyperparameters. Otherwise, any major discrepancies with the published standards would be difficult to pin down-whether due to training data differences or the duplication method itself.

1 reply

Historically, they have actually never launched code or datasets of their LLM training, so I wouldn’t anticipate this time to be various. If they would launch it that would be remarkable obviously!

In the meantime we have to make best guess estimates and see if we can arrive ourselves.

You offer good duplication procedure of Deepseek thinking training. I will attempt something similar to it.

This is really great info, can we tweak with particular use case when code is launched?

1 reply

Yes obviously!

Please think about getting rid of biased, polluted or unaligned training information and make an effort to get rid of copyrighted works from the crawl from intake. This will make the model more functional. If you reused anthropic curation checks, this may also help, remove obviouslybiased information will likely include a lot of value. We do not want another tainted, unaligned open source model, right? And no corporate would ever use deepseek or a model that reuses it, right?
We value your work for the advantage of humanity, we hope.
Miike C from NJ

1 reply

So essentially you’re asking to replace existing censorship with another flavour of censorship?

Can’t wait! Hopefully the design will be uncensored however whatever you can do is alright! Love seeing open source structure itself up. I’m not clever adequate to actually assist however I can contribute support lol

Hello guys, I am even just searching for code for DeepSeek-V2, in order to fully understand multi-head hidden . You do not appear to have code in Hugging Face even for that. Or am I missing something? Don’t see anything in src/transformers/models. MLA is not properly described in their paper, so it would be very important to have code for this.