Eipconsultants
Add a review FollowOverview
-
Founded Date April 17, 2013
-
Sectors test
-
Posted Jobs 0
-
Viewed 56
Company Description
Open-R1: a Fully Open Reproduction Of DeepSeek-R1

Hey there! This blog site post is an intro to the project, not a claim that we’ve reproduced R1 yet. We’re developing in the open, so as soon as we have assessment numbers, we’ll share them. You can follow our progress on Hugging Face and GitHub.

True, however it looks like there’s nothing to be evaluated since right now. I presume the ultimate objective is to train a new thinking model and after that utilize the exact same evaluation metrics as o1 and the DeepSeek-R1.

Well, there should be at least some peace of mind check and validation to make sure the model was trained properly.
Oh yes, if you are talking about the evaluation variety of deepseek’s model it’s coming soon!
As mentioned in the post there is no design called Open-R1 to evaluate at all … not yet anyway. This is a blog site describing that Hugging face will take the R1 Deepseek design, exercise how it was constructed as outlined in the paper and from what they released, and after that reproduce that procedure.
in truth this is practically how science works … A creates a plan, discovery or innovation and it is checked by B, C and D to see if it is reproduceable. Thats been the cornerstone of research study now for a few centuries.
This blog is not stating they have actually currently done so … Its a blog describing an intent to start training a design like R1 and calling it Open-R1.
Also DeepSeek-R1 was just launched last week, and even in their paper they detailed the compute hours required. While those are low compute hours for a SOTA model this does not indicate you can train said model in a week. I ‘d personally like to be able to train a transformer design in a week, however we might need to wait a while for that level of compute innovation.
So there are no standards for a design that has not been built yet right? As outlined in the blog site, and once again in reply to your question.
However fear not, there is a GitHub Repo already and factors (hell I might join myself), some prelim work done, and a master plan. A good starting position.
n
@edbeeching
has evaluated the launched designs already
( src: https://x.com/edwardbeeching/status/1884273209136275742)
R1 simply trained on o1 outputs, so jointly …/ s. This is what the brand-new AI czars are saying
Hi! This blog site post is an introduction to the job, not a claim that we have actually replicated R1 yet. We will totally share the missing out on piece when we have them, you can anticipate the designs and datasets to be upload in this Hugging Face org and the code to be in this GitHub repo
That’s good and essential to comprehend this tremendous hype that does not have technical comprehension and description. Science is about recreation, and if they claim to be open, let them fullfill the open part.
Please do publish the training cost.
We will!
Excalidraw Hi n
@bojan2501
thanks, we will indeed be striving to make certain this training dish can work for small language models on consumer hardware because not everybody has a cluster of H100s in the house:-RRB- The tool we used for the images was Excalidraw! https://excalidraw.com
looking forward to it! WTF are your speaking about?
need to be a joke
It’s truly cool to see how the entire open source community comes together!
Ops …
5.5 M is number reporter in the deepseekv3 tech report (just the training, not the experiment afaik), for R1 difficult to approximate tbh but much less than 5.5 M imo
Historically, they have actually never ever released code or datasets of their LLM training, so I wouldn’t anticipate this time to be different. If they would release it that would be incredible naturally!
Yes obviously!
So generally you’re asking to replace existing censorship with another flavour of censorship?
The code for the models are inside the model repositories, e.g. for V3: https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/modeling_deepseek.py
Hello Team, I’m Ray Bernard, the author and creator of EQUATOR. My research group will be dealing with a paper concentrated on replicating specific of DeepSeek R1. Our aim is to replicate the cold start and supply your group with a dataset that consists of COT and other techniques to support these efforts. We like to contribute our work to assist. Please let me understand if you discover this beneficial. Best, Ray Bernard https://www.facebook.com/groups/1186310571520299/
Where is the evaluation numbers? without it you can’t call it reproduction.
8 replies
True, however it looks like there’s nothing to be examined as of right now. I presume the ultimate goal is to train a new reasoning design and after that use the very same examination metrics as o1 and the DeepSeek-R1.
That’s rather fascinating, I was asking myself why the questions the author exposed here are not being asked by others? I think the work they have actually done is unforgettable but at the very same time I wonder why they wouldn’t put these missing out on pieces on if they are supposed to be totally open.
Why even without reproduction and comprehension of the development they could impact so much the market in this method?
4 replies
Hi! This article is an intro to the job, not a claim that we’ve replicated R1 yet. We will totally share the missing piece when we have them, you can anticipate the models and datasets to be upload in this Hugging Face org and the code to be in this GitHub repo
Interesting read, and it is excellent that we see more effort into this direction: more optimization and less brute force.
Also question what tool did the author use for developing action diagram.
2 replies
Excalidraw I’m so happy that initiative like this currently exist, I’m gon na try to contribute:-RRB- 1 reply
looking forward to it! So racist articel
2 replies
WTF are your speaking about?
Awesome to have this open recreation started!
For Step # 1 check out https://github.com/open-thoughts/open-thoughts!
https://x.com/ryanmart3n/status/1884284101265612856
Let’s do this thing!
1 reply
It’s really cool to see how the whole open source community comes together!
Does anybody understand the actual training cost of r1? I can’t discover it in the paper or the statement post. Is the 6M expense reported by media just the number drawn from v3’s training cost?
2 replies
Ops …
Has anyone asked the DeepSeek group to release their training information and code, or a minimum of share them independently with an independent replication task like this? Have they turned down such a request?
A faithful replication depends on utilizing the same dataset and hyperparameters. Otherwise, any significant disparities with the published criteria would be difficult to pin down-whether due to training information differences or the replication method itself.
1 reply
Historically, they have actually never ever released code or datasets of their LLM training, so I wouldn’t expect this time to be different. If they would launch it that would be fantastic obviously!
In the meantime we need to make finest guess price quotes and see if we can arrive ourselves.
You supply good duplication procedure of Deepseek thinking training. I will attempt something similar to it.
This is really excellent details, can we tweak with particular use case when code is launched?
1 reply
Yes obviously!
Please think about removing prejudiced, tainted or unaligned training data and make an effort to remove copyrighted works from the crawl from intake. This will make the model more usable. If you recycled anthropic curation checks, this might also assist, get rid of obviouslybiased data will likely add a lot of worth. We do not desire another polluted, unaligned open source design, right? And no business would ever use deepseek or a model that recycles it, right?
We appreciate your work for the benefit of humankind, we hope.
Miike C from NJ
1 reply
So generally you’re asking to replace existing censorship with another flavour of censorship?

Can’t wait! Hopefully the model will be uncensored but whatever you can do is alright! Love seeing open source structure itself up. I’m not wise enough to really help but I can contribute ethical support lol
Hello guys, I am even simply searching for code for DeepSeek-V2, in order to completely comprehend multi-head hidden attention. You do not seem to have code in Hugging Face even for that. Or am I missing out on something? Don’t see anything in src/transformers/models. MLA is not properly explained in their paper, so it would be necessary to have code for this.
