Wow this has spread fast in the last couple days. It's been in development since around Christmas when they released v3 but slowly exploded after their reasoning model came out on inauguration day.
The funniest thing about this is the fact Deepseek is open-source and their entire team publishes full research papers spilling the secrets and formulas they rely on. A Chinese lab is leading democratization of AI
And now Bytedance and others are popping up with similar LLMs.
Years of narratives on Chinese AI being X years behind and researchers lacking innovation destroyed in a week. They're basically doing this handicapped because of chip sanctions too
Oof
https://twitter.com/shangguanjiewen/status/1883058711520006177
This has even hit PhD holders and professors who were at prestigious universities and are leaving because of geopolitics/discrimination. It's become very common to see a new story pop up of a veteran scientist with decades in the U.S. going home to China to take up a cozy department head position at their university.
3 weeks ago: scmp.com/news/china/science/article/3293288/ex-dassault-aerospace-physicist-leaves-us-china-lead-new-energy-research
Those are just two quick examples. I can share years worth of this
How much of OpenAI’s chatgpt did they use to build their model?
how much of the public internet has openai used to train their models? it’s fair game atp
venturebeat.com/ai/why-everyone-in-ai-is-freaking-out-about-deepseek
Or simply it takes a lot of resources to get started but it takes a lot fewer resources to catch up. Best example I can give, it took America a couple hundred years to industrialize but it took Japan only a few decades. But that’s because the blueprints were already there so it’s way more efficient to catch up
Catch up? They’ve surpassed us and will continue to do so
And this isn’t infrastructure or something that takes decades and trillions to tear down and build up. This is just s*** behind a computer screen
Americans too busy crying about work from home because they desperately want to keep getting paid $100/hr on top of RSU’s to play video games, make tik toks, and eat out.
What’s wrong with wanting to work from home
Okay boomer.
the real crazy thing is this is completely open-source and from free research available, their work is published and literally anyone can benefit from it.
open source won again
and deepseek is genuinely a huge marvel when comparing to gpt. solving things at a rate much faster, and providing answers that are more accurate lolz
it's not open source: the training code and the data aren't available, only the research paper. to be fair, the paper shares a lot more than most papers, but still
it’s enough for people to replicate it isn’t it? that’s all that matters
people on twitter are all running it themselves. you just have to pay for API tokens, which are extremely cheap tbf.
https://arxiv.org/pdf/2501.12948
"After thousands of RL steps, DeepSeek-R1-Zero exhibits super performance on reasoning benchmarks. For instance, the pass@1 score on AIME 2024 increases from 15.6% to 71.0%, and with majority voting, the score further improves to 86.7%, matching the performance of OpenAI-o1-0912." I'm listening
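For anyone wondering what the difference between "pass@1" and "majority voting" in that quote actually is, here's a rough sketch. This is not the paper's evaluation code; the function names and the toy answers below are made up for illustration, and the paper's exact sampling setup isn't reproduced here. The idea, as I understand it: pass@1 is the average chance that a single sampled answer is right, while majority voting samples several answers per problem and only grades the most common one.

```python
from collections import Counter

def pass_at_1(samples, answers):
    """Average, over problems, of the fraction of sampled answers that are correct."""
    per_problem = [
        sum(s == truth for s in sample_list) / len(sample_list)
        for sample_list, truth in zip(samples, answers)
    ]
    return sum(per_problem) / len(per_problem)

def majority_vote_accuracy(samples, answers):
    """Fraction of problems where the most frequent sampled answer is correct."""
    hits = 0
    for sample_list, truth in zip(samples, answers):
        top_answer, _count = Counter(sample_list).most_common(1)[0]
        hits += top_answer == truth
    return hits / len(answers)

# Toy data (hypothetical): 3 problems, 4 sampled answers each.
samples = [
    ["42", "42", "17", "42"],  # mostly right
    ["7", "9", "7", "7"],      # mostly right
    ["3", "5", "5", "8"],      # right once, but the majority is wrong
]
answers = ["42", "7", "3"]

print(f"pass@1:        {pass_at_1(samples, answers):.3f}")
print(f"majority vote: {majority_vote_accuracy(samples, answers):.3f}")
```

On this toy data majority voting scores higher than pass@1, because voting filters out the occasional wrong sample on problems the model usually gets right. That's the same direction as the 71.0% to 86.7% jump in the quote, though obviously the numbers here mean nothing.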
It’s ok guys, we’re just gonna throw another $600 billion at AI and hire a bunch of H1Bs.
We’ll surely catch up.
Did this nigga @op really delete my post?
Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee
delete this, p****
people on twitter are all running it themselves. you just have to pay for API tokens, which are extremely cheap tbf.
yeah i have some versions of it running on my mac (google LM Studio it’s super easy to set up and free)
but i mean easy for researchers to reproduce the same work/make use of the same techniques etc so that others can build on their work, since it goes into detail about how it works and how they did it, which is how scientific research is supposed to work (afaik?)
compare that paper with openai’s gpt4 paper which barely gives any actual info about how they created it at all despite being 100+ pages (mostly self congratulatory “look how good this s*** we did is”) — arxiv.org/pdf/2303.12712
BP poster killin me
anti woke police in china is crazy
We love Panther!
Wow this has spread fast in the last couple days. It's been in development since around Christmas when they released v3 but slowly exploded after their reasoning model came out on inauguration day.
The funniest thing about this is the fact Deepseek is open-source and their entire team publishes full research papers spilling the secrets and formulas they rely on. A Chinese lab is leading democratization of AI
And now Bytedance and others are popping up with similar LLMs.
Years of narratives on Chinese AI being X years behind and researchers lacking innovation destroyed in a week. They're basically doing this handicapped because of chip sanctions too
China’s been at the forefront of computer vision for the last ten years. They realize that the technology itself isn’t what’s important; it’s the application of it. If you really want to be the leader in the AI space you need to make it easy for anyone to start playing around with it.