DeepSeek-R1: New open source AI model from Chinese startup threatens OpenAI/US lead in AI race; tech bros panicking

YANDHI

Jan 26, 2025

Wow, this has spread fast in the last couple of days. It's been a development since around Christmas when they released v3, but it slowly exploded after they released their reasoning model on inauguration day.

The funniest thing about this is the fact Deepseek is open-source and their entire team publishes full research papers spilling the secrets and formulas they rely on. A Chinese lab is leading democratization of AI

And now Bytedance and others are popping up with similar LLMs.

Years of narratives on Chinese AI being X years behind and researchers lacking innovation destroyed in a week. They're basically doing this handicapped because of chip sanctions too

frenchpress 🔺

Jan 26, 2025

that's kinda fire tbh.

YANDHI

Jan 26, 2025

BrainWorms4U

Oof

https://twitter.com/shangguanjiewen/status/1883058711520006177

This has even hit PhD holders and professors who were at prestigious universities and are leaving because of geopolitics/discrimination. It's now very common to see a new story pop up of a veteran scientist with decades in the U.S. going home to China to take up a cozy department head position at their university.

3 days ago: scmp.com/news/china/science/article/3295774/chen-jing-award-winning-computer-scientist-and-blockchain-expert-returns-china-us

3 weeks ago: scmp.com/news/china/science/article/3293288/ex-dassault-aerospace-physicist-leaves-us-china-lead-new-energy-research

Those are just two quick examples. I can share years worth of this

sco

Jan 26, 2025

dotM

How much of OpenAI’s chatgpt did they use to build their model?

how much of the public internet has openai used to train their models? it’s fair game atp

venturebeat.com/ai/why-everyone-in-ai-is-freaking-out-about-deepseek

Shampoo Bracelets

Jan 26, 2025

Squilliam

Or simply it takes a lot of resources to get started but it takes a lot fewer resources to catch up. Best example I can give, it took America a couple hundred years to industrialize but it took Japan only a few decades. But that’s because the blueprints were already there so it’s way more efficient to catch up

Catch up? They’ve surpassed us and will continue to do so

And this isn’t infrastructure or something that takes decades and trillions to tear down and build up. This is just s*** behind a computer screen

Americans too busy crying about work from home because they desperately want to keep getting paid $100/hr on top of RSUs to play video games, make tik toks, and eat out.

emo genghis khan

Jan 26, 2025

viscera

Jbreezyondeck 🌬️

Jan 26, 2025

Shampoo Bracelets

Catch up? They’ve surpassed us and will continue to do so

And this isn’t infrastructure or something that takes decades and trillions to tear down and build up. This is just s*** behind a computer screen

Americans too busy crying about work from home because they desperately want to keep getting paid $100/hr on top of RSUs to play video games, make tik toks, and eat out.

What’s wrong with wanting to work from home

Wack300

Jan 26, 2025

Shampoo Bracelets

Catch up? They’ve surpassed us and will continue to do so

And this isn’t infrastructure or something that takes decades and trillions to tear down and build up. This is just s*** behind a computer screen

Americans too busy crying about work from home because they desperately want to keep getting paid $100/hr on top of RSUs to play video games, make tik toks, and eat out.

Okay boomer.

Nessy 🦎

Jan 26, 2025

kusa

the real crazy thing is this is completely open-source and from free research available, their work is published and literally anyone can benefit from it.

open source won again

and deepseek is genuinely a huge marvel compared to gpt. solving things at a much faster rate, and providing answers that are more accurate lolz

it's not open source: the training code and the data aren't available, only the research paper. to be fair the paper shares a lot more than most papers, but still

nicobarret

Jan 26, 2025

Open source will always catch up and win

sco

Jan 26, 2025

Nessy

it's not open source: the training code and the data aren't available, only the research paper. to be fair the paper shares a lot more than most papers, but still

it’s enough for people to replicate it isn’t it? that’s all that matters

sco

Jan 26, 2025

arxiv.org/pdf/2501.12948

Plight2

Jan 26, 2025

sco

it’s enough for people to replicate it isn’t it? that’s all that matters

people on twitter are all running it themselves. you just have to pay for API tokens, which are extremely cheap tbf.

soapmanwun

Jan 26, 2025

five big booms for china

Corporate Mór

Jan 26, 2025

sco

https://arxiv.org/pdf/2501.12948

"After thousands of RL steps, DeepSeek-R1-Zero exhibits super performance on reasoning benchmarks. For instance, the pass@1 score on AIME 2024 increases from 15.6% to 71.0%, and with majority voting, the score further improves to 86.7%, matching the performance of OpenAI-o1-0912." I'm listening
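
For anyone wondering how pass@1 and majority voting differ, here is a rough sketch in Python. It assumes you sample several answers per problem and compare them to a known correct answer as exact strings. The function names and the toy data are made up for illustration and are not taken from the paper's code.

```python
from collections import Counter


def pass_at_1(samples_per_problem, answers):
    """Mean per-problem fraction of sampled answers that are correct."""
    per_problem = [
        sum(s == truth for s in samples) / len(samples)
        for samples, truth in zip(samples_per_problem, answers)
    ]
    return sum(per_problem) / len(per_problem)


def majority_vote_accuracy(samples_per_problem, answers):
    """Fraction of problems where the most common sampled answer is correct."""
    hits = 0
    for samples, truth in zip(samples_per_problem, answers):
        top_answer, _count = Counter(samples).most_common(1)[0]
        hits += top_answer == truth
    return hits / len(answers)


# Toy data (hypothetical): 3 problems, 4 sampled answers each.
samples = [
    ["42", "42", "17", "42"],  # 3 of 4 correct, majority is correct
    ["7", "9", "9", "9"],      # 3 of 4 correct, majority is correct
    ["1", "2", "3", "3"],      # 1 of 4 correct, majority ("3") is wrong
]
truth = ["42", "9", "1"]

print(f"pass@1:        {pass_at_1(samples, truth):.3f}")
print(f"majority vote: {majority_vote_accuracy(samples, truth):.3f}")
```

Voting can beat pass@1 because a model that is right most of the time on a problem gets full credit for it once the samples are pooled, which is the kind of gap the quoted 71.0% vs 86.7% numbers show.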

Undecided

Jan 26, 2025

It’s ok guys, we’re just gonna throw another $600 billion at AI and hire a bunch of H1Bs.

We'll surely catch up.

WRU

Jan 26, 2025

viscera

Personality

Jan 26, 2025

Did this nigga @op really delete my post?

Plight2

Jan 26, 2025

very interesting !

Personality

Jan 26, 2025

Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee

delete this, p****

sco

Jan 26, 2025

Plight2

people on twitter are all running it themselves. you just have to pay for API tokens, which are extremely cheap tbf.

yeah i have some versions of it running on my mac (google LM Studio it’s super easy to set up and free)

but i mean easy for researchers to reproduce the same work/make use of the same techniques etc so that others can build on their work, since it goes into detail about how it works and how they did it, which is how scientific research is supposed to work (afaik?)

compare that paper with openai’s gpt4 paper which barely gives any actual info about how they created it at all despite being 100+ pages (mostly self congratulatory “look how good this s*** we did is”) — arxiv.org/pdf/2303.12712

Cherrywine1

Jan 26, 2025

Personality

Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee

delete this, p****

BP poster killin me

Plight2

Jan 26, 2025

Personality

Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee

delete this, p****

anti woke police in china is crazy

Your Chinese Spy 🇨🇳

Jan 26, 2025

Personality

Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee

delete this, p****

We love Panther!

Ya boy

Jan 26, 2025

YANDHI

Wow, this has spread fast in the last couple of days. It's been a development since around Christmas when they released v3, but it slowly exploded after they released their reasoning model on inauguration day.

And now Bytedance and others are popping up with similar LLMs.

Years of narratives on Chinese AI being X years behind and researchers lacking innovation destroyed in a week. They're basically doing this handicapped because of chip sanctions too

China's been at the forefront of computer vision for the last ten years. They realize that the technology itself isn't what's important, it's the application of it. If you really want to be the leader in the AI space you need to make it easy for anyone to start playing around with it.

...