Reply
  • Jan 26
    ·
    1 reply

    Wow this has spread fast in the last couple days. Its been a development since around Christmas when they released v3 but slowly exploded after their reasoning model on inauguration day.

    The funniest thing about this is the fact Deepseek is open-source and their entire team publishes full research papers spilling the secrets and formulas they rely on. A Chinese lab is leading democratization of AI

    And now Bytedance and others are popping up with similar LLMs.

    Years of narratives on Chinese AI being X years behind and researchers lacking innovation destroyed in a week. They're basically doing this handicapped because of chip sanctions too

  • Jan 26

    that's kinda fire tbh.

  • BrainWorms4U

    Oof

    https://twitter.com/shangguanjiewen/status/1883058711520006177

    This has even hit PhD holders and professors who were at prestige universities and are leaving because of geopolitics/discrimination. Its been very common now to see a new story pop up of a veteran scientist with decades in the U.S going home to China to take up a cozy department head at their university.

    3 days ago: scmp.com/news/china/science/article/3295774/chen-jing-award-winning-computer-scientist-and-blockchain-expert-returns-china-us

    3 weeks ago: scmp.com/news/china/science/article/3293288/ex-dassault-aerospace-physicist-leaves-us-china-lead-new-energy-research

    Those are just two quick examples. I can share years worth of this

  • Jan 26
    dotM

    How much of OpenAI’s chatgpt did they use to build their model?

    how much of the public internet has openai used to train their models? it’s fair game atp

    venturebeat.com/ai/why-everyone-in-ai-is-freaking-out-about-deepseek

  • Jan 26
    ·
    4 replies
    Squilliam

    Or simply it takes a lot of resources to get started but it takes a lot fewer resources to catch up. Best example I can give, it took America a couple hundred years to industrialize but it took Japan only a few decades. But that’s because the blueprints were already there so it’s way more efficient to catch up

    Catch up? They’ve surpassed us and will continue to do so

    And this isn’t infrastructure or something that takes decades and trillions to tear down and build up. This is just s*** behind a computer screen

    Americans too busy crying about work from home because they desperately want to get keep getting paid $100/hr on top of RSU’s to play video games, make tik toks, and eat out.

  • viscera

  • Shampoo Bracelets

    Catch up? They’ve surpassed us and will continue to do so

    And this isn’t infrastructure or something that takes decades and trillions to tear down and build up. This is just s*** behind a computer screen

    Americans too busy crying about work from home because they desperately want to get keep getting paid $100/hr on top of RSU’s to play video games, make tik toks, and eat out.

    What’s wrong with wanting to work from home

  • Shampoo Bracelets

    Catch up? They’ve surpassed us and will continue to do so

    And this isn’t infrastructure or something that takes decades and trillions to tear down and build up. This is just s*** behind a computer screen

    Americans too busy crying about work from home because they desperately want to get keep getting paid $100/hr on top of RSU’s to play video games, make tik toks, and eat out.

    Okay boomer.

  • Nessy 🦎
    Jan 26
    ·
    1 reply
    kusa

    the real crazy thing is this is completely open-source and from free research available, their work is published and literally anyone can benefit from it.

    open source won again

    and deepseek is genuinely a huge marvel when comparing to gpt. solving things at a rate much faster, and providing answers that are more accurate lolz

    it's not open source the training code and the data aren't available only the research paper. to be fair the paper shares a lot more than most papers but still

  • Open source will always catch up and win

  • Jan 26
    ·
    1 reply
    Nessy

    it's not open source the training code and the data aren't available only the research paper. to be fair the paper shares a lot more than most papers but still

    it’s enough for people to replicate it isn’t it? that’s all that matters

  • Jan 26
    ·
    1 reply
    sco

    it’s enough for people to replicate it isn’t it? that’s all that matters

    people on twitter are all running it themselves. you just have to pay for API tokens, which are extremely cheap tbf.

  • five big booms for china

  • Jan 26
    sco

    https://arxiv.org/pdf/2501.12948

    "After thousands of RL steps, DeepSeek-R1-Zero exhibits super performance
    on reasoning benchmarks. For instance, the pass@​1 score on AIME 2024 increases from 15.6% to
    71.0%, and with majority voting, the score further improves to 86.7%, matching the performance
    of OpenAI-o1-0912."
    I'm listening

  • It’s ok guys, we’re just gonna throw another $600 billion at AI and hire a bunch of H1Bs.

    Well surely catch up.

  • Jan 26
    viscera

  • Did this nigga @op really delete my post?

  • very interesting !

  • Jan 26
    ·
    7 replies

    Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee

    delete this, p****

  • Jan 26
    Plight2

    people on twitter are all running it themselves. you just have to pay for API tokens, which are extremely cheap tbf.

    yeah i have some versions of it running on my mac (google LM Studio it’s super easy to set up and free)

    but i mean easyfor researchers to reproduce the same work/make use of the same techniques etc so that others can build on their work, since it goes into detail about how it works and how they did it, which is how scientific research is supposed to work (afaik?)

    compare that paper with openai’s gpt4 paper which barely gives any actual info about how they created it at all despite being 100+ pages (mostly self congratulatory “look how good this s*** we did is”) — arxiv.org/pdf/2303.12712

  • Personality

    Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee

    delete this, p****

    BP poster killin me

  • Personality

    Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee

    delete this, p****

    anti woke police in china is crazy

  • Jan 26
    ·
    2 replies
    Personality

    Niggas got me f***ed up on gooooooooood f*** china till the death of meeeeeeeee

    delete this, p****

    We love Panther!

  • YANDHI

    Wow this has spread fast in the last couple days. Its been a development since around Christmas when they released v3 but slowly exploded after their reasoning model on inauguration day.

    The funniest thing about this is the fact Deepseek is open-source and their entire team publishes full research papers spilling the secrets and formulas they rely on. A Chinese lab is leading democratization of AI

    And now Bytedance and others are popping up with similar LLMs.

    Years of narratives on Chinese AI being X years behind and researchers lacking innovation destroyed in a week. They're basically doing this handicapped because of chip sanctions too

    Chinas been on the forefront of computer vision for the last ten years, they realize that the technology itself isn’t what’s important, it’s the application of it. If you really want to be the leader in the ai space you need to make it easy for anyone to start playing around with it.