this post was submitted on 29 Jan 2025
213 points (96.9% liked)

World News

47277 readers
3901 users here now

A community for discussing events around the World

Rules:

Similarly, if you see posts along these lines, do not engage. Report them, block them, and live a happier life than they do. We see too many slapfights that boil down to "Mom! He's bugging me!" and "I'm not touching you!" Going forward, slapfights will result in removed comments and temp bans to cool off.

We ask that the users report any comment or post that violate the rules, to use critical thinking when reading, posting or commenting. Users that post off-topic spam, advocate violence, have multiple comments or posts removed, weaponize reports or violate the code of conduct will be banned.

All posts and comments will be reviewed on a case-by-case basis. This means that some content that violates the rules may be allowed, while other content that does not violate the rules may be removed. The moderators retain the right to remove any content and ban users.


Lemmy World Partners

News [email protected]

Politics [email protected]

World Politics [email protected]


Recommendations

For Firefox users, there is media bias / propaganda / fact check plugin.

https://addons.mozilla.org/en-US/firefox/addon/media-bias-fact-check/

founded 2 years ago
MODERATORS
 

Summary

Alibaba has launched Qwen 2.5-Max, an AI model it claims outperforms DeepSeek-V3, OpenAI’s GPT-4o, and Meta’s Llama-3.1-405B.

The release, coinciding with Lunar New Year, reflects mounting competition in China’s AI sector after DeepSeek’s rapid rise.

DeepSeek’s recent advancements have pressured Chinese rivals like ByteDance and Baidu to upgrade their models and cut prices.

DeepSeek’s founder downplays price wars, focusing on artificial general intelligence (AGI). The company’s lean, research-focused structure contrasts with China’s tech giants, which face challenges in AI innovation.

all 28 comments
sorted by: hot top controversial new old
[–] [email protected] 62 points 4 months ago (1 children)
[–] [email protected] 39 points 4 months ago (1 children)

DeepSeek’s “big change” isn’t the performance of its model though; it’s that it is fully open and operates on a fraction of the resources.

Is alibaba’s model also open weights, open reasoning, free for anyone to run, and runnable (and trainable) on consumer hardware?

[–] [email protected] 39 points 4 months ago (1 children)

Call it "open weight" if you want, but it's not "fully open". The training data is still proprietary, and the model can't be accurately reproduced. It's proprietary in the same way that llama is proprietary.

[–] [email protected] 9 points 4 months ago* (last edited 4 months ago) (1 children)

But I could use it as a starting point for training and build from it with my own data. I could fork it. I couldn't fork llama, I don't have the weights.

[–] [email protected] 10 points 4 months ago

You can also fork proprietary code that is source available (depending on the specific terms of that particular proprietary license), but that doesn't make it open source.

Fair point about llama not having open weights though. So it's not as proprietary as llama. It still shouldn't be called open source if the training data that it needs to function is proprietary.

[–] [email protected] 29 points 4 months ago

Oh, good. Maybe they will stop trying to scrape my websites at some ridiculous rate using faked real browser UAs. I just blocked their whole ASN (AS45102) in the end.

[–] [email protected] 25 points 4 months ago (1 children)

Can't wait for Wish.com to release DickGargle 3.8-Ultra1

[–] [email protected] 1 points 4 months ago

Spitroastgroup 2.7 is coming out too! On no wait, its going back in. Nope, out. Ah actually in. Man they're just going in and out like mad.

[–] [email protected] 22 points 4 months ago

I thought for sure this was an Onion article

[–] [email protected] 15 points 4 months ago (1 children)
[–] [email protected] 22 points 4 months ago (2 children)

I already have the Temu AI psuedocode. Here you go:

10 print "Hi, how can I help?"

20 receive input

30 print "That's awesome! What else?"

40 go to 20

[–] [email protected] 11 points 4 months ago

Looks pretty basic to me!

[–] mindbleach 1 points 4 months ago

"And theeen?"

[–] [email protected] 15 points 4 months ago (1 children)

Any word on the training cost? I feel like that's the most relevant metric here.

[–] [email protected] 14 points 4 months ago

2 Reeses Cups and a pack of ramen. Alibaba are efficient!

[–] [email protected] 5 points 4 months ago

Oh cool, I was worried my 401k had almost sort of recovered from the last bombshell earlier this week...

[–] [email protected] -5 points 4 months ago (1 children)

DeepSeek_R1 outperform or equalzz GPT-1o is major newZ, but : 4o is much better than 1o. Now, Qwen-2.5Max outperforms GPT-4o ... watever the investment involved, this is even more important ( ! ).

[–] [email protected] 10 points 4 months ago (1 children)
[–] [email protected] -4 points 4 months ago (1 children)

😋 yes, why ? becauzzze of the zzZ ?

[–] [email protected] 9 points 4 months ago

Among other things, yes.