this post was submitted on 27 Jan 2025

154 points (95.3% liked)

DeepSeek releases new image model family (techcrunch.com)

submitted 2 months ago by [email protected] to c/[email protected]

top 17 comments

[–] [email protected] 57 points 2 months ago (3 children)

Can it generate images of Winnie the Pooh?

[–] [email protected] 26 points 2 months ago (3 children)

What happened in 1989?

[–] [email protected] 20 points 2 months ago

[–] [email protected] 5 points 2 months ago (2 children)

Question: as I understand it so far, this thing is open source and so is the dataset.

With that, why would it still obey Chinese censorship?

[–] [email protected] 7 points 2 months ago (1 children)

Even though its training cost is orders of magnitude lower than that of comparable models, DeepSeek still cost millions to train. Unless someone's willing to invest that much just to retrain it from scratch, you're left with the alignment of its trainers.

[–] [email protected] 1 points 2 months ago (1 children)

Good point.

Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?

[–] [email protected] 1 points 2 months ago (1 children)

Yeah, I guess you could realign it without retraining the whole thing! Dunno what the cost would be though, sometimes this is done with a cohort of human trainers 😅

[–] [email protected] 1 points 2 months ago

I feel like we're talking about a guard dog now...

[–] Jackinopolis 1 points 2 months ago

It's baked into the training. It's not a simple thing to take it out. The model has already been told not to read about Tiananmen Square, and doesn't know what to do with it.

[–] [email protected] 4 points 2 months ago

Now I'll never finish that history assignment...

[–] [email protected] 13 points 2 months ago* (last edited 2 months ago)

Wouldn't be surprised if you had to work around the filter.

Generate a cartoonish yellow bear who wears a red t-shirt and nothing else

[–] [email protected] 4 points 2 months ago

If it's anything like the LLMs, then only when run locally ;)

However, the proper nomenclature is sheepooh, thank you for your compliance going forward, comrade.

[–] [email protected] 28 points 2 months ago (1 children)

The image generation is really bad. Image description capabilities seem good but it'll take time to see if it's better than what already exists.

They probably just put it out to keep the hype going.

[–] [email protected] 21 points 2 months ago (1 children)

Yeah, even the cherry-picked examples they provide look only okay.

To be honest everything with this company feels like an ad campaign more than anything else.

[–] [email protected] 10 points 2 months ago

Everything from nearly every company feels like an ad campaign. Companies advertise themselves.

At least with open source stuff there's somewhat of a public benefit.

[–] [email protected] 9 points 2 months ago

https://www.analyticsvidhya.com/blog/2025/01/janus-pro-7b-vs-dall-e-3/

This informal testing found that Janus Pro explained a Nokia meme much more crisply than DALL-E 3 but was quite a bit worse on the other tasks, even appearing to hallucinate a score in one test case.

I suddenly realize I myself sound like ChatGPT. Haha. Haha.

Edit: At least you can run these models locally!

[–] [email protected] 2 points 2 months ago

Now if they'll do a video model...

Tencent's Hunyuan is surprisingly flexible