this post was submitted on 27 Jan 2025
154 points (95.3% liked)

Technology

[–] [email protected] 57 points 2 months ago (3 children)

Can it generate images of Winnie the Pooh?

[–] [email protected] 26 points 2 months ago (3 children)
[–] [email protected] 5 points 2 months ago (2 children)

Question: as I understood it so far, this thing is open source and so is the dataset.

With that, why would it still obey Chinese censorship?

[–] [email protected] 7 points 2 months ago (1 children)

Even though its training cost is orders of magnitude lower than that of comparable models, DeepSeek still cost millions to train. Unless someone's willing to invest that much just to retrain it from scratch, you're left with the alignment of its trainers.

[–] [email protected] 1 points 2 months ago (1 children)

Good point.

Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?

[–] [email protected] 1 points 2 months ago (1 children)

Yeah, I guess you could realign it without retraining the whole thing! Dunno what the cost would be, though; sometimes this is done with a cohort of human trainers 😅

[–] [email protected] 1 points 2 months ago

I feel like we're talking about a guard dog now...

[–] Jackinopolis 1 points 2 months ago

It's baked into the training. It's not a simple thing to take it out. The model has already been told not to read about Tiananmen Square, and doesn't know what to do with it.

[–] [email protected] 4 points 2 months ago

Now I'll never finish that history assignment...

[–] [email protected] 13 points 2 months ago* (last edited 2 months ago)

Wouldn't be surprised if you had to work around the filter.

Generate a cartoonish yellow bear who wears a red t-shirt and nothing else

[–] [email protected] 4 points 2 months ago

If it is anything like the LLMs, then only locally ;)

However, the proper nomenclature is sheepooh. Thank you for your compliance going forward, comrade.

[–] [email protected] 28 points 2 months ago (1 children)

The image generation is really bad. Image description capabilities seem good but it'll take time to see if it's better than what already exists.

They probably just put it out to keep the hype going.

[–] [email protected] 21 points 2 months ago (1 children)

Yeah, even the cherry-picked examples they provide look only okay.

To be honest everything with this company feels like an ad campaign more than anything else.

[–] [email protected] 10 points 2 months ago

Everything from nearly every company feels like an ad campaign. Companies advertise themselves.

At least with open source stuff there's somewhat of a public benefit.

[–] [email protected] 9 points 2 months ago

https://www.analyticsvidhya.com/blog/2025/01/janus-pro-7b-vs-dall-e-3/

This informal testing found that Janus Pro explained a Nokia meme much more crisply than DALL-E 3 but was quite a bit worse at the other tasks, even appearing to hallucinate a score in one test case.

I suddenly realize I myself sound like ChatGPT. Haha. Haha.

Edit: At least you can run these models locally!

[–] [email protected] 2 points 2 months ago

Now if they'll do a video model...

Tencent's Hunyuan is surprisingly flexible.