this post was submitted on 13 Jan 2025
249 points (94.9% liked)

Linux

49097 readers
793 users here now

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

founded 5 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 83 points 1 week ago

It's not every day that you see actually useful applications of AI, but this might be one.

[–] [email protected] 50 points 1 week ago

It is probably good that OS community are exploring this however I'm not sure the technology is ready (or will ever be maybe) and it potentially undermines the labour intensive activity of producing high quality subtitling for accessibility.

I use them quite a lot and I've noticed they really struggle on key things like regional/national dialects, subject specific words and situations where context would allow improvement (e.g. a word invented solely in the universe of the media). So it's probably managing 95% accuracy which is that danger zone where its good enough that no one checks it but bad enough that it can be really confusing if you are reliant on then. If we care about accessibility we need to care about it being high quality.

[–] [email protected] 44 points 1 week ago (3 children)

If youtube transcriptions is anything to go by this won't be great. But I'm optimistic

[–] [email protected] 34 points 1 week ago (3 children)

Youtube transcriptions are suprisingly abysmal considering what technology google already has at hand.

[–] [email protected] 10 points 1 week ago (1 children)

I find them pretty good for English spoken by native speakers. For anything else it's horrible.

[–] [email protected] 1 points 6 days ago

As long as they are talking about normal things and not playing D&D 😃

load more comments (2 replies)
[–] [email protected] 7 points 1 week ago (1 children)

They're helpful to my deaf ears, even when they're wrong (50% of the words) they do give me a solid idea of what is being said together with what the audio sounds like.

With it, I get almost everything correct. Without it, I understand near to nothing.

This only goes for English spoken by Americans and sometimes London Britons, sadly, nothing else get detected nearly as good enough, so I can't enjoy YouTube in my native language (Dutch), but being able to consume English YouTube already helps a lot!

[–] [email protected] 4 points 1 week ago

That is very true. It's hard to find local subtitles to a lot of stuff. And the whole deaf angle :)

[–] jayandp 3 points 1 week ago* (last edited 1 week ago)

I've been messing with more recent open-source AI Subtitling models via Subtitle Editor which has a nice GUI for it. Quality is much better these days, at least for English. It still makes mistakes, but the mistakes are on the level of "I misheard what they said and had little context for the conversation" or "the speaker has an accent which makes it hard to understand what they're saying" mistakes, which is way better than most YouTube Auto Transriptions I've seen.

[–] [email protected] 41 points 1 week ago

This is not by default bad thing, if it is something you only use when you decide to do so, when you don't have other subtitles available tbh. I hate AI slop too but people just go to monkey brain rage mode when they read AI and stop processing any further information.

I'd still always prefer human translated subtitles if possible. However, right now I'm looking into translating entire book via LLM cause it would be only way to read that book, as it is not published in any language I speak. I speak English well enough, so I don't really need subtitles, just like to have them on so I won't miss anything.

For English language movies, I'd probably just watch them without subtitles if those were AI, as I don't really need them, more like nice to have in case I miss something. For languages I don't understand, it might be good, although I wager it will be quite bad for less common languages.

[–] [email protected] 26 points 1 week ago (1 children)

The whole knee jerk reaction against anything AI related is tiresome and utterly irrational. This seems like a perfectly legitimate use of technology. If I have a movie in a language I don't know and I can't find subs for it, then I'd much rather have AI subs than nothing at all.

[–] [email protected] 2 points 1 week ago

Yea. Sometimes I just can't process what they are saying because of my adhd ass and subs really help.

[–] [email protected] 25 points 1 week ago (1 children)

Oh so that wasn't a joke from their booth.

This seems really out of place, but locally ran auto subtitles from ethically sourced AI would be great.

It's just that there's two very big conditions in that sentence there.

[–] zarkanian 12 points 1 week ago (2 children)

Which AI is the ethically-sourced one

[–] [email protected] 14 points 1 week ago

There are a number of open weight open source models out there with all their data sourced from the public domain. Look up BLOOM and Falcon. There are others.

[–] [email protected] 5 points 1 week ago* (last edited 1 week ago)

JetBrains' AI code suggestions were only trained on code where authors gave explicit permission for it, but that's the only one I know from the top of my head. Most chat-oriented LLMs (ChatGPT, Claude, Gemini...) were almost certainly trained using corporate piracy.

[–] [email protected] 23 points 1 week ago (1 children)

Not against this feature, but this quote made me laugh:

… once this is in place, people won't have to scour the internet for sourcing subtitles to their favorite movies, shows, or even anime.

As if MTL will get anywhere near the nuance of a properly made human translation.

[–] [email protected] 4 points 6 days ago* (last edited 6 days ago)

Personally, I would be happy even if it didn't translate it but were able to give some half decent transcription of, at least, English voice into English text. I prefer having subtitles, even when I speak the language, because it helps in noisy environments and/or when the characters mumble / have weird accents.

However, even that would likely be difficult with a lightweight model. Even big companies like Google often struggle with their autogenerated subtitles. When there's some very context-specific terminology, or uncommon names, it fumbles. And adding translation to an already incorrect transcript multiplies the nonsense, even if the translation were technically correct.

[–] [email protected] 16 points 1 week ago (1 children)

If it's opt in/opt out then am fine with that.

[–] [email protected] 45 points 1 week ago (6 children)

Not only is it opt in, it's also running fully locally on your machine.

[–] [email protected] 3 points 1 week ago

My biggest issue with that is the amount of bloat a full local LLM implementation would add.

But if it's an optional module that you can choose to add (or choose not to add) after the fact, I have no complaint.

load more comments (5 replies)
[–] [email protected] 15 points 1 week ago

I've seen some pretty piss poor implementations on streaming apps but if anyone can get it right it's VLC

[–] [email protected] 13 points 1 week ago

Pandora's Box is already open. Might as well make use of it.

[–] [email protected] 6 points 1 week ago (1 children)

It won't be better than human translated ones but begter than no subtitles. I don't think even humans can make subtitles correctly without knowing context

[–] [email protected] 10 points 1 week ago (1 children)

Honestly, if it can generate subtitle files it'll be a huge benefit to people creating subtitles. It's way easier to start with bad subs and fix them than it is to write from scratch.

[–] [email protected] 4 points 1 week ago

Yeah true. Good feature anyways

[–] [email protected] 5 points 1 week ago (1 children)

I'm ready to deactivate it if it comes with any active component.

[–] [email protected] 8 points 1 week ago (6 children)

What do you mean by active component? Is processing the audio being played back to add subtitles active?

[–] [email protected] 6 points 1 week ago

Sending the audio to an LLM in the sky. But I assume it would be local?

load more comments (5 replies)
[–] [email protected] 5 points 1 week ago (2 children)

Im curious What makes what VLC is doing qualify as artificial intelligence instead of just an automated transcription plugin?

Automated transcription software has been around for decades, I totally understand getting in on the ai hype train but i guess I'm confused as to if software from years past like "dragon naturally speaking" or Shazam are also LLMs that predate openAI or is how those services worked to identify things different from how modern llms work?

[–] VintageGenious 13 points 1 week ago* (last edited 1 week ago)

automated transcription is AI, neural networks are just better AI sometimes

load more comments (1 replies)
[–] [email protected] 4 points 1 week ago (1 children)

Meh, I'll just stick with mpv.

[–] [email protected] 3 points 1 week ago (1 children)

How is MPVs impementation? Does it work fairly well?

[–] [email protected] 5 points 1 week ago* (last edited 1 week ago)

Its a command line multimedia player. It's implementation is ideal for minimalists, and easily understood by reading the man pages.

It works very well imo.

[–] [email protected] 3 points 1 week ago

I wonder how good it is.

Does it translate from audio or from text?

Does it translate multiple languages, if video has a, b, c languages does it translate all to x.

Does user need to set input language?

[–] [email protected] 3 points 1 week ago (1 children)

What would be actually cool if it could translate foreign movies based on audio and add the English subtitles to it.

[–] [email protected] 3 points 1 week ago (1 children)

Translating a transcription should be easy.

[–] [email protected] 2 points 1 week ago

Yes, if the transcript feature works well for the original language.

load more comments
view more: next ›