this post was submitted on 05 Dec 2023
125 points (87.9% liked)

Technology


It's frustrating when you're not understood — especially when you're trying to speak to Siri, Alexa, or another internet-connected device.

Voice datasets that power voice recognition services are owned by a handful of major companies, and they can wildly underrepresent the voices of people with non-dominant accents, Black, Indigenous, and other people of color, disabled people, and gender marginalised people. In fact, for people speaking other global languages, there may be no datasets at all.

That’s why Mozilla launched Common Voice — the world's largest public voice database, powered by the voices of volunteer contributors. Our goal is to teach machines how real people speak.

Today, we’re asking you to contribute to Common Voice, but we want you to choose how you’ll do it. Will you donate your voice to one of our Common Voice language datasets? Or will you make a $34 donation to Mozilla to support projects like this to reclaim the internet? (Or both!)

I'd be curious about the privacy concerns, but this might help a lot with underrepresented voice data. It might come down to whether someone wants more datasets for their particular voice or language more than they worry about the other concerns.

If your language/accent is already well documented, it might not help as much?

[–] [email protected] -3 points 11 months ago (5 children)

Why would black or gender marginalised people have a different voice?

[–] [email protected] 23 points 11 months ago

Because the genetics that build the vocal tract could be different, which in simple terms could mean a change in pitch. There are also more cultural differences such as speech cadence, accent, and inflection.

[–] [email protected] 10 points 11 months ago

Technically there are different dialects and a lot of unique slang, idioms and specific descriptive words.

In the trans and non-binary community, for instance, there are a lot of terms for how people identify and express themselves that, unless you know how they actually function, aren't easily distinguishable from slurs to outsiders. Take "Femboy" and (please forgive me mods) "Shemale". The former is a perfectly socially acceptable description of a guy (cis or otherwise) whose gender expression is very feminine. The latter is a slur that places emphasis on the birth sex characteristics of a trans woman and heavily implies she is a guy just pretending to be a woman; the term originates from the porn industry that fetishizes trans women.

You also have the usage of neo-pronouns. In languages with more gendered components than English, the words chosen sometimes reflect the gender of the speaker or of the person being addressed, or objects can be given a gendered connotation. Some languages are very gendered, and non-binary folk using those languages make whole new conventions. English speakers whine a remarkable amount that singular they/them pronouns are confusing, but they ain't seen nothing. In a lot of places your job title and status have no gender-neutral term, or culturally there are sentence structures that differ entirely along binary gender lines. Are you Latino or Latina? Guess we need a new word... Latinx!

[–] [email protected] 8 points 11 months ago

Have you seen how (even if stereotypical) black folks talk vs white folks (in media)?
Just turn on GTA 5 and listen a bit to Lamar and Franklin talking in missions.
So much slang is in there.

[–] [email protected] 1 points 11 months ago* (last edited 11 months ago)

Right? Like we all are made the same on the inside

[–] [email protected] -1 points 11 months ago

Dude, I clicked on the link pretty excited to volunteer. I have a professional mic, a little time, and a decent voice. The first thing that greets me is “Voice datasets also underrepresent: non-English speakers, people of colour, disabled people, women and LGBTQIA+ people.”

Well, I’m none of those. So maybe they don’t want my donation, or I’d spend time and they wouldn’t use my recordings... Sort of a letdown.