this post was submitted on 19 Dec 2023
84 points (94.7% liked)

[–] [email protected] 45 points 8 months ago (3 children)

Homoglyphs? Invisible text? Bidirectional text? Just highlight every line that contains non-ASCII characters in a yellow warning color and require someone to vet it. Maybe make localization data an exception.
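
A minimal sketch of that check in Python, assuming the file has already been decoded into a string. The function name `flag_non_ascii_lines` is made up for illustration and does not come from any existing tool; the localization exception mentioned above is not handled here.

```python
def flag_non_ascii_lines(source: str) -> list[tuple[int, str]]:
    """Return (1-based line number, line) for every line with a non-ASCII character."""
    flagged = []
    # Split on "\n" only: str.splitlines() also splits on some non-ASCII
    # separators (e.g. U+2028), which would hide them from this check.
    for number, line in enumerate(source.split("\n"), start=1):
        if not line.isascii():
            flagged.append((number, line))
    return flagged


if __name__ == "__main__":
    # Second line uses a Cyrillic letter (U+043E) in place of the Latin "o".
    sample = "total = 1\nt\u043etal = 2\nprint(total)\n"
    for number, line in flag_non_ascii_lines(sample):
        print(f"warning: line {number} contains non-ASCII characters: {line!r}")
```

Printing the line with `!r` shows the escaped code point, so a homoglyph shows up as `\u043e` instead of looking like an ordinary letter.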

[–] [email protected] 12 points 8 months ago (3 children)

This doesn't work for code bases written in non-English languages, especially East Asian languages.

Any line containing an identifier written as a word in that language would be highlighted.

More and more programming languages are supporting unicode identifiers for this use case.

[–] [email protected] 11 points 8 months ago (2 children)

So it won't work for 0.0001% of all github projects.

[–] [email protected] 5 points 8 months ago

I'd suggest having the occasional look at the "most popular repos" ranking. It's about 50% Chinese.

Super-interesting sometimes as it shows completely different tech trends.

[–] [email protected] 1 points 8 months ago

I know, right?

It's wild that an American company primarily doing business in the West would have a bias towards English.

[–] [email protected] 6 points 8 months ago

Yeah, just don't. Allowing people to code in anything other than English is a disservice, plain and simple.

Inb4, I'm not being US-centric, Latin ain't even my native alphabet.

[–] [email protected] 6 points 8 months ago (2 children)

Very simple solution actually. Here I was thinking we'd need AI to solve it.

[–] [email protected] 17 points 8 months ago

People would call that solution AI these days. If it has at least one if statement, they call it AI.

[–] [email protected] 3 points 8 months ago

We say we have AI to get VC funding.

[–] [email protected] 1 points 8 months ago (1 children)

Or just highlight the non-ASCII character itself.

[–] [email protected] 3 points 8 months ago (1 children)

Doesn't work if it's invisible.

[–] [email protected] 1 points 8 months ago

What about a box around it?
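
One way to approximate the "box" in plain text is to replace each invisible character with a visible marker. A rough sketch, assuming that "invisible" means Unicode general category `Cf` (format characters, which covers zero-width spaces and bidirectional controls). The name `reveal_invisible` and the `[U+XXXX]` marker format are invented for this example, and other hard-to-see characters, such as variation selectors, are not covered.

```python
import unicodedata


def reveal_invisible(text: str) -> str:
    """Replace Unicode format characters (category Cf) with a visible [U+XXXX] marker."""
    out = []
    for ch in text:
        if unicodedata.category(ch) == "Cf":
            # Stand-in for drawing a box: show the code point explicitly.
            out.append(f"[U+{ord(ch):04X}]")
        else:
            out.append(ch)
    return "".join(out)


if __name__ == "__main__":
    # A zero-width space hidden inside an identifier.
    print(reveal_invisible("ad\u200bmin"))
```

An editor or diff viewer could apply the same substitution before rendering, so the reviewer sees the marker even though the character itself has no width.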