this post was submitted on 26 Jun 2024
844 points (97.8% liked)

Technology

63455 readers
4079 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

GitCode, a git-hosting website operated Chongqing Open-Source Co-Creation Technology Co Ltd and with technical support from CSDN and Huawei Cloud.

It is being reported that many users' repository are being cloned and re-hosted on GitCode without explicit authorization.

There is also a thread on Ycombinator (archived link)

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 130 points 8 months ago (13 children)

The vast majority of projects on GitHub is open-source and forkable, why would that need authorization?

It's... suspicious that China's doing it en masse, but there's nothing wrong in cloning or forking a repo last i heard.

[–] [email protected] 110 points 8 months ago (8 children)

It's not about authorization. They want to build a knowledge base for when the Great Firewall gets some more filters. Just like russias mirror of wikipedia which is heavily edited to discredit the west.

[–] 31337 5 points 8 months ago

This seems like the most plausible explanation. Only other thing I can think of is they want to develop their own CoPilot (which I'm guessing isn't available in China due to the U.S. AI restrictions?), and they're just using their existing infrastructure to gather training data.

load more comments (7 replies)
load more comments (11 replies)