this post was submitted on 27 Apr 2024
33 points (100.0% liked)

retrocomputing

3972 readers
3 users here now

Discussions on vintage and retrocomputing

founded 1 year ago
MODERATORS
top 2 comments
sorted by: hot top controversial new old
[โ€“] [email protected] 9 points 3 months ago

tl;dr for article and comments:

Microsoft mangled arrays and code comments with ASCII extended characters into UTF-8 encoding, which makes building many of these files impossible without a lot of extra work. This was mistakenly attributed to Git.

The timestamps for each file are also not preserved, which is debatably a valid criticism of Git (original file timestamps can technically be preserved on an archive like this, but it requires a large amount of work to line up those times and the correct commit times programmatically).

Several Microsoft employees involved in this project appeared in the comments and offered to work directly with the author to correct the character encoding issues. One Microsoft employee indicated that historical timestamps could likely not be included due to Microsoft corporate policy around personally identifiable information.

[โ€“] [email protected] 3 points 3 months ago

Other than the timestamps, it doesn't seem like any of the issues are related to Git.