this post was submitted on 10 Jun 2023
178 points (98.9% liked)

Programmer Humor

32557 readers
480 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

founded 5 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 3 points 1 year ago (2 children)

Ah, but where do you find the training set of all of the human-written good commit messages? ๐Ÿ˜ƒ

[โ€“] csm10495 2 points 1 year ago

Came to say this. Take my up vote.

[โ€“] [email protected] 1 points 1 year ago (1 children)

I work in bioinformatics and this is the kind of thing I keep trying to communicate to people in the field. Yes, these AI tools (like AlphaFold) are amazing, but if there's a significant gap in their training data, the AI is going to have that gap too (most of the structures in the protein database were solved via X ray crystallography, which isn't great for studying highly flexible or disordered proteins)

[โ€“] [email protected] 1 points 1 year ago

Yes. My (minimally informed from a single class) understanding is that it sort-of depends on the problem too. Like perhaps in looking at all the data on proteins, the neural network might notice a pattern in protein folding is applicable to the tweaked problem. Of course, there is no guarantee that such a generally applicable rule exists. And even if it does, it might not be discovered by the net before overtraining occurs.