this post was submitted on 13 Jun 2023
4 points (100.0% liked)

Learn Machine Learning

520 readers
1 users here now

Welcome! This is a place for people to learn more about machine learning techniques, discuss applications and ask questions.

Example questions:

Please do:

Please don't:

Other communities in this area:

Similar subreddits: r/MLquestions, r/askmachinelearning, r/learnmachinelearning

founded 1 year ago
MODERATORS
 

Not OP. This question is being reposted to preserve technical content removed from elsewhere. Feel free to add your own answers/discussion.

Original question:

I got a data set from high performance liquid chromatography, because hplc is expensive we only got about 39 data point. Each data point is 9 dimension, representing 9different substances concentration. I tried different network and the accuracy is not higher than 50%. (We have four classes) however the KNN has a accuracy of more than 90%. I remember hearing that neural network is not good on small data set. Is this the reason? I have not tried svm or other traditional machine learning models yet. Should I try them if yes which one

top 2 comments
sorted by: hot top controversial new old
[โ€“] ShadowAether 3 points 1 year ago* (last edited 1 year ago)

Original answer:

NNs can require a lot of data to train. For smaller datasets, knn or svm can be a better choice especially if the classification boundary does not need to be very complex or classes are tightly clustered within-class and far away from other classes. Also keep in mind the bayes error of your problem which is the best value you would be able to get over the set you want to generalize to with any classifier and is based on the separability and measurement noise in your data.

[โ€“] [email protected] 2 points 1 year ago

I remember hearing that neural network is not good on small data set.

That's almost definitely it. Neural networks are good for high-dimensional problems with lots of available training data.