Learn Machine Learning

569 readers

4 users here now

Welcome! This is a place for people to learn more about machine learning techniques, discuss applications and ask questions.

Example questions:

"Should I use a deep neural network for my audio classification task?"
"I'm working with a small dataset, what can I do to make my model generalize well?"
"Is there a library available that implements function X in language Y?"
"I want to learn more about the math behind machine learning technique A, where should I start?"

Please do:

Be kind to new people
Post guides and tutorials that you find helpful
Link to open/free sources instead of paywalled when possible

Please don't:

Post news articles / memes (there are other machine learning/AI communities for this)

Other communities in this area:

Similar subreddits: r/MLquestions, r/askmachinelearning, r/learnmachinelearning

founded 2 years ago

MODERATORS

ShadowAether

[email protected]

Google open sources tools to support AI model development (techcrunch.com)

submitted 1 year ago by [email protected] to c/learnmachinelearning

1 comments fedilink

Finetune LLMs on your own consumer hardware using tools from PyTorch and Hugging Face ecosystem (pytorch.org)

submitted 1 year ago by [email protected] to c/learnmachinelearning

0 comments fedilink

Understanding GPU Memory 2: Finding and Removing Reference Cycles (pytorch.org)

submitted 2 years ago by [email protected] to c/learnmachinelearning

0 comments fedilink

PyTorch: Compiling NumPy code into C++ or CUDA via torch.compile (pytorch.org)

submitted 2 years ago by [email protected] to c/learnmachinelearning

0 comments fedilink

Introduction to Kernel Methods for Machine Learning (seis.bristol.ac.uk)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

Kernel methods give a systematic and principled approach to training learning machines and the good generalization performance achieved can be readily justified using statistical learning theory or Bayesian arguments. We describe how to use kernel methods for classification, regression and novelty detection and in each case we find that training can be reduced to optimization of a convex cost function.

The Kernel Cookbook: Advice on Covariance functions (www.cs.toronto.edu)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

If you've ever asked yourself: "How do I choose the covariance function for a Gaussian process?" this is the page for you. Here you'll find concrete advice on how to choose a covariance function for your problem, or better yet, make your own.

An Intuitive Tutorial to Gaussian Processes Regression (arxiv.org)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

This tutorial aims to provide an intuitive understanding of the Gaussian processes regression. Gaussian processes regression (GPR) models have been widely used in machine learning applications because of their representation flexibility and inherent uncertainty measures over predictions.

Applied Machine Learning (Cornell Tech CS 5787, Fall 2020) (www.youtube.com)

submitted 2 years ago by [email protected] to c/learnmachinelearning

0 comments fedilink

DeepMind x UCL | Reinforcement Learning Course 2018 (www.youtube.com)

submitted 2 years ago by [email protected] to c/learnmachinelearning

0 comments fedilink

Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes (blog.research.google)

submitted 2 years ago by [email protected] to c/learnmachinelearning

0 comments fedilink

Large language models (LLMs) are data-efficient but their size makes them difficult to deploy in real-world scenarios.

"Distilling Step-by-Step" is a new method introduced by Google researchers that enables smaller models to outperform LLMs using less training data. This method extracts natural language rationales from LLMs, which provide intermediate reasoning steps, and uses these rationales to train smaller models more efficiently.

In experiments, the distilling step-by-step method consistently outperformed LLMs and standard training approaches, offering both reduced model size and reduced training data requirements.

-4

Dr Stephen Wolfram says THIS about ChatGPT, Natural Language and Physics (www.youtube.com)

submitted 2 years ago by [email protected] to c/learnmachinelearning

3 comments fedilink

[Resource] Understanding UMAP - Google PAIR (pair-code.github.io)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

Has nice interactive examples and UMAP vs t-SNE

[Resource] MIT OpenCourseWare: Introduction To Machine Learning (ocw.mit.edu)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

DuckAI - An open-source ML research community (lemmy.intai.tech)

submitted 2 years ago* (last edited 2 years ago) by [email protected] to c/learnmachinelearning

2 comments fedilink

https://duckai.org/

cross-posted from: https://lemmy.intai.tech/post/134262

DuckAI is an open and scalable academic lab and open-source community working on various Machine Learning projects. Our team consists of researchers from the Georgia Institute of Technology and beyond, driven by our passion for investigating large language models and multimodal systems.

Our present endeavors concentrate on the development and analysis of a variety of dataset projects, with the aim of comprehending the depth and performance of these models across diverse domains.

Our objective is to welcome people with a variety of backgrounds to cutting-edge ML projects and rapidly scale up our community to make an impact on the ML landscape.

We are particularly devoted to open-sourcing datasets that can turn into an important infrastructure for the community and exploring various ways to improve the design of foundation models.

[Resource] Style Guide for Python Code: PEP 8 (peps.python.org)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

[Resource] MIT OpenCourseWare: Statistical Learning Theory (ocw.mit.edu)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

[Resource] MIT OpenCourseWare: Mathematics Of Machine Learning (ocw.mit.edu)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

Broadly speaking, Machine Learning refers to the automated identification of patterns in data. As such it has been a fertile ground for new statistical and algorithmic developments. The purpose of this course is to provide a mathematically rigorous introduction to these developments with emphasis on methods and their analysis.

[Resource] Durham University Materials for COMP3547 (Deep Learning) and COMP3667 (Reinforcement Learning) from Dr. Robert Lieck (github.com)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

Includes lectures, lecture notes and assignments.

Lectures for Deep Learning: https://www.youtube.com/playlist?list=PLMsTLcO6etti_SObSLvk9ZNvoS_0yia57

Lectures for Reinforcement Learning: https://www.youtube.com/playlist?list=PLMsTLcO6ettgmyLVrcPvFLYi2Rs-R4JOE

[Resource] Rules of Machine Learning from Google (developers.google.com)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

A good set of best practices for deployment that isn't language-specific

[Resource] Coding Practices for Python/ML (github.com)

submitted 2 years ago* (last edited 2 years ago) by ShadowAether to c/learnmachinelearning

0 comments fedilink

Coding nowadays is a big part of ML and while it's important that the model works well, it's also important that the code is written properly too.

Link is the general python version, ML-specific version here: https://github.com/davified/clean-code-ml

Video version: https://bit.ly/2yGDyqT

[Resource] Tutorial: Image Recognition with CNN in Matlab (hevpdd.ca)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

Introduces neural networks, the convolution operation, a few critical machine learning concepts and some state-of-the-art CNN models. Includes a hands-on Matlab tutorial (and code) demonstrating the model configuration, training process, and performance evaluation using the MNIST dataset.

[Resource] Tutorial: State of Charge Estimation with EKF and SVSF in Matlab (hevpdd.ca)

submitted 2 years ago* (last edited 2 years ago) by ShadowAether to c/learnmachinelearning

0 comments fedilink

This tutorial describes the process for the state of charge (SOC) estimation of Li-Ion cells using an equivalent circuit model. It helps students create and run a SOC estimation strategy based on the 3rd-order R-RC model in MATLAB-Simulink. The tutorial starts with a general overview of state estimation using the extended Kalman filter (EKF) and the novel smooth variable structure filter (SVSF) method.

[Resource] Standford University Cheat Sheets for ML (web version) (stanford.edu)

submitted 2 years ago* (last edited 2 years ago) by ShadowAether to c/learnmachinelearning

0 comments fedilink

I'm not sure if I'd call a 10+ page pdf a "cheat sheet" but they are good resources

Mathematics for Neural Networks (sh.itjust.works)

submitted 2 years ago by ShadowAether to c/learnmachinelearning

0 comments fedilink

Can't say I agree with all of this 100% (I'd put backpropagation in the math side, add in model evaluation, remove convex optimization, etc) plus it's kind of an oversimplification but the basics are there

[Resource] Materials from CORNELL CS4780/CS5780: Machine Learning for Intelligent Systems (self.learnmachinelearning)

submitted 2 years ago* (last edited 2 years ago) by ShadowAether to c/learnmachinelearning

0 comments fedilink

Lecture notes: https://www.cs.cornell.edu/courses/cs4780/2018fa/syllabus/

Recorded lectures: https://www.youtube.com/playlist?list=PLl8OlHZGYOQ7bkVbuRthEsaLr7bONzbXS