Learn Machine Learning

https://seis.bristol.ac.uk/~enicgc/pubs/2000/svmintro.pdf

Kernel methods give a systematic and principled approach to training learning machines, and the good generalization performance they achieve can be readily justified using statistical learning theory or Bayesian arguments. The paper describes how to use kernel methods for classification, regression, and novelty detection, and finds that in each case training reduces to the optimization of a convex cost function.
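
To make the three settings concrete, here's a minimal scikit-learn sketch (mine, not the paper's): `SVC`, `SVR`, and `OneClassSVM` are off-the-shelf kernel machines for the three tasks, and each `fit` call solves a convex problem under the hood.

```python
# Toy illustration (not from the paper): one kernel machine per task.
import numpy as np
from sklearn.svm import SVC, SVR, OneClassSVM

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))
y_class = (X[:, 0] + X[:, 1] > 0).astype(int)        # toy binary labels
y_reg = X[:, 0] ** 2 + 0.1 * rng.normal(size=100)    # toy regression target

clf = SVC(kernel="rbf", C=1.0).fit(X, y_class)       # classification
reg = SVR(kernel="rbf", C=1.0).fit(X, y_reg)         # regression
nov = OneClassSVM(kernel="rbf", nu=0.05).fit(X)      # novelty detection

print(clf.predict(X[:3]), reg.predict(X[:3]), nov.predict(X[:3]))
```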

https://www.cs.toronto.edu/~duvenaud/cookbook/

If you've ever asked yourself: "How do I choose the covariance function for a Gaussian process?" this is the page for you. Here you'll find concrete advice on how to choose a covariance function for your problem, or better yet, make your own.
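
As a taste of the cookbook's main trick, here's a hedged scikit-learn sketch of composing a covariance function from base kernels; the particular trend + periodicity + noise combination is just an illustrative choice, not a recommendation from the page.

```python
# Sketch: build a structured covariance by ADDING base kernels
# (kernels can also be multiplied); the choices here are illustrative.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, ExpSineSquared, WhiteKernel

kernel = (RBF(length_scale=10.0)                            # smooth long-term trend
          + ExpSineSquared(length_scale=1.0, periodicity=1.0)  # seasonality
          + WhiteKernel(noise_level=0.1))                   # observation noise

X = np.linspace(0, 10, 20).reshape(-1, 1)
y = np.sin(2 * np.pi * X).ravel() + 0.05 * X.ravel()

gp = GaussianProcessRegressor(kernel=kernel).fit(X, y)
print(gp.kernel_)  # hyperparameters after marginal-likelihood optimization
```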

arxiv.org

This tutorial aims to provide an intuitive understanding of Gaussian process regression (GPR). GPR models have been widely used in machine learning applications because of their representational flexibility and the inherent uncertainty measures over their predictions.
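
For a quick hands-on feel for the "inherent uncertainty measures" part, a small scikit-learn sketch (my own toy example; the tutorial itself derives GPR from first principles):

```python
# GPR returns a predictive mean AND a standard deviation at every test point.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

X = np.linspace(0, 10, 8).reshape(-1, 1)   # a few noise-free observations
y = np.sin(X).ravel()

gp = GaussianProcessRegressor(kernel=RBF(length_scale=1.0)).fit(X, y)
X_test = np.linspace(0, 10, 50).reshape(-1, 1)
mean, std = gp.predict(X_test, return_std=True)  # std widens away from the data
```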

blog.research.google

Large language models (LLMs) are data-efficient, but their size makes them difficult to deploy in real-world scenarios. "Distilling Step-by-Step" is a method from Google researchers that enables smaller models to outperform LLMs while using less training data. It extracts natural-language rationales from LLMs, which provide intermediate reasoning steps, and uses these rationales to train smaller models more efficiently. In experiments, distilling step-by-step consistently outperformed both LLMs and standard training approaches, reducing model size and training-data requirements at the same time.
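
As I read the post, the training objective for the small model is a weighted sum of a label-prediction loss and a rationale-generation loss. A hedged PyTorch sketch of that idea (function and argument names are mine, not from the paper):

```python
# Hypothetical sketch of a distilling-step-by-step style multi-task loss:
# the student predicts the label and also generates the LLM's rationale.
import torch
import torch.nn.functional as F

def distill_step_by_step_loss(label_logits, label_target,
                              rationale_logits, rationale_target, lam=1.0):
    # standard label-prediction loss: (batch, classes) vs (batch,)
    label_loss = F.cross_entropy(label_logits, label_target)
    # rationale-generation loss: token-level cross-entropy over the sequence
    rationale_loss = F.cross_entropy(
        rationale_logits.view(-1, rationale_logits.size(-1)),
        rationale_target.view(-1))
    # lam weights how much the rationales steer training (my notation)
    return label_loss + lam * rationale_loss
```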

pair-code.github.io

Has nice interactive examples and a good UMAP vs. t-SNE comparison.
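
If you want to poke at the two methods outside the browser, a minimal sketch (assumes the `umap-learn` package alongside scikit-learn; the knob values below are just common defaults):

```python
# Embed the same data with both methods and compare the 2-D pictures.
from sklearn.datasets import load_digits
from sklearn.manifold import TSNE
import umap

X = load_digits().data                                   # 1797 x 64 digits
emb_tsne = TSNE(n_components=2, perplexity=30).fit_transform(X)
emb_umap = umap.UMAP(n_neighbors=15, min_dist=0.1).fit_transform(X)
```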

https://duckai.org/

cross-posted from: https://lemmy.intai.tech/post/134262

> DuckAI is an open and scalable academic lab and open-source community working on various Machine Learning projects. Our team consists of researchers from the Georgia Institute of Technology and beyond, driven by our passion for investigating large language models and multimodal systems.
>
> Our present work concentrates on developing and analyzing a variety of dataset projects, with the aim of understanding the depth and performance of these models across diverse domains.
>
> Our objective is to welcome people from a variety of backgrounds into cutting-edge ML projects and to rapidly scale up our community to make an impact on the ML landscape.
>
> We are particularly devoted to open-sourcing datasets that can become important infrastructure for the community, and to exploring ways to improve the design of foundation models.

ocw.mit.edu

Broadly speaking, Machine Learning refers to the automated identification of patterns in data. As such, it has been a fertile ground for new statistical and algorithmic developments. The purpose of this course is to provide a mathematically rigorous introduction to these developments, with emphasis on methods and their analysis.

github.com

Includes lectures, lecture notes, and assignments.
Lectures for Deep Learning: https://www.youtube.com/playlist?list=PLMsTLcO6etti_SObSLvk9ZNvoS_0yia57
Lectures for Reinforcement Learning: https://www.youtube.com/playlist?list=PLMsTLcO6ettgmyLVrcPvFLYi2Rs-R4JOE

developers.google.com

A good set of best practices for deployment that isn't language-specific

github.com

Coding is a big part of ML these days, and while it's important that the model works well, it's also important that the code is written properly too.
The link is the general Python version; the ML-specific version is here: https://github.com/davified/clean-code-ml
Video version: https://bit.ly/2yGDyqT
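
A toy example of the kind of refactor the repo pushes for; the `age` column and `preprocess` helper are made up for illustration:

```python
# Before: imputation and splitting copy-pasted across notebook cells.
# After: one named, testable function with an explicit contract.
import pandas as pd

def preprocess(df: pd.DataFrame, target: str):
    """Impute missing ages and split features from the target column."""
    df = df.copy()                                    # don't mutate the caller's frame
    df["age"] = df["age"].fillna(df["age"].median())  # hypothetical column
    return df.drop(columns=[target]), df[target]
```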

https://hevpdd.ca/publications-and-case-studies/case-study-image-recognition-with-convolutional-neural-networks/

Introduces neural networks, the convolution operation, a few critical machine learning concepts, and some state-of-the-art CNN models. Includes a hands-on MATLAB tutorial (and code) demonstrating model configuration, the training process, and performance evaluation on the MNIST dataset.
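
The tutorial's code is MATLAB; for readers who'd rather see the same shape of model in Python, a minimal PyTorch sketch of an MNIST-style CNN (the architecture is a toy choice of mine, not the tutorial's):

```python
# Two conv/pool stages, then a linear classifier over the 10 digit classes.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Conv2d(1, 16, kernel_size=3, padding=1),  # convolution over 28x28 input
    nn.ReLU(),
    nn.MaxPool2d(2),                             # 28x28 -> 14x14
    nn.Conv2d(16, 32, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool2d(2),                             # 14x14 -> 7x7
    nn.Flatten(),
    nn.Linear(32 * 7 * 7, 10),
)

logits = model(torch.randn(1, 1, 28, 28))        # one fake MNIST-sized image
```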

https://hevpdd.ca/case-study-state-of-charge-estimation-2/

This tutorial describes the process of state-of-charge (SOC) estimation for Li-ion cells using an equivalent circuit model. It helps students create and run an SOC estimation strategy based on the 3rd-order R-RC model in MATLAB-Simulink. The tutorial starts with a general overview of state estimation using the extended Kalman filter (EKF) and the novel smooth variable structure filter (SVSF) method.
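
The tutorial itself works in MATLAB-Simulink; as a language-neutral reference, here's the generic EKF predict/update step sketched in NumPy (the 3rd-order R-RC cell model is not reproduced here):

```python
# One EKF iteration: f/h are the process/measurement models,
# F/H their Jacobians evaluated at the current estimate.
import numpy as np

def ekf_step(x, P, u, z, f, h, F, H, Q, R):
    # predict state and covariance through the process model
    x_pred = f(x, u)
    P_pred = F @ P @ F.T + Q
    # update with the measurement z
    y = z - h(x_pred)                    # innovation
    S = H @ P_pred @ H.T + R             # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)  # Kalman gain
    x_new = x_pred + K @ y
    P_new = (np.eye(len(x)) - K @ H) @ P_pred
    return x_new, P_new
```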

https://stanford.edu/~shervine/teaching/

I'm not sure I'd call a 10+ page PDF a "cheat sheet," but they are good resources.

Can't say I agree with all of this 100% (I'd put backpropagation on the math side, add model evaluation, remove convex optimization, etc.), and it's kind of an oversimplification, but the basics are there.

Lecture notes: https://www.cs.cornell.edu/courses/cs4780/2018fa/syllabus/
Recorded lectures: https://www.youtube.com/playlist?list=PLl8OlHZGYOQ7bkVbuRthEsaLr7bONzbXS

cross-posted from: https://lemmy.intai.tech/post/40699

> ## Models
> - [openchat](https://huggingface.co/openchat/openchat)
> - [openchat_8192](https://huggingface.co/openchat/openchat_8192)
> - [opencoderplus](https://huggingface.co/openchat/opencoderplus)
>
> ## Datasets
> - [openchat_sharegpt4_dataset](https://lemmy.intai.tech/post/40692)
>
> ## Repos
> - [openchat](https://github.com/imoneoi/openchat)
>
> ## Related Papers
> - [LIMA: Less Is More for Alignment](https://lemmy.intai.tech/post/10277)
> - [ORCA](https://lemmy.intai.tech/post/650)
>
> ### Credit:
> [Tweet](https://twitter.com/Yampeleg/status/1675165254144126978)
>
> ### Archive:
> @Yampeleg
> The first model to beat 100% of ChatGPT-3.5
> Available on Huggingface
>
> 🔥 OpenChat_8192
>
> 🔥 105.7% of ChatGPT (Vicuna GPT-4 Benchmark)
>
> Less than a month ago the world watched as ORCA [1] became the first model ever to outpace ChatGPT on Vicuna's benchmark.
>
> Today, the race to replicate these results open-source comes to an end.
>
> Minutes ago, OpenChat scored 105.7% of ChatGPT.
>
> But wait! There is more!
>
> Not only did OpenChat beat Vicuna's benchmark, it did so while pulling off a LIMA [2] move!
>
> Training was done using 6K GPT-4 conversations out of the ~90K ShareGPT conversations.
>
> The model comes in three versions: the basic OpenChat model, OpenChat-8192, and OpenCoderPlus (code generation: 102.5% of ChatGPT).
>
> This is a significant achievement, considering that it's the first (released) open-source model to surpass the Vicuna benchmark. 🎉🎉
>
> - OpenChat: https://huggingface.co/openchat/openchat
> - OpenChat_8192: https://huggingface.co/openchat/openchat_8192 (best chat)
> - OpenCoderPlus: https://huggingface.co/openchat/opencoderplus (best coder)
> - Dataset: https://huggingface.co/datasets/openchat/openchat_sharegpt4_dataset
> - Code: https://github.com/imoneoi/openchat
>
> Congratulations to the authors!!
>
> ---
>
> [1] Orca: the first model to cross 100% of ChatGPT: https://arxiv.org/pdf/2306.02707.pdf
> [2] LIMA: Less Is More for Alignment. TL;DR: a small number of VERY high-quality samples (1,000 in the paper) can be as powerful as much larger datasets: https://arxiv.org/pdf/2305.11206

cross-posted from: https://lemmy.intai.tech/post/24579

> https://github.com/microsoft/Data-Science-For-Beginners

cross-posted from: https://lemmy.intai.tech/post/21511

> https://skim.math.msstate.edu/LectureNotes/Machine_Learning_Lecture.pdf

cross-posted from: https://lemmy.intai.tech/post/17993

> https://huggingface.co/spaces/mosaicml/mpt-30b-chat

cross-posted from: https://lemmy.intai.tech/post/18067

> https://twitter.com/FrnkNlsn/status/1520585408215924736
>
> https://www.researchgate.net/publication/327304999_An_Elementary_Introduction_to_Information_Geometry
>
> https://www.researchgate.net/publication/357097879_The_Many_Faces_of_Information_Geometry
>
> https://franknielsen.github.io/IG/index.html
>
> https://franknielsen.github.io/GSI/
>
> https://www.youtube.com/watch?v=w6r_jsEBlgU

https://www.nature.com/collections/qghhqm/pointsofsignificance

From Nature.com's Statistics for Biologists collection: a series of short articles that make a nice introduction to several topics. Because the audience is biologists, the articles are light on math and equations.

https://lucykuncheva.co.uk/papers/jbjkrklknptfs99.pdf

This paper highlights an issue that many people don't think about. FYI: when trying to compare or reproduce results, always try to get the dataset from the same source as the original author and scale it in the same way. Unfortunately, many authors assume the scaling is obvious and don't report it, but a change in scaling can lead to very different results.
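
A quick way to see the effect: same data, same classifier, two common scalings; the dataset and classifier below are my choices, not the paper's.

```python
# Distance-based models like kNN are sensitive to how features are scaled,
# so the two runs below can report different accuracies.
from sklearn.datasets import load_wine
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler, MinMaxScaler

X, y = load_wine(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for scaler in (StandardScaler(), MinMaxScaler()):
    knn = KNeighborsClassifier().fit(scaler.fit_transform(X_tr), y_tr)
    print(type(scaler).__name__, knn.score(scaler.transform(X_te), y_te))
```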
