Machine Learning

Machine Learning yogthos • 4w ago • 100%

How ‘Embeddings’ Encode What Words Mean

www.quantamagazine.org

Machine Learning yogthos • 1mo ago • 100%

New AI model “learns” how to simulate Super Mario Bros. from video footage

arstechnica.com

Machine Learning yogthos • 2mo ago • 100%

Reflection 70B holds its own against even the top closed-source models (Claude 3.5 Sonnet, GPT-4o)

huggingface.co

"Reflection 70B holds its own against even the top closed-source models (Claude 3.5 Sonnet, GPT-4o). It’s the top LLM in (at least) MMLU, MATH, IFEval, GSM8K. Beats GPT-4o on every benchmark tested. It clobbers Llama 3.1 405B. It’s not even close. The technique that drives Reflection 70B is simple, but very powerful. Current LLMs have a tendency to hallucinate, and can’t recognize when they do so. Reflection-Tuning enables LLMs to recognize their mistakes, and then correct them before committing to an answer. Additionally, we separate planning into a separate step, improving CoT potency and keeping the outputs simple and concise for end users. Important to note: We have checked for decontamination against all benchmarks mentioned using @lmsysorg’s LLM Decontaminator. The weights of our 70B model are available today on @huggingface here: https://huggingface.co/mattshumer/Reflection-70B @hyperbolic_labs API available later today. Next week, we will release the weights of Reflection-405B, along with a short report going into more detail on our process and findings. Most importantly, a huge shoutout to @csahil28 and @GlaiveAI. I’ve been noodling on this idea for months, and finally decided to pull the trigger a few weeks ago. I reached out to Sahil and the data was generated within hours. If you’re training models, check Glaive out. This model is quite fun to use and insanely powerful. Please check it out — with the right prompting, it’s an absolute beast for many use-cases. Demo here: https://reflection-playground-production.up.railway.app/ 405B is coming next week, and we expect it to outperform Sonnet and GPT-4o by a wide margin. But this is just the start. I have a few more tricks up my sleeve. I’ll continue to work with @csahil28 to release even better LLMs that make this one look like a toy. Stay tuned." https://x.com/mattshumer_/status/1831767014341538166

Machine Learning yogthos • 2mo ago • 100%

It’s Not Intelligent If It Always Halts: A Critical Perspective on Current Approaches to AGI

www.lifeiscomputation.com

Machine Learning yogthos • 2mo ago • 100%

The Difference Between Speaking and Thinking

www.theatlantic.com

https://archive.is/SXZMe

Machine Learning yogthos • 2mo ago • 100%

Diffusion Models Are Real-Time Game Engines

gamengen.github.io

Machine Learning yogthos • 2mo ago • 100%

Liger Kernel is a collection of Triton kernels designed specifically for LLM training. It can effectively increase multi-GPU training throughput by 20% and reduces memory usage by 60%.

github.com

Machine Learning yogthos • 2mo ago • 100%

Transformer Explainer

poloclub.github.io

Machine Learning yogthos • 2mo ago • 90%

Alibaba claims no. 1 spot in AI math models with Qwen2-Math

https://venturebeat.com/ai/alibaba-claims-no-1-spot-in-ai-math-models-with-qwen2-math/

Machine Learning yboutros • 2mo ago • 100%

How to convert a positionally encoded predicted embedding from a decoder to its matching token?

When training a transformer on positionally encoded embeddings, should the tgt output embeddings also be positionally encoded? If so, wouldn't the predicted/decoded embeddings also be positionally encoded?

Machine Learning yogthos • 3mo ago • 100%

New Open-Source AI Image Generator Beats Midjourney, SD3 and Auraflow

decrypt.co

Machine Learning yogthos • 3mo ago • 96%

AI models collapse when trained on recursively generated data

www.nature.com

Machine Learning yogthos • 4mo ago • 57%

RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing

lmsys.org

Machine Learning yogthos • 4mo ago • 100%

Alibaba's Qwen LLM model leading open source rankings

huggingface.co

Machine Learning yogthos • 4mo ago • 78%

By using the same techniques Google used to solve Go (MTCS and backprop), Llama8B gets 96.7% on math benchmark GSM8K. That’s better than GPT-4, Claude and Gemini, with 200x fewer parameters!

https://arxiv.org/pdf/2406.07394

![](https://lemm.ee/api/v3/image_proxy?url=https%3A%2F%2Fmedia.mas.to%2Fmedia_attachments%2Ffiles%2F112%2F626%2F014%2F287%2F127%2F586%2Foriginal%2F98f7cf5629bf7f2b.png)

Machine Learning yogthos • 4mo ago • 87%

Mixture of Agents (MoA) leverages several open-source LLM agents to achieve a score of 65.1% on AlpacaEval 2.0

www.together.ai

Machine Learning ylai • 4mo ago • 100%

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

huggingface.co

Machine Learning keepthepace • 4mo ago • 90%

Torrent tracker for open models

aitracker.art

Someone (Dreamertist on reddit) got tired of depending on Huggingface for downloading models and proposes a torrent tracker to share more efficiently these huge blobs. It just started, only a few models uploaded yet, but I think it is worth that we all put our local stash online there. Making a new torrent is [super easy](https://aitracker.art/viewtopic.php?t=1) (one missing step though: when "re-downloading" the model you need to save it in the directory where it already exists. This way it will "resume" at 100% completion and switch to seeding mode)

Machine Learning wargreymon • 4mo ago • 75%

Can gpt generate a gpt model?

Imagine AI giving offsprings...

Machine Learning yogthos • 5mo ago • 60%

Sakuga-42M Dataset: Scaling Up Cartoon Research

arxiv.org

Machine Learning yogthos • 5mo ago • 85%

How AI 'Understands' Images (CLIP)

https://www.youtube.com/watch?v=KcSXcpluDe4

Machine Learning smokinliver • 6mo ago • 90%

Where do these stains come from and how can I fix them?

Hey guys, I have been experimenting with self-supervised visual learning a bit. Until now I have only ever used U-Nets and related architectures. No matter what specific task, images or other parameters I changed I always encountered these stains on my output-images (here marked with green), although sometimes more, sometimes less. Now I wondered if anybody could tell me where they came from and how I could prevent them? In the attached picture the input (left) and target (right) are the same, so that I can be sure these stains do not come from a badly designed learning task, yet they still appear (output is the middle image). Thanks in advance and all the best :D Edit: added line breaks

Machine Learning yogthos • 6mo ago • 50%

HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models

hidiffusion.github.io

Machine Learning yogthos • 6mo ago • 80%

Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

https://animate-your-word.github.io/demo/

Machine Learning yogthos • 7mo ago • 85%

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

arxiv.org

Machine Learning Kit • 7mo ago • 50%

What are your thoughts on Microsoft Copilot?

Copilot sounds amazing on paper. The free (to 365 subs) version on the web is just Chat GPT4, so that's familiar enough. The integration with 365 applications is really what grabs me. Stuff like tossing it 10 spreadsheets and asking it to analyze and compare the data, having a virtual assistant to remind me of upcoming actionables, and summarizing a meeting when I zone out - it all sounds really handy. I met with Microsoft last week and they're down for giving me a 90 day trial if I want to take it for a spin. Any thoughts or suggestions? I ideally want to determine if this will improve productivity for my end users enough to be worth the insane cost of $30/user/mo.

Machine Learning TheHobbyist • 7mo ago • 89%

Looking for a specific OpenAI employee personal blog

Hi all, I think around 1 or 2 years ago, I stumbled upon a personal blog of an asian woman (I think) working at OpenAI. She had numerous extensive fascinating blog posts on a black themed blog, going into the technical details of embeddings of language models and such. I can no longer find that blog and have no other information to go by. Would anyone possibly know which blog I'm referring to? It would be very much appreciated.

Machine Learning yogthos • 7mo ago • 81%

Introducing SIMA, a Scalable Instructable Multiworld Agent

deepmind.google

Machine Learning yogthos • 7mo ago • 66%

LLMs are not superintelligent | Yann LeCun and Lex Fridman

https://www.youtube.com/watch?v=NVxcsekcbhs

Machine Learning ericjmorey • 8mo ago • 100%

Where Is Noether's Principle in Machine Learning? | 2024-02-29

https://cgad.ski/blog/where-is-noethers-principle-in-machine-learning.html

2024-02-29 | [Christopher Gadzinski](https://cgad.ski/) writes: > Physics likes optimization! Subject to its boundary conditions, the time evolution of a physical system is a critical point for a quantity called an action. This point of view sets the stage for Noether's principle, a remarkable correspondence between continuous invariances of the action and conservation laws of the system. > > In machine learning, we often deal with discrete "processes" whose control parameters are chosen to minimize some quantity. For example, we can see a deep residual network as a process where the role of "time" is played by depth. We may ask: > > 1. Does Noether's theorem apply to these processes? > 2. Can we find meaningful conserved quantities? > > Our answers: "yes," and "not sure!"

Machine Learning yogthos • 8mo ago • 66%

Sora is an AI model that can create realistic and imaginative scenes from text instructions.

https://openai.com/sora

Machine Learning yogthos • 8mo ago • 66%

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

github.com

Machine Learning mawss • 8mo ago • 60%

Gemini 1.5

blog.google

Anybody got to try it?

Machine Learning yogthos • 8mo ago • 66%

DiffusionGPT: LLM-Driven Text-to-Image Generation System

https://diffusiongpt.github.io/

Machine Learning yogthos • 8mo ago • 66%

Stable cascade uses smaller the latent space than stable diffusion resulting in faster inference and cheaper training.

huggingface.co

Machine Learning yogthos • 9mo ago • 80%

Interpreting Neural Networks through the Polytope Lens

www.lesswrong.com

Machine Learning yogthos • 9mo ago • 66%

Matryoshka Representation Learning

arxiv.org

Machine Learning ericjmorey • 9mo ago • 100%

NumPy 2 is coming: preventing breakage, updating your code

pythonspeed.com

[Itamar Turner-Trauring](https://pythonspeed.com/about/) writes: > These sort of problems are one of the many reasons you want to “pin” your application’s dependencies: make sure you only install a specific, fixed set of dependencies. Without reproducible dependencies, as soon as NumPy 2 comes out your application might break when it gets installed with new dependencies. > > The really short version is that you have two sets of dependency configurations: > > - **A direct dependency list**: A list of libraries you directly import in your code, loosely restricted. This is the list of dependencies you put in pyproject.toml or setup.py. > - **A lock file**: A list of all dependencies you rely on, direct or indirect (dependencies of dependencies), pinned to specific versions. This might be a requirements.txt, or some other file dependencies on which tool you’re using. > > [At appropriate intervals you update the lock file](https://pythonspeed.com/articles/when-update-dependencies/) based on the direct dependency list. > > I’ve written multiple articles on the topic, in case you’re not familiar with the relevant tools: > > - “[Faster Docker builds with pipenv, poetry, or pip-tools](https://pythonspeed.com/articles/pipenv-docker/)” covers using those three tools to maintain lockfiles. > - For Conda, see “[Reproducible and upgradable Conda environments with conda-lock](https://pythonspeed.com/articles/activate-conda-dockerfile/)”. Read [NumPy 2 is coming: preventing breakage, updating your code](https://pythonspeed.com/articles/numpy-2/)

Machine Learning MOMA_Trance • 10mo ago • 53%

Coscientist: Meet the World's First AI Research Assistant

youtu.be

Machine Learning spaduf • 12mo ago • 100%

Theoretical Foundations of Graph Neural Networks - Seminar

https://www.youtube.com/watch?v=uF53xsT7mjc

cross-posted from: https://slrpnk.net/post/3892266 > **Institution:** Cambridge > **Lecturer:** Petar Velickovic > **University Course Code:** seminar > **Subject:** #math #machinelearning #neuralnetworks > **Description:** Deriving graph neural networks (GNNs) from first principles, motivating their use, and explaining how they have emerged along several related research lines.