optimizer

1 post connected to this tag.

Jun 28, 2026

Muon Optimizer: SGD vs AdamW vs Matrix-Aware Training Updates

Optimizer research note: This post continues the optimizer path from SGD and AdamW into Muon, a matrix-aware training update that changes what the optimizer kernel has to do. Previously in th...


Get my rants delivered to your inbox

I will send new posts as and when I write. No fixed cadence, just engineering notes, rants, and things I am thinking through.

ShivasNotes

Engineering notes, A.I. workflow, drones, systems programming, and the messy process of building in public.


Connect

hello@shivasnotes.com
AntShiv Robotics
StylesDoc


Subscribe to emails from Anthony
