1 file changed: +11 -0 lines changed

## What's New

+ ## Oct 16, 2025
+ * Add an impl of the Muon optimizer (based on https://github.com/KellerJordan/Muon) with customizations
+   * extra flexibility and improved handling for conv weights and fallbacks for weight shapes not suited for orthogonalization
+   * small speedup for Newton-Schulz (NS) iterations by reducing allocs and using fused (b)add(b)mm ops
+   * by default uses AdamW (or NAdamW if `nesterov=True`) updates if Muon is not suitable for the parameter shape (or the param is excluded via param group flag)
+   * like torch impl, select from several LR scale adjustment fns via `adjust_lr_fn`
+   * select from several NS coefficient presets or specify your own via `ns_coefficients`
+ * First 2 steps of 'meta' device model initialization supported
+   * Fix several ops that were breaking creation under 'meta' device context
+   * Add device & dtype factory kwarg support to all models and modules (anything inheriting from nn.Module) in `timm`
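For readers unfamiliar with the NS step mentioned in the Muon notes, here is a minimal sketch of a quintic Newton-Schulz iteration written with fused `torch.addmm` calls. It is not the `timm` implementation: the function name `newton_schulz_orthogonalize` is hypothetical, the coefficients are the defaults from the reference Muon repository, and only 2D inputs are handled (no conv reshaping, batching, or AdamW fallback).

```python
import torch

# Quintic coefficients from the reference Muon repo; an assumption here,
# not necessarily one of the presets selectable in timm.
NS_COEFFS = (3.4445, -4.7750, 2.0315)


def newton_schulz_orthogonalize(G, steps=5, coeffs=NS_COEFFS, eps=1e-7):
    """Approximately orthogonalize a 2D matrix (hypothetical helper)."""
    assert G.ndim == 2, "this sketch only handles 2D weights"
    a, b, c = coeffs
    X = G.to(torch.float32)
    # Work on the wide orientation so X @ X^T is the smaller Gram matrix.
    transposed = X.size(0) > X.size(1)
    if transposed:
        X = X.t()
    # Scale so every singular value is at most 1 before iterating.
    X = X / (X.norm() + eps)
    for _ in range(steps):
        A = X @ X.t()
        # Fused: B = b * A + c * (A @ A)
        B = torch.addmm(A, A, A, beta=b, alpha=c)
        # Fused: X = a * X + B @ X
        X = torch.addmm(X, B, X, beta=a)
    return X.t() if transposed else X


if __name__ == "__main__":
    W = torch.diag(torch.tensor([4.0, 2.0, 1.0]))
    print(torch.linalg.svdvals(newton_schulz_orthogonalize(W)))
```

With these coefficients the result is only approximately orthogonal: singular values are pushed into a band around 1 rather than to exactly 1, which is the trade-off the reference implementation accepts for speed.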
+
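The 'meta' device and factory kwarg notes above can be illustrated with plain PyTorch. The changelog does not spell out which steps are covered, so this is only a sketch of the generic pattern: build a module on the `meta` device through `device`/`dtype` kwargs, then allocate real storage with `to_empty` and re-run initialization. `TinyBlock` and `materialize` are hypothetical names, not part of `timm`.

```python
import torch
import torch.nn as nn


class TinyBlock(nn.Module):
    """Hypothetical module that forwards device/dtype factory kwargs."""

    def __init__(self, dim, device=None, dtype=None):
        super().__init__()
        dd = {"device": device, "dtype": dtype}
        self.norm = nn.LayerNorm(dim, **dd)
        self.fc = nn.Linear(dim, dim, **dd)
        # Parameters created directly also need the factory kwargs.
        self.scale = nn.Parameter(torch.ones(dim, **dd))

    def reset_parameters(self):
        # Only the parameter owned directly by this module.
        nn.init.ones_(self.scale)

    def forward(self, x):
        return self.fc(self.norm(x)) * self.scale


def materialize(model, device):
    """Allocate real (uninitialized) storage, then re-run each module's init."""
    model = model.to_empty(device=device)
    for mod in model.modules():
        if hasattr(mod, "reset_parameters"):
            mod.reset_parameters()
    return model


if __name__ == "__main__":
    block = TinyBlock(8, device="meta")  # no real memory allocated yet
    print(next(block.parameters()).is_meta)
    block = materialize(block, "cpu")
    print(block(torch.randn(2, 8)).shape)
```

Passing the kwargs through every constructor is what lets the whole tree be created without allocating memory; a single `torch.ones(dim)` without them would silently land on the default device.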
## Sept 21, 2025
* Remap DINOv3 ViT weight tags from `lvd_1689m` -> `lvd1689m` to match (same for `sat_493m` -> `sat493m`)
* Release 1.0.20