NeurIPS 2025×3
Three papers have been accepted at NeurIPS 2025: Layer-wise Weight Decay in LLM, Gradient-Preserving Activation Scaling in LLM, and The Curse of Depth in LLM.