Testing muP. #4
cloneofsimo
started this conversation in
Show and tell
Replies: 2 comments 1 reply
-
Looks correct to me! |
Beta Was this translation helpful? Give feedback.
0 replies
-
Could you please share the loss curves? with respect to training steps |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Initial Tests on muP running from 110M ~ 4.4B models.
Our goal is to see if results from TP-V5 and muScaling results are reproducible, and test to see if muP provides stable scaling-law predictions.
Current Status:
Beta Was this translation helpful? Give feedback.
All reactions