JIT multiple train steps #220
@@ -13,7 +13,7 @@ jobs:
 uses: psf/black@stable
 with:
 src: "./apax"
-version: "22.10.0"
+version: "22.12.0"
use pre-commit.ci now that the repo is public
Will this be part of this PR or a separate one? I looked over it and the rest looks good to me.
separate
Added an option to jit multiple train steps using lax.scan to iterate over an additional batch axis.
The resulting speedups can be significant when training on small structures, with small batch sizes, or both.
It's fully compatible with training ensembles.
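For readers unfamiliar with the pattern, here is a minimal sketch of scanning a train step over an additional leading batch axis. It is not the code from this PR: `loss_fn`, `train_step`, `multi_train_step`, the linear model, and the plain SGD update are all hypothetical stand-ins chosen to keep the example self-contained. Only `jax.jit`, `jax.value_and_grad`, `jax.tree_util.tree_map`, and `lax.scan` are real JAX calls.

```python
import jax
import jax.numpy as jnp
from jax import lax


def loss_fn(params, batch):
    # Hypothetical linear model with a mean-squared-error loss.
    x, y = batch
    pred = x @ params["w"] + params["b"]
    return jnp.mean((pred - y) ** 2)


def train_step(params, batch, lr=0.1):
    # One plain SGD update; a stand-in for the real optimizer step.
    loss, grads = jax.value_and_grad(loss_fn)(params, batch)
    params = jax.tree_util.tree_map(lambda p, g: p - lr * g, params, grads)
    return params, loss


@jax.jit
def multi_train_step(params, stacked_batches):
    # stacked_batches carries an extra leading axis of length n_steps.
    # lax.scan slices one batch per iteration and threads params through
    # as the carry, so all n_steps updates compile into a single call.
    return lax.scan(train_step, params, stacked_batches)


# Toy data: n_steps batches stacked along the new leading axis.
key = jax.random.PRNGKey(0)
n_steps, batch_size, n_features = 4, 8, 3
x = jax.random.normal(key, (n_steps, batch_size, n_features))
y = x @ jnp.array([1.0, -2.0, 0.5]) + 0.3

params = {"w": jnp.zeros(n_features), "b": jnp.zeros(())}
params, losses = multi_train_step(params, (x, y))
print(losses.shape)  # (4,), one loss per scanned step
```

Under these assumptions, control returns to Python once per `n_steps` updates instead of once per update, which is presumably where the gain for small structures and small batches comes from: per-call overhead is a larger share of the step time when each step is cheap.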