-
Notifications
You must be signed in to change notification settings - Fork 82
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement the principled initialisation #484
Comments
Hi @mrTsjolder , and congrats on the acceptance! For inspiration, here's how the Gaussian initialization from here is implemented. |
thanks a lot @mrTsjolder ! yes, we're definitely very interested in any principled ICNN init! you might have noticed our most recent ICNN has a diagonal quadratic + low rank block, i think they're interesting in their own right |
I am having a bit of a problem with the implementation: I could think of workarounds, but these all seem to require some sort of redesign of the code/API, which is probably not desired. Does anyone have suggestions for possible workarounds? PS: sorry for the late reply, but something else got in my way... |
Hi @mrTsjolder , sorry for the delayed response. Will re-read your paper and come up with ideas how to best implement it! |
Is your feature request related to a problem? Please describe.
Training input-convex neural networks can be slow.
Describe the solution you'd like
Good initialisations can accelerate learning in a variety of scenarios.
I published a paper at NeurIPS that proposes a principled weight initialisation for input-convex networks:
https://openreview.net/forum?id=pWZ97hUQtQ
Describe alternatives you've considered
@marcocuturi came to visit my poster and mentioned an alternative approach is currently being used for initialisation in OTT.
However, he acknowledged that it might be a good idea to implement my principled initialisation in OTT.
Additional context
I might have some time to implement my proposed initialisation in this framework.
My main question would be whether/where you want to have this initialiser in the framework.
The text was updated successfully, but these errors were encountered: