Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Built-in support for (custom?) decryption of model weights #279

Open
vadimkantorov opened this issue Oct 28, 2024 · 1 comment
Open

Built-in support for (custom?) decryption of model weights #279

vadimkantorov opened this issue Oct 28, 2024 · 1 comment

Comments

@vadimkantorov
Copy link

Sometimes it's useful to allow the user to allow decryption of the model/weights prior to loading or allow a custom user hook for this end. This is useful for basic foolproof protection of models in some on-premises setups.

ORT supports something like this in:

Could this also be supported in ORT backend for Triton?

@vadimkantorov
Copy link
Author

vadimkantorov commented Nov 20, 2024

Here's a demonstration of adding decryption of the ONNX model weights at loading time:

But maybe the better way would be to implement this as allowing the user to specify a path to a custom .so-file in the triton model config or alternatively implement this via calling in the backend code stub I/O hooks which could then be overridden by the user with LD_PRELOAD'ed custom impl of these hooks. Then these hooks could implement loading model weights from some S3 / custom FS path or do custom decryption or something else.

Of course this approach can become more complicated if the model weights are accessed via mmap-ing of the weights.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant