Built-in support for (custom?) decryption of model weights #279

vadimkantorov · 2024-10-28T12:55:41Z

Sometimes it's useful to let the user decrypt the model/weights prior to loading, or to provide a custom user hook for this purpose. This is useful as basic foolproof protection of models in some on-premises setups.

ORT supports something like this in:

[TRT EP] Fix logic to reach cache encryption code. microsoft/onnxruntime#17111

Could this also be supported in the ORT backend for Triton?

vadimkantorov · 2024-11-20T15:56:43Z

Here's a demonstration of adding decryption of the ONNX model weights at loading time:

add decryption WoodieDudy/onnxruntime_backend#1

But maybe a better way would be to let the user specify a path to a custom .so file in the Triton model config. Alternatively, the backend code could call stub I/O hooks, which the user could then override with a custom implementation loaded via LD_PRELOAD. These hooks could then load model weights from S3 or a custom FS path, do custom decryption, or something else.

Of course, this approach can become more complicated if the model weights are accessed via mmap.
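
To make the hook idea above more concrete, here is a minimal Python sketch of a pluggable read hook. It is only an illustration of the shape of the interface, not how the backend is implemented: the names `ReadHook`, `default_read_hook`, `make_xor_hook`, `xor_bytes` and `load_model_bytes` are all hypothetical, and the XOR "cipher" is a stand-in for whatever real decryption a user would plug in.

```python
from pathlib import Path
from typing import Callable, Optional

# Hypothetical hook type: takes the model path, returns plaintext model bytes.
ReadHook = Callable[[Path], bytes]


def xor_bytes(data: bytes, key: bytes) -> bytes:
    """XOR data with a repeating key. Toy placeholder, NOT real encryption."""
    if not key:
        raise ValueError("key must be non-empty")
    return bytes(b ^ key[i % len(key)] for i, b in enumerate(data))


def default_read_hook(path: Path) -> bytes:
    """Default stub: read the file as-is, with no decryption."""
    return path.read_bytes()


def make_xor_hook(key: bytes) -> ReadHook:
    """Build a user-supplied hook that decrypts the file while reading it."""
    def hook(path: Path) -> bytes:
        return xor_bytes(path.read_bytes(), key)
    return hook


def load_model_bytes(path: Path, read_hook: Optional[ReadHook] = None) -> bytes:
    """Return the bytes that would be handed to the runtime.

    If no hook is given, the default stub is used, so existing
    unencrypted models keep working unchanged.
    """
    hook = read_hook or default_read_hook
    return hook(path)
```

A caller would then use `load_model_bytes(path, make_xor_hook(key))` and pass the resulting buffer to the runtime in memory. For example, the ONNX Runtime Python `InferenceSession` accepts serialized model bytes in place of a file path. Since the plaintext only ever exists as an in-memory buffer, this path does not mmap the model file, which is the complication mentioned above.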
