-
Notifications
You must be signed in to change notification settings - Fork 54
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
LoRA Adapters for vLLM & support for s3, gs, oss for pulling adapters…
… and models (to cache) from buckets (#304) * Add `.spec.adapters` * Support requesting adapters using the pattern: `{"model": "<model>_<adapter>", ... }` * Load LoRA adapters into running vLLM containers * Support updating LoRA adapters without needing to restart vLLM * Rewrite `.model` to use adapter in chat request body when proxying to vLLM * Add adapters to model list * Add support for `s3://`, `gs://`, `oss://` urls (for adapters and cache loading) * Add new cloud credentials to support new urls * Update docs * Update Model validation NOTE: * Was unable to test `oss://` urls... Had issues opening acct. FOLLOWUP: * Need to add adapter e2e tests (have not found a small enough model with adapters for use in kind cluster) * Need to update chart values.yaml to include GH-actions-built image for model loader after merge!!! Fixes #132, #303
- Loading branch information
Showing
81 changed files
with
2,168 additions
and
391 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,11 @@ | ||
# More info: https://docs.docker.com/engine/reference/builder/#dockerignore-file | ||
# Ignore build and test binaries. | ||
bin/ | ||
charts/ | ||
components/ | ||
docs/ | ||
manifests/ | ||
examples/ | ||
manifests/ | ||
proposals/ | ||
test/ | ||
tmp/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,11 @@ | ||
{{- if and .Values.secrets.aws.create (and (not (empty .Values.secrets.aws.accessKeyId)) (not (empty .Values.secrets.aws.secretAccessKey))) }} | ||
apiVersion: v1 | ||
kind: Secret | ||
metadata: | ||
name: {{ include "kubeai.awsSecretName" . }} | ||
labels: | ||
{{- include "kubeai.labels" . | nindent 4 }} | ||
data: | ||
accessKeyId: {{ .Values.secrets.aws.accessKeyId | b64enc }} | ||
secretAccessKey: {{ .Values.secrets.aws.secretAccessKey | b64enc }} | ||
{{- end }} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.