Skip to content

Commit

Permalink
add e2e test for vLLM with lora adapters
Browse files Browse the repository at this point in the history
  • Loading branch information
samos123 committed Dec 13, 2024
1 parent f0d7dd4 commit 4affddf
Show file tree
Hide file tree
Showing 4 changed files with 12 additions and 1 deletion.
1 change: 1 addition & 0 deletions .github/workflows/tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -33,6 +33,7 @@ jobs:
- "autoscaler-restart"
- "cache-shared-filesystem"
- "engine-vllm-pvc"
- "engine-vllm-adapters"
steps:
- name: Checkout code
uses: actions/checkout@v4
Expand Down
4 changes: 4 additions & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -95,6 +95,10 @@ test-e2e-cache-shared-filesystem: skaffold
test-e2e-engine-vllm-pvc: skaffold
./test/e2e/run.sh engine-vllm-pvc --profile e2e-test-default

.PHONY: test-e2e-engine-vllm-adapters
test-e2e-engine-vllm-adapters: skaffold
./test/e2e/run.sh engine-vllm-adapters --profile e2e-test-default

.PHONY: test-e2e-engine
test-e2e-engine: skaffold
CACHE_PROFILE=$(CACHE_PROFILE) ./test/e2e/run.sh engine-$(ENGINE) --profile e2e-test-default
Expand Down
6 changes: 6 additions & 0 deletions charts/models/values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -304,6 +304,12 @@ catalog:
url: "ollama://qwen2.5:7b"
engine: OLlama
resourceProfile: cpu:2
qwen2.5-0.5b-cpu:
enabled: false
features: ["TextGeneration"]
engine: VLLM
url: "hf://Qwen/Qwen2.5-0.5B-Instruct"
resourceProfile: cpu:1
qwen2-500m-cpu:
enabled: false
features: ["TextGeneration"]
Expand Down
2 changes: 1 addition & 1 deletion test/e2e-manual/gke-vllm-adapters/run.sh
Original file line number Diff line number Diff line change
Expand Up @@ -16,4 +16,4 @@ curl -v http://localhost:8000/openai/v1/completions \
# with adapter
curl -v http://localhost:8000/openai/v1/completions \
-H "Content-Type: application/json" \
-d '{"model": "tiny-llama/colorist", "prompt": "Who was the first president of the United States?", "max_tokens": 40}'
-d '{"model": "tiny-llama_colorist", "prompt": "Who was the first president of the United States?", "max_tokens": 40}'

0 comments on commit 4affddf

Please sign in to comment.