Skip to content

Commit

Permalink
comment out env variables and set max seq=1
Browse files Browse the repository at this point in the history
  • Loading branch information
samos123 committed Nov 17, 2024
1 parent 11fd736 commit facdf4e
Showing 1 changed file with 4 additions and 3 deletions.
7 changes: 4 additions & 3 deletions manifests/models/opt-125m-cpu.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -5,11 +5,12 @@ metadata:
name: opt-125m-cpu
spec:
features: [TextGeneration]
env:
VLLM_RPC_TIMEOUT: "10000"
VLLM_CPU_KVCACHE_SPACE: "1"
# env:
# VLLM_RPC_TIMEOUT: "10000"
# VLLM_CPU_KVCACHE_SPACE: "1"
args:
- --disable-frontend-multiprocessing
- --max-num-seqs=1
owner: facebook
url: hf://facebook/opt-125m
engine: VLLM
Expand Down

0 comments on commit facdf4e

Please sign in to comment.