add nvidia-gpu-rtx4070-8gb and qwen2.5 models #326

samos123 · 2024-12-02T01:28:14Z

@nstogner curious what you think about this pattern of using memory requirements as profiles. Note this is for my laptop, which has an NVIDIA GPU with 8GB of memory. It's an RTX 4070: the laptop version has 8GB whereas the desktop version has 12GB.

nstogner · 2024-12-02T01:59:27Z

I think it would be better to define two profiles for RTX 4070, similar to what we do with the different memory variants of a100:

kubeai/charts/kubeai/values.yaml

Line 153 in d834b1c

nvidia-gpu-a100-40gb:
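
For illustration, the per-variant pattern suggested here might look roughly like the following in `values.yaml`. This is a minimal sketch, not the merged change: the `resourceProfiles` key and the `imageName`, `limits`, and `requests` fields are assumed from the chart's existing layout, and `nvidia-gpu-rtx4070-12gb` is a hypothetical name for the desktop variant, which this PR does not add.

```yaml
# Sketch only: field names are assumptions based on the chart's existing
# GPU profiles; check them against charts/kubeai/values.yaml before use.
resourceProfiles:
  # Laptop RTX 4070 (8GB of GPU memory), the profile this PR is about.
  nvidia-gpu-rtx4070-8gb:
    imageName: "nvidia-gpu"
    limits:
      nvidia.com/gpu: "1"
    requests:
      nvidia.com/gpu: "1"
  # Hypothetical desktop RTX 4070 (12GB) profile, shown only to illustrate
  # one profile per memory variant, as with the a100 40gb and 80gb profiles.
  nvidia-gpu-rtx4070-12gb:
    imageName: "nvidia-gpu"
    limits:
      nvidia.com/gpu: "1"
    requests:
      nvidia.com/gpu: "1"
```

Naming each profile after both the GPU family and its memory size keeps models from being scheduled onto a variant with too little memory.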

samos123 requested a review from nstogner December 2, 2024 01:28

samos123 added 2 commits December 2, 2024 18:17

add nvidia-gpu-8gb resource profile

a0775f6

include gpu family in name

679d9e2

samos123 force-pushed the rtx-4070 branch from 94fb6b4 to 679d9e2 Compare December 3, 2024 02:20

samos123 changed the title ~~add nvidia-gpu-8gb resource profile~~ add nvidia-gpu-rtx4070-8gb resource profile Dec 3, 2024

nstogner approved these changes Dec 3, 2024


samos123 added 2 commits December 3, 2024 14:23

add local models for coding

71e0962

enable false by default

63ee3e0

samos123 changed the title ~~add nvidia-gpu-rtx4070-8gb resource profile~~ add nvidia-gpu-rtx4070-8gb and qwen2.5 models Dec 3, 2024

generate manifests

6a77c00

samos123 merged commit bcda5c2 into main Dec 4, 2024
15 checks passed

samos123 deleted the rtx-4070 branch December 4, 2024 00:18
