Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add nvidia-gpu-rtx4070-8gb and qwen2.5 models #326

Merged
merged 5 commits into from
Dec 4, 2024
Merged

add nvidia-gpu-rtx4070-8gb and qwen2.5 models #326

merged 5 commits into from
Dec 4, 2024

Conversation

samos123
Copy link
Contributor

@samos123 samos123 commented Dec 2, 2024

@nstogner curious what you think about this pattern of using memory requirements as profiles. Note this is for my laptop where I have an NVIDIA GPU of 8GB memory. It's an RTX 4070 and the laptop has version has 8GB whereas desktop version has 12GB.

@samos123 samos123 requested a review from nstogner December 2, 2024 01:28
@nstogner
Copy link
Contributor

nstogner commented Dec 2, 2024

I think it would be better to define two profiles for RTX 4070, similar to what we do with the different memory variants of a100:

nvidia-gpu-a100-40gb:

@samos123 samos123 changed the title add nvidia-gpu-8gb resource profile add nvidia-gpu-rtx4070-8gb resource profile Dec 3, 2024
@samos123 samos123 changed the title add nvidia-gpu-rtx4070-8gb resource profile add nvidia-gpu-rtx4070-8gb and qwen2.5 models Dec 3, 2024
@samos123 samos123 merged commit bcda5c2 into main Dec 4, 2024
15 checks passed
@samos123 samos123 deleted the rtx-4070 branch December 4, 2024 00:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants