Running this command gives the following output, which probably means the all-1g.10gb profile has been selected:
sudo nvidia-mig-parted export
2024/07/18 13:41:59 WARNING: unable to get device name: [failed to find device with id '20b5']
2024/07/18 13:41:59 WARNING: unable to get device name: [failed to find device with id '20b5']
version: v1
mig-configs:
  current:
    - devices: all
      mig-enabled: true
      mig-devices:
        1g.10gb: 7
But when I run nvidia-smi, I get the following output, which shows no MIG devices:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.223.02   Driver Version: 470.223.02   CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA A100 80G...  Off  | 00000000:00:08.0 Off |                   On |
| N/A   40C    P0    71W / 300W |     45MiB / 80994MiB |     N/A      Default |
|                               |                      |              Enabled |
+-------------------------------+----------------------+----------------------+
|   1  NVIDIA A100 80G...  Off  | 00000000:00:0B.0 Off |                   On |
| N/A   41C    P0    65W / 300W |     45MiB / 80994MiB |     N/A      Default |
|                               |                      |              Enabled |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| MIG devices:                                                                |
+------------------+----------------------+-----------+-----------------------+
| GPU  GI  CI  MIG |         Memory-Usage |        Vol|         Shared        |
|      ID  ID  Dev |           BAR1-Usage | SM     Unc| CE  ENC  DEC  OFA  JPG|
|                  |                      |        ECC|                       |
|==================+======================+===========+=======================|
|  No MIG devices found                                                       |
+-----------------------------------------------------------------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
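To make the mismatch concrete, here is a minimal sketch of what the exported configuration implies for my two GPUs. It is not part of nvidia-mig-parted: the dictionary is hand-transcribed from the export output (assuming it mirrors the YAML structure), and the helper name `expected_mig_devices` is hypothetical.

```python
# Hand-transcribed from the `nvidia-mig-parted export` output (assumption:
# this dict mirrors the YAML structure; no YAML parser is used here).
exported = {
    "version": "v1",
    "mig-configs": {
        "current": [
            {"devices": "all", "mig-enabled": True, "mig-devices": {"1g.10gb": 7}},
        ]
    },
}


def expected_mig_devices(config, name, gpu_count):
    """Return {gpu_index: {profile: count}} implied by one named mig-config.

    Hypothetical helper for illustration only; it is not a mig-parted API.
    """
    expected = {i: {} for i in range(gpu_count)}
    for spec in config["mig-configs"][name]:
        devices = spec["devices"]
        # "all" targets every GPU; otherwise assume a list of GPU indices.
        targets = range(gpu_count) if devices == "all" else devices
        for gpu in targets:
            if spec.get("mig-enabled"):
                expected[gpu] = dict(spec.get("mig-devices", {}))
    return expected


# Two GPUs, as in my node.
expected = expected_mig_devices(exported, "current", gpu_count=2)
total = sum(sum(profiles.values()) for profiles in expected.values())
print(expected)
print("expected MIG devices in total:", total)
```

Under these assumptions the configuration implies seven 1g.10gb devices on each GPU, 14 in total, whereas nvidia-smi reports none.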
Also, running sudo nvidia-smi mig -lgip gives the following
I'm using nvidia-mig-parted version 0.8.0 and I have two NVIDIA A100 80GB PCIe GPUs in my node. This is the config.yaml file that I applied.
I wonder why the MIG devices I expected were not created.
I'm getting the following error when I try to create GPU instances.
Any help would be appreciated!