Skip to content

Actions: substratusai/kubeai

Publish docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
95 workflow runs
95 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

add k8s device plugin / GPU operator values file (#308)
Publish docs #70: Commit 49f453e pushed by samos123
November 10, 2024 18:52 54s main
November 10, 2024 18:52 54s
Llama 3.1 70b on L4 with pipeline parallelism (#307)
Publish docs #69: Commit c30396a pushed by samos123
November 10, 2024 18:51 51s main
November 10, 2024 18:51 51s
add llama 3.1 70b fp8 model on 1 x gh200 (#302)
Publish docs #68: Commit 4f2bf76 pushed by samos123
November 8, 2024 23:49 56s main
November 8, 2024 23:49 56s
Update README.md (#305)
Publish docs #67: Commit 766d1d8 pushed by nstogner
November 8, 2024 01:22 53s main
November 8, 2024 01:22 53s
update README (#296)
Publish docs #66: Commit 6f71d75 pushed by nstogner
November 4, 2024 13:05 55s main
November 4, 2024 13:05 55s
Add gh200 support and model (#300)
Publish docs #65: Commit 6082f5c pushed by samos123
November 2, 2024 01:07 55s main
November 2, 2024 01:07 55s
Update private-deep-chat.md
Publish docs #64: Commit c287c12 pushed by nstogner
October 31, 2024 16:04 50s main
October 31, 2024 16:04 50s
Deep Chat integration (#294)
Publish docs #63: Commit 62a1af5 pushed by nstogner
October 31, 2024 15:30 48s main
October 31, 2024 15:30 48s
Update kubernetes api reference (#290)
Publish docs #62: Commit 9d4fb2c pushed by samos123
October 29, 2024 05:59 49s main
October 29, 2024 05:59 49s
improve caching docs (#295)
Publish docs #61: Commit 04d8f56 pushed by samos123
October 29, 2024 05:54 48s main
October 29, 2024 05:54 48s
helm: bump chartVersions and appVersion to v0.10.0 (#291)
Publish docs #60: Commit 6434766 pushed by samos123
October 25, 2024 06:18 53s main
October 25, 2024 06:18 53s
add caching models with EFS guide (#289)
Publish docs #59: Commit b1b579c pushed by samos123
October 25, 2024 05:54 52s main
October 25, 2024 05:54 52s
Add EKS Installation Guide (#287)
Publish docs #58: Commit 40ba147 pushed by samos123
October 25, 2024 02:27 49s main
October 25, 2024 02:27 49s
increase caching e2e test timeout (#288)
Publish docs #57: Commit 1b8066c pushed by nstogner
October 25, 2024 00:10 1m 14s main
October 25, 2024 00:10 1m 14s
add kubeai metrics service endpoint (#284)
Publish docs #56: Commit 09fb570 pushed by samos123
October 24, 2024 17:13 59s main
October 24, 2024 17:13 59s
Add support for HTTP X-Label-Selector headers to support Multitenancy…
Publish docs #55: Commit 4213d3a pushed by nstogner
October 24, 2024 00:10 50s main
October 24, 2024 00:10 50s
Github action docker build add timeout to address stuck WF's (#281)
Publish docs #54: Commit a76db7b pushed by samos123
October 21, 2024 03:01 46s main
October 21, 2024 03:01 46s
helm: bump models chartVersion to v0.7.0
Publish docs #53: Commit dcdb370 pushed by samos123
October 19, 2024 06:02 1m 4s main
October 19, 2024 06:02 1m 4s
helm: update appVersion to v0.9.0 (#280)
Publish docs #52: Commit 7517d32 pushed by samos123
October 19, 2024 05:58 56s main
October 19, 2024 05:58 56s
add manual test of vLLM on GPU and TPU (#279)
Publish docs #51: Commit 3c37aed pushed by samos123
October 19, 2024 05:38 51s main
October 19, 2024 05:38 51s
Fix broken screenshot
Publish docs #50: Commit 8593701 pushed by nstogner
October 18, 2024 16:11 53s main
October 18, 2024 16:11 53s
Shared filesystem caching (#272)
Publish docs #49: Commit 5e8f2c5 pushed by nstogner
October 18, 2024 15:26 1m 14s main
October 18, 2024 15:26 1m 14s
update vllm images to 0.6.3 (#273)
Publish docs #48: Commit 7fdbc7d pushed by samos123
October 18, 2024 13:07 57s main
October 18, 2024 13:07 57s
add tpu quota to GKE install guide and use values-gke.yaml (#271)
Publish docs #47: Commit 4c9fb05 pushed by samos123
October 11, 2024 18:52 1m 0s main
October 11, 2024 18:52 1m 0s
Add comment about autoscaling edge case and add log line
Publish docs #46: Commit 8c7238b pushed by nstogner
October 9, 2024 14:16 56s main
October 9, 2024 14:16 56s