Is there a way to have performance metrics when running kubeai? #299

Open
strus38 opened this issue Oct 31, 2024 · 3 comments
Labels
good first issue (Good for newcomers)

Comments

@strus38

strus38 commented Oct 31, 2024

Hello
I am looking for a way to have performance metrics show up while using KubeAI.
Is this provided already or is there a way to do it, even if it means deploying more containers alongside?
Thanks

@nstogner
Contributor

KubeAI exposes some very basic metrics. Some of the backend engines like vLLM expose more granular metrics about their performance. I think it makes sense for us to add a guide on how to scrape all of these metrics. Do you have a monitoring stack that you already use?

@samos123
Contributor

samos123 commented Nov 6, 2024

Here is an example PodMonitoring resource for scraping metrics from the vLLM pods, which you can use with Google Managed Prometheus:

apiVersion: monitoring.googleapis.com/v1
kind: PodMonitoring
metadata:
  name: vllm-pods
spec:
  selector:
    matchLabels:
      app.kubernetes.io/name: vllm
  endpoints:
  - port: http
    interval: 30s

I plan to work on an optional config to deploy Prometheus and Grafana together with KubeAI so you get everything working out of the box. Would that be of interest?
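
For a quick check without a full monitoring stack, the text that a Prometheus-style metrics endpoint serves can also be parsed directly. Below is a minimal sketch in Python, not part of KubeAI or vLLM. The `SAMPLE` text, the `example-model` label value, and the `parse_metrics` helper are assumptions made up for illustration; real metric names and values depend on the engine version and your deployment. The regex is deliberately simple and will not handle label values that contain a `}` character.

```python
import re
from typing import Dict, Tuple

# Illustrative sample in the Prometheus text exposition format.
# Metric names, labels, and values are assumptions for this example,
# not output captured from a real pod.
SAMPLE = """\
# HELP vllm:num_requests_running Number of requests currently running.
# TYPE vllm:num_requests_running gauge
vllm:num_requests_running{model_name="example-model"} 2.0
# HELP vllm:num_requests_waiting Number of requests waiting.
# TYPE vllm:num_requests_waiting gauge
vllm:num_requests_waiting{model_name="example-model"} 5.0
"""

# metric name, optional {labels} block, then the sample value.
LINE_RE = re.compile(r"^([a-zA-Z_:][a-zA-Z0-9_:]*)(\{[^}]*\})?\s+(\S+)")


def parse_metrics(text: str) -> Dict[Tuple[str, str], float]:
    """Map (metric name, raw label block) to the sample value."""
    samples: Dict[Tuple[str, str], float] = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = LINE_RE.match(line)
        if match is None:
            continue
        name, labels, value = match.group(1), match.group(2) or "", match.group(3)
        samples[(name, labels)] = float(value)
    return samples


if __name__ == "__main__":
    for (name, labels), value in sorted(parse_metrics(SAMPLE).items()):
        print(f"{name}{labels} = {value}")
```

To try this against a live pod, one option is to forward the pod's HTTP port locally and pass the response body of its metrics path to `parse_metrics`; the exact port and path depend on how the engine is deployed.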

@samos123
Contributor

samos123 commented Dec 2, 2024

We should include the following Grafana dashboard: https://github.com/vllm-project/vllm/tree/main/examples/production_monitoring

samos123 added the "good first issue" label on Dec 6, 2024

3 participants