Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unexpected Database Load running cubedash-gen #582

Open
omad opened this issue Apr 5, 2024 · 1 comment
Open

Unexpected Database Load running cubedash-gen #582

omad opened this issue Apr 5, 2024 · 1 comment

Comments

@omad
Copy link
Member

omad commented Apr 5, 2024

Follow on question from #581 .

We recently noticed a significant spike in Database IO charges against the development DEA Database, and suspect that it's due to an oversight leading to cubedash-gen being run hourly instead of 6 hourly, and also not coping correctly with the agdc schema tables being manually modified as a part of new product development. Deleting and re-indexing of some ODC Products.

The command executed is cubedash-gen --verbose --no-init-database --refresh-stats --all, which I expected would have been able to run very efficiently if there hadn't been new Datasets added. However, this command was taking between 10-20 minutes to run, and causing significant load on the database.

  • Any idea what's going wrong?
  • Is this a bug?
  • What's an appropriate way to run cubedash-gen?

image

Example Airflow log of a failing cubedash-gen run

@jeremyh
Copy link
Collaborator

jeremyh commented Apr 8, 2024

There's a good chance that the == None/ is None changes found in #581 caused these performance issues, as the queries used by Explorer would no longer match the underlying indexes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants