Skip to content

Unexpected Database Load running cubedash-gen #582

@omad

Description

@omad

Follow on question from #581 .

We recently noticed a significant spike in Database IO charges against the development DEA Database, and suspect that it's due to an oversight leading to cubedash-gen being run hourly instead of 6 hourly, and also not coping correctly with the agdc schema tables being manually modified as a part of new product development. Deleting and re-indexing of some ODC Products.

The command executed is cubedash-gen --verbose --no-init-database --refresh-stats --all, which I expected would have been able to run very efficiently if there hadn't been new Datasets added. However, this command was taking between 10-20 minutes to run, and causing significant load on the database.

  • Any idea what's going wrong?
  • Is this a bug?
  • What's an appropriate way to run cubedash-gen?

image

Example Airflow log of a failing cubedash-gen run

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions