[AI-6308] Add ability to filter metrics and don't collect GPU metrics by default #21957
Add this suggestion to a batch that can be applied as a single commit. This suggestion is invalid because no changes were made to the code. Suggestions cannot be applied while the pull request is closed. Suggestions cannot be applied while viewing a subset of changes. Only one suggestion per line can be applied in a batch. Add this suggestion to a batch that can be applied as a single commit. Applying suggestions on deleted lines is not supported. You must change the existing code in this line in order to create a valid suggestion. Outdated suggestions cannot be applied. This suggestion has been applied or marked resolved. Suggestions cannot be applied from pending reviews. Suggestions cannot be applied on multi-line comments. Suggestions cannot be applied while the pull request is queued to merge. Suggestion cannot be applied right now. Please check back later.
What does this PR do?
Adds ability to filter metrics by source command and stops collecting GPU metrics by default
Motivation
We want to be able to limit metric collection (ie if a command is not supported in a certain LSF version or environment, or if the metrics are not needed).
Additionally, the GPU commands are not available on lsf servers that do not have GPUs, so we don't want to always run them by default.
Note to reviewers: I ran into an issue in our config model generation that limits the amount of options in an enum. I will make a card for it and once it's fixed, remove the validator and make the
metric_sourcesfield an enum.Review checklist (to be filled by reviewers)
qa/skip-qalabel if the PR doesn't need to be tested during QA.backport/<branch-name>label to the PR and it will automatically open a backport PR once this one is merged