Conversation

@tiranux tiranux commented Nov 25, 2025

Description

Adds a new DAG tool for troubleshooting and profiling DAG parsing.

Note: Before submitting a pull request, please open an issue for discussion if you are not associated with Google.

Checklist

@product-auto-label product-auto-label bot added samples Issues that are directly related to samples. api: composer Issues related to the Cloud Composer API. labels Nov 25, 2025
@tiranux tiranux force-pushed the add-dag-parsing-profiler branch from a74ffb7 to 8552eb2 November 25, 2025 07:18
@tiranux tiranux marked this pull request as ready for review November 25, 2025 16:56
@tiranux tiranux requested review from a team as code owners November 25, 2025 16:56

snippet-bot bot commented Nov 25, 2025

Here is the summary of changes.

You are about to add 2 region tags.

This comment is generated by snippet-bot.
If you find problems with this result, please file an issue at:
https://github.com/googleapis/repo-automation-bots/issues.
To update this comment, add snippet-bot:force-run label or use the checkbox below:

  • Refresh this comment
@tiranux tiranux marked this pull request as draft November 25, 2025 17:58
@tiranux tiranux force-pushed the add-dag-parsing-profiler branch from 8552eb2 to 507bff9 November 25, 2025 18:00
@tiranux tiranux marked this pull request as ready for review November 27, 2025 06:25
When triggered, it offloads the resource-intensive DAG parsing process to a temporary, isolated Kubernetes Pod. Its primary goal is to detect **parsing latency issues** and identify heavy top-level code execution without impacting your environment's workload resources. As a byproduct of this analysis, it also validates DAG integrity and catches syntax errors.
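As a rough illustration of what "parsing latency" and "heavy top-level code" mean here, the sketch below times how long a single Python file takes to import and captures any error raised along the way. It is not the implementation in this PR: the function name `time_module_import` is hypothetical, it uses only the standard library rather than Airflow's own parsing machinery, and it runs in the current process instead of an isolated Pod.

```python
import importlib.util
import time
from pathlib import Path
from typing import Optional, Tuple


def time_module_import(path: Path) -> Tuple[float, Optional[Exception]]:
    """Import a .py file as a module; return (elapsed seconds, error or None).

    Hypothetical helper for illustration only. Top-level code in the file
    (network calls, heavy computation, sleeps) runs during the import, so
    it shows up directly in the elapsed time.
    """
    spec = importlib.util.spec_from_file_location(path.stem, path)
    if spec is None or spec.loader is None:
        return 0.0, ImportError(f"cannot create an import spec for {path}")

    module = importlib.util.module_from_spec(spec)
    error: Optional[Exception] = None
    start = time.perf_counter()
    try:
        # Compiles and executes the file; a SyntaxError is raised here too.
        spec.loader.exec_module(module)
    except Exception as exc:
        error = exc
    elapsed = time.perf_counter() - start
    return elapsed, error
```

Calling this helper on each file in a DAGs folder and sorting by elapsed time would surface the slowest files first, while a non-`None` error marks files that fail to import at all.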

## 🌟 Key Features
* **Isolated Execution:** Offloads parsing logic to a separate Pod, protecting the Scheduler from resource contention and crashes.

Note: in Composer 3, the DAG processor runs separately from the Scheduler, in a dedicated workload. Let's add info about Composer 2 vs. Composer 3 and when this tool is applicable.

Author

Done. I omitted the specific mention of the Scheduler and now refer to the environment's resources instead, since moving the analysis to the Kubernetes Pod isolates its workload whether the environment is Composer 2 or Composer 3.

Author

@tiranux tiranux Dec 2, 2025

Added further clarification regarding the version to use for image setup (for cases where there is no automatic detection).
