1

I try to run a simple DAG that asynchronously submits beam python pipeline and the sensor that waits for the pipeline completion, as suggested in airflow/beam best practice.

According to the official airflow documentation, the Airflow operator should asynchronously submit the Beam job, where the wait_for_the_beam_job task is checking the status of the submit_beam_job and mark the sensor task as Done when the job finishes.

Unexpectedly, the beam is running synchronously. Have you any idea how can this issue be solved? Here is my minimal reproducible example: https://github.com/MichaelDuemcke/minimal-reproducible-example-airflow-waiting-beam

1 Answer 1

1

try by removing line

result.wait_until_finish() 

in https://github.com/MichaelDuemcke/minimal-reproducible-example-airflow-waiting-beam/blob/6f1fe87c1f71a56b3dc06adb7fa7c8d5889a68e0/dummy_beam_job.py#L47

Sign up to request clarification or add additional context in comments.

Comments

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.