
Data Factory

Needs Votes

Dataflow Gen2 Publish vs Run

Votes: 39

David Wagstaff on 28 Sep 2023 03:22:56

Change what happens when a Dataflow Gen2 is published. Currently, publishing a dataflow saves it and then automatically runs it. Dataflow Gen1 never did this: when authoring, all I needed to do was save the dataflow, and it could then be executed whenever I chose. Half the time I am not ready for the dataflow to run when working with Gen2, and the unnecessary run creates extra overhead because I have to go clean up whatever it just did.

Comments (3)

Jake O'Malley on 30 Jan 2025 17:36:04

RE: Dataflow Gen2 Publish vs Run

Completely agree. This also applies to Power Apps Gen2 Dataflows. I can move dataflows to downstream environments via solutions. Generally I will use Power Automate to orchestrate dataflows for the initial data migration into my downstream environments. The issue? I have to publish each dataflow individually in my target environment before I can run my orchestration Power Automate flow (which essentially just calls each dataflow in the order I want them to run, sometimes with other Power Automate flows running in between for data transformation). If I don't publish (and consequently, run) each dataflow, then the orchestration flow fails because it can't run an unpublished dataflow. Publishing also runs each dataflow, so I have to go back through and bulk delete all the data created when I publish the flows. Then I can run my orchestration flow.

Allowing us to publish Gen2 dataflows without running them would be a massive improvement, in my opinion.

Also, finding a fix to having to re-establish dataflow connections every time a solution is imported would be huge as well!

Paul Chantakraiwat on 13 Jun 2024 02:45:05

RE: Dataflow Gen2 Publish vs Run

I find it frustrating that I can't save changes without publishing/running the task. Most of my data processes are scheduled, and if I've already run today's task, running it again causes data duplication. I need the task to run tomorrow, but without saving the changes now, they won't be there for tomorrow.

Joshua Rodgers on 20 Feb 2024 21:59:48

RE: Dataflow Gen2 Publish vs Run

This is a huge regression from Gen1 dataflows. We should be able to save and publish them without triggering a run.