Change Management for Data Wrangling (Alteryx) Workflows
Alteryx is a leading Data Wrangling software used for Data Integration, cleanup, and Data Science. Alteryx gets used by Data scientists and Business Users to perform Self Service Data Analytics. As part of Self-service data analytics, business users are empowered to perform analysis independently with minimal help from the technology organization. An added advantage of this is short time-to-market for data analytics. As Business users are not familiar with the Software Development Lifecycle (SDLC) concepts, there is a lack of change management practice in their workflow development. Alteryx too does not provide integration with a version control and change management toolset.
To overcome this, the SDLC best practices from programming languages like Java, Python, etc. can be applied to Alteryx workflows in a lightweight manner to make them palatable for Business users.
The below diagram depicts a process flow of Change and version management for Alteryx Workflows for Business users. Here are the steps:
- The user creates business functionality in form of Alteryx workflows using Alteryx Designer and commits to a Version control tool ( GIT, CVS, SVN, etc.) after completing unit testing on his laptop/desktop
- Continuous integration (CI) tools (Jenkins, Bamboo, GitLab, etc.) pick it up and create an assembly using Maven or Ant
- Finally, this artifact from #2 gets deployed to a network share
- The user also publishes a shell workflow to the Alteryx Gallery that will have information on how to invoke this deployed artifact from the network share. If you are starting with an Analytic App, there is an option in the Interface Designer (Ctrl+Alt+D) window to run another workflow. This will open the interface window of the specified Analytic App.
If you have a workflow (non-Analytic App) and/or want to run a workflow (without questions and actions), I would use the Run Command Event. This can be configured to run on completion of the current workflow and can execute a batch file that can be written to execute a Alteryx Workflow. This feature mimics the functionality of the Run Command tool that can be added to the workflow
5. This shell workflow can take parameters and can either be scheduled within Alteryx Gallery or invoked using Alteryx REST APIs
Version Management:
In the version control tool, head-based development could be followed where the head or master branch is used by users to commit the code. After deployment, relevant workflows are tagged accordingly (DEV/QA/PROD). Create a branch after the production migration and associate it with a tag for easy identification (i.e. changes for requirement x).
Folder Structure:
Within the version management tool following folder or packages can be created to manage work associated with business functionality.
With these steps you can add lightweight version control and change management to the Alteryx workflows.