Skip to main content
← Back to Blog

How to Create Alteryx Workflows: Beginner Step-by-Step Guide

Step-by-step Alteryx workflow tutorial for importing, cleaning, combining, and exporting data.

How to Create Alteryx Workflows: Beginner Step-by-Step Guide

How to Create Alteryx Workflows: Beginner Step-by-Step Guide

Learn a beginner-friendly Alteryx workflow to load, clean, combine, and export data with confidence.

Overview

Product Name: Alteryx Designer (x64)
Product Version: 2022.3.1.450 Patch 2

Alteryx is a versatile data analytics platform with tools for data preparation, blending, and advanced analytics. Its goal is to help users with different levels of programming experience manipulate, cleanse, and transform data from multiple sources. The drag-and-drop interface lets you build custom workflows by connecting pre-built tools, which keeps the learning curve approachable for beginners. Alteryx supports many data formats and integrates with databases and other analytics systems, making it a practical option for building custom datasets quickly.

In this tutorial, we’ll create and complete a basic Alteryx workflow:

  • Data collection
  • Data integration
  • Data cleansing
  • Data serialization

For this tutorial, we’ll combine datasets from three online learning platforms and export one complete dataset at the end. I’ve already downloaded the datasets I need, and there are many free and premium sources online.

Data collection

Obtain datasets

For this example, I’ll combine online course data from multiple sources so that, by the end, we’ll export one complete combined dataset.

These can come in various formats, e.g. .csv, .xlsx, DB connections and more.

Dataset sources

Create workflow

Now let’s create a workflow in Alteryx:

  1. Click File → New Workflow.
  2. Do it again but hit Save this time (File → Save).
  3. You now have an empty workflow. It will adopt the saved file name by default.

New workflow

Input data

Now that we have a workflow, let’s bring in our data.

  • Add and configure an Input Data tool from the In/Out tab for each dataset.
  • Attach a Browse tool from the In/Out tab for each data source so you can inspect data after a run.
  • Click the blue Run button in the top right corner.
  • Select any Browse tool to view the results for that dataset.

Input Data tool

Browse tool

Run and Browse results

Select desired fields

Now that we’ve confirmed we can view each dataset, it’s time to start shaping the data.

  • Replace each Browse tool with the Select tool from the Preparation tab.
  • For our purpose we just want the title from each dataset.

Select tool

Column selection

Scroll to the right in the Select window and you’ll find the Rename Column option. Since we’ll be combining multiple datasets, this helps track where each title comes from.
Rename columns like DATASETNAME_title.

Data integration

Join data

Now let’s combine the selected data from each dataset.

  • Add a Join tool from the Join tab and connect all of the Select tools to it.
  • Configure it to the Join by Specific Fields option.
  • Set the Field Outputs to the data you’re carrying over (e.g. title).

Join tool

Transpose data

Instead of using a separate column name to identify the dataset source (e.g. DATASETNAME_title), we’ll move that source label into a field on each record.

  • Attach a Transpose tool from the Transform tab to the Join tool.
  • Set the Data Columns to the data you’re carrying over (e.g. title).
  • Verify that the restructuring was successful.

Transpose tool

Data cleaning

Cleanse, filter, and select data

Now we’ll cleanse our data:

  • Remove entirely null rows and columns.
  • Set nulls to empty on strings.
  • Cleanse whitespace, tabs, and breaks.

Attach a Data Cleansing tool from the Preparation tab.

One oversight: this does not clean partially null records. Let’s add a custom filter:

  • Append a Filter tool from the Preparation tab to the Data Cleansing tool.

Data cleansing

End of workflow cleanup

Finalize your column names with a Select tool.

Filter tool

Data serialization

Export data

  • Append the Output Data tool from the In/Out tab.
  • Click Set Up a Connection.
  • Select your desired output format (e.g. .xlsx).
  • Click the blue Run button again.
  • Check the save location for the output file.

Final column rename

Output data

Conclusion

Great job, you now have the groundwork for a fully functional Alteryx Workflow!

Final output

You can combine any number of data sources, customize the data, and then create your own insightful datasets. There is more than one way to complete a task, and you have the freedom to sequence steps in whatever way works best for your project. Don’t stop here, though. There are many more tools built into Alteryx, and we’ve only scratched the surface.

BONUS

I made some modifications!

Instead of outputting only Course Titles and Course Providers, I changed the workflow to count how many times each word appears in Course Titles and sort from largest to smallest to find which technology “keywords” were repeated most often.

Along with the .xlsx file, I also set it to output a .hyper file for Tableau. This is one example of where you can go from this tutorial.

Workflow complete

Keyword count