Skip to main content

Select Columns

Overview

Imported datasets often contain low-standardized data with redundant or irregular fields. In this case, you can use the Select Columns operator to retain valid fields and create a cleaner, more effective dataset.

By selecting columns, you can reduce the amount of data that needs to be processed and transferred, allowing users to focus on key fields. This is especially important in large-scale data processing scenarios.

User Guide

  1. Drag the Select Columns operator from the ETL operator area to the canvas on the right.
  2. Click the Select Columns operator and select the required columns as needed. Search is supported.
  3. Hover over a selected field to rename the column name if needed.
  4. After configuration is complete, preview the data result.

Detailed Explanation

The following example shows how to keep 5 columns out of 34 columns. The original preview looks like this:

  1. Drag the Select Columns operator from the ETL operator area to the canvas on the right and connect it to the upstream node.

  2. Click the Select Columns operator. The left panel becomes the current operator configuration area, where you can rename the operator based on business needs if necessary.

  3. Select the required columns and rename existing columns as needed.

  4. After configuration is complete, preview the data result.