Skip to main content

Select Columns

Overview

Imported datasets often contain poorly standardized data, redundant fields, or inconsistent field design. In such cases, you can use Select Columns to keep only valid fields and produce a cleaner dataset.

By selecting columns, you can reduce the volume of data that needs to be processed and transferred, helping users focus on key fields. This is especially important in large-scale data processing scenarios.

Procedure

  1. Drag the Select Columns operator from the dataflow operator panel into the canvas on the right.

  2. Click the Select Columns operator and select the required columns as needed. Search is supported.

  3. Move the pointer over a selected field to rename the column if needed.

  4. After configuration is complete, preview the result.

Example

The following example shows how to keep 5 columns out of 34 columns. The original preview looks like this:

  1. Drag the Select Columns operator from the ETL operator area to the canvas on the right and connect it to the upstream node.

  2. Click the Select Columns operator. The left panel becomes the current operator configuration area, where you can rename the operator based on business needs if necessary.

  3. Select the required columns and rename existing columns as needed.

  4. After configuration is complete, preview the data result.