Select Columns
Overview
Imported datasets often contain poorly standardized data, redundant fields, or inconsistent field design. In such cases, you can use Select Columns to keep only valid fields and produce a cleaner dataset.
By selecting columns, you can reduce the volume of data that needs to be processed and transferred, helping users focus on key fields. This is especially important in large-scale data processing scenarios.

Procedure
-
Drag the
Select Columnsoperator from the dataflow operator panel into the canvas on the right.
-
Click the
Select Columnsoperator and select the required columns as needed. Search is supported.
-
Move the pointer over a selected field to rename the column if needed.

-
After configuration is complete, preview the result.

Example
The following example shows how to keep 5 columns out of 34 columns. The original preview looks like this:

-
Drag the
Select Columnsoperator from the ETL operator area to the canvas on the right and connect it to the upstream node. -
Click the
Select Columnsoperator. The left panel becomes the current operator configuration area, where you can rename the operator based on business needs if necessary.
-
Select the required columns and rename existing columns as needed.

-
After configuration is complete, preview the data result.
