Database Output (Advanced)
1. Overview
1.1. Function Description
Output dataset is the result data after data flow processing, which can directly push the processed data to the database.
Multiple database output operators can be configured at any node in the ETL data flow, and different storage locations can be specified.
1.2. Prerequisites
At least 1 "Input Dataset" or "Database Input" is required in the data flow before "Database Output" can be configured.
2. Operation Steps
-
Drag the Database Output operator from the data flow operator area into the right canvas editing area, and connect it using connection lines.
-
Click the Database Output operator and select the update method.
Update Method | Explanation |
---|---|
Direct Append | Retain existing data in the target table and write new data |
Full Update | Clear historical data from the target table before writing |
Insert Update | If comparison fields match successfully in the target table, update data will be executed, otherwise data will be directly appended |
-
Select database type, data account, and target table name (table will be automatically created if it doesn't exist).
-
Perform field mapping for the target table.
-
If the user selects insert update, comparison fields need to be confirmed.
-
If predecessor cleaning is needed for the target table, enter scripts in the predecessor SQL. Supports time macro parameters, global parameters, and workflow parameters. For workflow parameter creation methods, see Workflow Parameters.
Usage example:
3. High-Performance Mode
The following databases support high-performance mode to improve data synchronization performance:
Database Type | Notes |
---|---|
StarRocks | Direct append and full update support using high-performance model. When switching to high-performance mode, note the following: 1. HTTP port needs to be configured for StarRocks data account, otherwise task execution will report errors 2. When synchronized data length exceeds the target table field type length, it will automatically be set to null. It is recommended to plan field length in advance and create tables |
GaussDB | Simply enable the high-performance switch |