Quick Start
This article introduces the Data Development workflow to help you quickly understand and start using DataFlow.
- Integrate multiple data sources:
Supports heterogeneous data sources such as business systems, data warehouses, file-based data, and APIs, enabling flexible full and incremental synchronization. - Offline data development:
- Provides low-threshold visual ETL and
Dataflow Nodeorchestration capabilities, while supporting extended task types such as Python scripts and Shell commands to improve development efficiency. It also uses loop control,Conditional Branch, andSubprocessto complete task orchestration efficiently. For details, see Offline Development Task. - Provides minute-level
Near Real-Time Schedulingto ensure data timeliness and supports event-driven scheduling to avoid empty task runs. For details, see [Task Scheduling](06-Task Scheduling.md).
- Provides low-threshold visual ETL and
- Operations management:
Provides visual tools such as theInstance Runtime Gantt ChartandWorkflow Tree Viewto monitor task status and quickly identify abnormal nodes. You can then trace and resolve issues based onNode Logs. It also supportsRerunandRecover Failed Tasksto repair tasks and data and ensure data accuracy. For details, see [Task Monitor](08-Task Monitor.md).