Dataset Management Overview

Overview

After a dataset is integrated, users can manage and maintain it in several ways: setting dataset permissions, adjusting the data structure, scheduling refreshes, and reviewing resource lineage. Together, these operations help keep data assets well managed and usable.

These operations can be performed on either the dataset list page or the dataset detail page. This section walks through the relevant configuration flows so that users can understand and carry out each operation.

Getting Started

Operation Guide | Description
Basic Operations | Users can perform basic operations on datasets, such as save as, rename, and delete.
Dataset Preview and Editing | Helps users understand and use their data by previewing and modifying the structure and details of the current dataset.
Dataset Refresh | To keep data current, users can refresh datasets through scheduled refresh, manual refresh, URL-triggered refresh, and other modes.
Dataset Consumption and Usage | After integrating a dataset, users analyze it and create downstream resources such as ETL flows or cards. This guide covers permission settings and the review and management of downstream resources.
High-Performance Query Table (Advanced) | A data calculation and storage acceleration service provided by Guandata BI. It is suitable for datasets with 10 million rows or more and can significantly improve query efficiency on cards.
Resource Lineage | To simplify resource management, Guandata BI provides Resource Lineage. Users can review the global lineage of the current resource and inspect Field Lineage at a finer level to see the impact of field changes in one view.

Common Dataset Questions

If you encounter issues while using datasets, see Dataset FAQ.