Incremental
With BladePipe, you can create a Full Data & Incremental DataJob in minutes. It includes schema migration, existing data migration, incremental data synchronization and other stages. Once the DataJob is started, BladePipe will automatically finish the multiple DataTasks in order.
This page introduces how to create a Full Data & Incremental DataJob.
Select DataSource
- Log in to the BladePipe Cloud.
- In the top navigation bar, click DataJob.
- On the top of the page, click Create DataJob.
- Select the Cluster for DataJob execution. info
If the cluster have multiple Workers, BladePipe will schedule the Workers for DataJob Level 2 Disaster Recovery. If it has only one Worker, BladePipe will adopt the plan for DataJob Level 2 Disaster Recovery.
- Select the source and target datasource instances and finish the related settings. Click Test Connection.
- Select the source and target database or schema. BladePipe supports the migration and synchronization of multiple schemas. Then click Next Step.
Select DataJob Properties
- Select DataJob Type. Here taking the creation of Full Data & Incremental DataJob as an example, click Incremental and select Full Data option.
- Select Specification.info
When the memory of the worker is sufficient, you can choose a larger specification with better performance and higher stability. When there are many DataJobs, you can choose the specifications based on the reality, taking into account the worker utilization.
- Configure the following DataJob information.
Item | Description |
---|---|
Synchronize DDL |
|
Verification |
|
Correction Mode |
|
Clean Target Data Before Full Data | If enabled, the target data will be cleared up before full data initialization. |
Rebuild Target Schema | If enabled, BladePipe will automatically rebuild the target schema in the target database. |
Start Automatically |
|
- Click Next Step.
Select Tables
- (Optional) If the target mapping rules need to be modified, click Mapping Rules and make the changes accordingly.
- Select the tables to be synchronized.
- Exact match: Enter the table name plus a semicolon (half-width) in the input box to filter tables. You can enter multiple table names separated by semicolons without spaces in between.
- Fuzzy match: Enter characters in the input box to filter out tables with names containing these characters.
- Filtering by categories: You can filter tables by conditions in the search box.
tipClick the check box on the far left of the title row to select all tables on the current page. Click the Select All check box at the bottom left of the list to select all tables.
To select all tables by default when creating a DataJob, click Settings > Preference > BladePipe tab and set the value of parameter jobTableDefaultSelectAll to true. - Set the target table name.
- Automatically generate table names based on mapping rules. If you modify the mapping rules, the table names will change accordingly.
- After selecting the table, you can set the existing table name as the target table name in the Target Table column.
- After selecting the table, you can enter a custom table name in the Target Table column and click the option with the Enter symbol, or press the Enter key to confirm.
- Click Batch Modify Target Names to add prefixes and suffixes to table names in batches.
- Filter actions.
- Setting individually: After selecting the table, you can set the action to be synchronized for each table separately.
- Setting in batches: You can select the action in batches above the list, or click Action Filter in the table list on the left to set it in batches.
- Click Next Step.
Select Columns
- (Optional) If the target mapping rules need to be modified, click Mapping Rules and make the changes accordingly.
- You can view all selected tables on the left side of the page, and search for tables in the search box and input box above the list.
- Select and configure the columns to be synchronized.
- Setting individually: Select the columns to be synchronized for each table, or click Operation to set filter conditions, target database update conditions, primary key of the target database, etc.
- Setting in batches: Click Batch Operation in the upper-right corner to set target database update conditions, primary key of the target database, etc.
The following table describes the configurable items:
Item | Description |
---|---|
Set Virtual Column | Add a virtual column to the target table and set the name, value, type, and length of the virtual column. |
Set Data Filtering | Set data filtering conditions. For more information, see Data Filtering |
Add Update Condition | Set the conditions to update the target tables. |
Set Target Primary Key | Set the primary key of the target table. If the source table has no primary key but has a unique key, the unique key will be automatically set as the target primary key. |
Batch Filter Columns | Filter columns in batches. |
- (Optional) Upload custom code. For more information, see Custom Code.
- Click Next Step.
Confirm DataJob Creation
After confirming the DataJob configuration information, click Create DataJob at the bottom of the page.
If there are schemas, tables, or columns to be created in the Target instance during the DataJob creation process, BladePipe will automatically migrate the schemas. If not, there won't be schema migration.
View DataJob
- On the DataJob list page, you can view the DataJob progress.
- Click Details in the Operation column on the right side of the list to enter the DataJob Details page and view specific DataJob running information.