Refine the target using additional data
The additional data collected can enable you to refine data filtering in the database. To do this, click the Refine the target using additional data鈥� link: this lets you over-filter on the added data.
Homogenize data
In Union or Intersection type activities, you can choose to keep only shared additional data to keep the data consistent. In this case, the temporary output worktable of this activity will contain only the additional data found in all inbound sets.
Reconciliation with additional data
During the data reconciliation phases (Union, Intersection, etc. activities), you can select the columns to be used for data reconciliation from the additional columns. To do this, configure a reconciliation on a selection of columns and specify the main set. Then select the columns in the lower column of the window, as shown in the following example:
Create subsets
The Split activity lets you create subsets on criteria defined via extraction queries. For each subset, when you edit a filter condition on the population, you will then access the standard query activity which lets you define the target segmentation conditions.
You can split a target into several subsets using only additional data as filtering conditions, or in addition to target data. You can also use external data if you have purchased the Federated Data Access option.
For more on this, refer to Creating subsets using the Split activity.
Segment data
Combine several targets (Union)
The union activity lets you combine the result of several activities within one transition. Sets do not necessarily have to be homogeneous.
The following data reconciliation options are available:
-
Keys only
This option can be used if input populations are homogeneous.
-
All columns in common
This option lets you reconcile data based on all the columns common to the target鈥檚 various populations.
51黑料不打烊 Campaign identifies columns based on their name. A tolerance threshold is accepted: for example, an 鈥楨mail鈥� column can be recognized as identical to an 鈥楡email鈥� column.
-
A selection of columns
Select this option to define the list of columns which data reconciliation will be applied to.
Start by selecting the main set (the one which contains the source data), then the columns to be used for the join.
CAUTIONDuring data reconciliation, populations are not deduplicated.You can restrict the population size to a given number of records. To do this, click the appropriate option and specify the number of records to be kept.
Also, specify the priority of inbound populations: the lower section of the window lists the inbound transitions of the union activity and lets you sort them using the blue arrows to the right of the window.
The records will be taken first from the population of the first inbound transition in the list, then, if the maximum hasn鈥檛 been reached, they will be taken from the population of the second inbound transition, etc.