Rules and blocking keys define how records are matched in Aperture Data Studio. To create a new set of rules or blocking keys or view existing ones go to Step settings > Find duplicates settings, or from the Duplicate stores screen either click Create new Duplicate store or select the Edit details action on the action menu of an existing Duplicate store.
When configuring blocking keys, you can set the blocking key limit, which is the limit at which point potential matches generated by the blocking key value are ignored. The larger the block, the more comparisons need to be made. A block of 500 records would need almost 125k comparisons as every record needs to be compared with every other record. You can set a limit for each key individually.
When configuring rules, the following options can be selected:
Aperture Data Studio provides default Find duplicates step settings for use with the Find duplicates step which can be found in Step settings > Find duplicates settings:
Default blocking keys and rules are provided for Australia (AUS), Great Britain (GBR), and United States (USA) as detailed in the table below:
| Name | Summary |
|---|---|
| AUS_Individual_Default | Default Australia individual level rules and blocking keys based on name and address |
| AUS_Household_Default | Default Australia household level rules and blocking keys based on surname (last name) only and address |
| AUS_Location_Default | Default Australia location level rules and blocking keys based on address only |
| GBR_Individual_Default | Default United Kingdom individual level rules and blocking keys based on name and address |
| GBR_Individual_Alternative | Alternative United Kingdom individual level rules and blocking keys based on name and address that may produce better results for large cities |
| GBR_Household_Default | Default United Kingdom household level rules and blocking keys based on surname (last name) only and address |
| GBR_Location_Default | Default United Kingdom location level rules and blocking keys based on address only |
| USA_Individual_Default | Default United States of America individual level rules and blocking keys based on name and address |
| USA_Household_Default | Default United States of America household level rules and blocking keys based on surname (last name) only and address |
| USA_Location_Default | Default United States of America location level rules and blocking keys based on address only |
The summary of each step setting is included to explain the purpose of the blocking keys and rules. The details of a step setting can also be viewed when clicked in the Step settings list screen.
Find duplicate settings can be configured at an Environment level to be available by all Spaces in the Environment or at Space level: