Rules and groups

Rules

Rules are the checks that are created to identify and report specific data quality issues and are based on SQL code. To access the list of rules created so far (shown in the screen below), navigate to ( Rules > Rules ) or (Quality > Rules > Rules) in smaller screens.

To  create a new rule, click on ‘Create Rule’ and you will be prompted with the screen below

  • Tags: Attach Tag (To be used in Tag Search)
  • Subject Area: Based of the Data dictionary hierarchy, the Subject Area containing the data that the rule is being applied on.
  • Entity: Based of the Data dictionary hierarchy, the Entity containing the data that the rule is being applied on.
  • Rule group: The Rule Group in which the Rule is assigned to.
  • Name: Name of rule. 
  • Description: Brief description of Rule purpose.
  • Status: Active, Inactive.
  • Severity: Low, Medium, High, Critical. The Severity level attached to the rule.
  • Rule Type: (Accuracy, Completeness, Consistency, Suitability, Timeliness, Validity, Uniqueness, Enterprise Glossary, Critical Data Elements, Customer Terms).
  • Cleanse Type: (Investigation Required, Manual, Automated).
  • Cleanse Group: Team of personnel assigned to rectify the errors (data quality issues) identified by the rule.
  • Cost: The Cost attached to each error identified by the rule. 

After a Rule has been created, you need to follow a few extra steps. In the Rules Window when you click on a specific rule, 4 additional windows appear below the rules window. These are the :

  1. Implementations window
  2. Attributes window
  3. Processes window
  4. Report groups

Therefore for every new rule you need to create at least one new implementation, attach the necessary attributes, attach the necessary processes and attach the necessary Report Groups.  

Creating an Implementation

This is done in the implementations window (shown below).

A Rule can have multiple Implementations. Typically most rules have one implementation each. Multiple implementations would be useful in the case whereby slight variations of the rule needs to be applied. For example, the same rule being applied across 2 Datasets/systems will have 2 Implementations. In this case each implementation will be analyzing a different dataset but they will be looking to identity the same data quality issue.

To Create a new Implementation simply click ‘Create Implementation’ in the ‘Implementations Window' and you will be presented with the screen below.

  • Tags: Attach Tag (To be used in Tag search)
  • Rule: The Rule to which the Implementation belongs to.
  • Name: Name of Implementation
  • Description: Description of the Implementation
  • Status: (In Development, Test, Active, Inactive). It has to be set to ‘Active’ for the implementation to be applied. 
  • Available placeholders: Click on the placeholders available to insert them into the SQL code.
  • SQL code window: This window contains the SQL code that will be executed. The code should contain the location of the data set to be analyzed, logic being used to identify specific data quality issues and structure of results (records identified as having data quality issues) being returned. Clicking on ‘Reset to default’ button provides the base structure of the code to which you can build on according to your needs. Clicking on the ‘Grow’ or ‘Shrink’ button adjusts the size of the code window accordingly.

Attributes, Processes & Report Group windows

  • Attributes window: This window enables you to attach a Rule Implementation to specific Attributes for reporting purposes. Therefore you should be able to view all the data quality issues related to chosen Attributes. Simply click on ‘Assign Attribute’ and select the desired Entity and any of its Attributes to attach.  
  • Processes window: This window enables you to attach a Rule Implementation to specific Processes for reporting purposes. Therefore you should be able to view all the data quality issues related to chosen Processes and their associated costs. Simply click on ‘Assign Process’ and select the desired Process to attach.
  • Report Groups window: This window enables you to attach a Rule Implementation to specific Report Groups for reporting purposes. Therefore you should be able to view all the data quality issues related to chosen Report Groups. Simply click on ‘Assign Group’ and select the desired Report Group to attach. 

Rule Groups

These are a collection of rules concerned with a particular aspect of an Entity. For Example, within a person-type record, there may be a Rule group for the validation of address. This Rule group may contain several Rules such as:

  1. Postcode is in invalid format
  2. Address line 1 is missing
  3. Address line 4 is missing
  4. Building number is in valid format

To Navigate to the Rule Groups go to (Rules > Rules) or (Quality > Rules > Rules Groups) in smaller screens.

To create a new Rule Group click of the ‘Create Rule Group’ button and you will be presented with the screen below.

  • Tags: Attach Tag (To be used in Tag search)
  • Subject Area: Subject Area of Entity that Rule group focuses on.
  • Entity: Entity that is focused on by the Rule Group
  • Name: Name of Rule Group
  • Description: Description of Rule Group

After a Rule Group has been created it will be available as a selection whilst creating Rules and available to attach to relevant elements.

Cleanse Groups

Cleanse groups are Teams/Individuals assigned to resolve designated data quality issues.

To navigate to ‘Cleanse Groups’ go to (Quality > Rules > Cleanse Groups).

To create a new Cleanse Group simply click on the ‘Create Cleanse Group’ button and you will be presented with the screen below.

  • Tags: Attach Tag (To be used in Tag Search)
  • Name: Name of Cleanse Group.
  • Description: Description of Cleanse Group.

After a cleanse group has been created it will be available to be attached to relevant elements. 

Severities

Severities are the subjective classification of the impact of a given Rule. They are associated with cleansing timeframes that express how quickly the organization expects the error to be cleansed. They are customizable but as a default have for levels that include:

  1. Low
  2. Medium
  3. High
  4. Critical

To navigate to severities go to (Quality > Rules > Severities )

To create a new ‘Severity’ simply click on the ‘Create Severity’ button and you will be presented with the window below.

  • Tags: Attach Tag (to be used in Tag Search).
  • Name: Name of Severity.
  • Description: Description of Severity.
  • Level: The figure used to determine the Severity impact. The higher the figure, the more severe the Severity classification.
  • Cleanse Window: The number of days allowed to cleanse the errors attached to the Severity.
  • Colour: The Colour chosen to be associated with the Severity.

After a new Severity has been created it will be available as selection whilst creating Rules.

Value Lists

These are lookup references that can be used whenever necessary while creating Rules. A JOIN statement is the SQL code of the Rule is typically used to access the Value Lists.

To  navigate to Value Lists go to (Quality > Rules > Value Lists). 

Here you can upload, amend and delete value lists as necessary. 

To create a new ‘Value List’ simply click on ‘Create Value List’ and you will be presented with the window below.

  • Name: Name of Value List
  • Description: Description of Value list

The Value Lists contain 3 columns (Value, Description, Notes) that can be populated in anyway that would best suit its purpose.

The ‘Add Row’, ‘Delete Row’ and ‘Delete All Rows’ buttons simply adds and deletes the rows on the Values list as it is displayed on the bottom left of the window.

Processes

These are elements that represent various business processes of the organization that can be attached to the Data quality Rules. Their main purpose is to provide insight into the financial impact of the data quality issues that have been identified by the Rules. 

Therefore a Process is defined/created by determining the cost of a specific business processes failing due to data quality issues then attaching that Process to relevant Rules.

To Navigate to Processes go to (Quality > Rules > Processes)  and you will be presented with the window below containing a list of all the Processes that have been created.

The Processes can be categorized into different ‘Process Categories’. Clicking on a specific Process Category in the Processes Category window will populate the Processes window with a list of all the associated processes.

To create a new Process category simply click on the ‘Create Category’ button on the ‘Process Categories’ window. 

  • Tags: Attach Tag (To be used in Tag search)
  • Name: Name of Process Category
  • Description: Description of Process Category

To Create a new Process simply click on the ‘Create Process’ within the ‘Processes’ window.

  • Tags: Attach Tag (To be used in Tag Search)
  • Process Category: Process Category that Process belongs to
  • Name: Name of Process
  • Description: Description of Process
  • Cost: Cost impact of the Process failing as a result of data quality issues
Aperture Governance Studio

Data Quality