Delete duplicated data records
Summary
Depending on the defined options, this operator deletes certain duplicated data records in the input table.
Configuration
Input settings of existing table
Name | Value | Opt. | Description | Example |
---|---|---|---|---|
Columns to be checked | System.Object | - | Which columns should be checked for identical values? Enter the required column identifiers, e.g.: A,C-H,K | - |
Settings
Name | Value | Opt. | Description | Example |
---|---|---|---|---|
Case-sensitive? | System.Boolean | opt. | When checking for sameness, should upper and lower case letters be considered? | - |
Action | System.String
| - | What should be deleted if rows in the selected columns have identical values? | - |
Additional column for data record definitions | System.Boolean | opt. | Should an additional column showing the number of duplicate data records be output? | - |
Name of the result column | System.String | opt. | Input of the name of the result column. | - |
Sort result according to the number of data records? | System.String
| opt. | Should the result be sorted by the number of identical data records? | - |
Data already sorted? | System.Boolean | opt. | If "data already sorted" is not selected, then sorting takes place after the identification columns. | - |
Want to learn more?
Examples
Example: How many different shift types are in the data?
Situation | The following data node (excerpt) contains shifts worked by different employees, as indicated by column D "Shift_Type". In order to investigate how many different shift types the employees have worked, we use the operation "Delete duplicated data records". |
---|---|
Settings | The column to be checked is D containing the shift types |
Result | The results are shown in the following data node. In total, 8 "S" shifts, 4 "N" shifts, and 2 "F" shifts have been worked in this example. |
Project-File |
Troubleshooting
Nothing known up to now.
Related topics
- Operators (alphabetical overview)
- Cut values
- Delete rows 2.0
- Using the Formula operator (row-by-row) (5.0) Extras, it is possible to integrate a "traffic light" for certain values as a next step.