Summary

Depending on the defined options, this operator deletes certain duplicated data records in the input table.

Configuration

Input settings of existing table

Name

Value

Opt.

Description

Example

Columns to be checked

System.Object

-

Which columns should be checked for identical values? Enter the required column identifiers, e.g.: A,C-H,K

-

Settings

Name

Value

Opt.

Description

Example

Case-sensitive?

System.Boolean

opt.

When checking for sameness, should upper and lower case letters be considered?

-

Action

System.String

  • With identical rows, delete all but the last
  • With identical rows, delete all but the first
  • For identical lines - delete last line
  • For identical lines - delete first line
  • Only identical rows - delete all but the last
  • Only identical rows - delete all but the first
  • Keep all duplicate records

-

What should be deleted if rows in the selected columns have identical values?

-

Additional column for data record definitions

System.Boolean

opt.

Should an additional column showing the number of duplicate data records be output?

-

Name of the result column

System.String

opt.

Input of the name of the result column.

-

Sort result according to the number of data records?

System.String

  • No sorting
  • Sort ascending
  • Sort descending

opt.

Should the result be sorted by the number of identical data records?

-

Data already sorted?

System.Boolean

opt.

If "data already sorted" is not selected, then sorting takes place after the identification columns.

-

Want to learn more?

Examples

Example: How many different shift types are in the data?

Situation

The following data node (excerpt) contains shifts worked by different employees, as indicated by column D "Shift_Type".

In order to investigate how many different shift types the employees have worked, we use the operation "Delete duplicated data records".

Settings

The column to be checked is D containing the shift types

Result

The results are shown in the following data node. In total, 8 "S" shifts, 4 "N" shifts, and 2 "F" shifts have been worked in this example.

Project-File

Confluence Op Delete duplicates.gzip


Troubleshooting

Nothing known up to now.

Related topics