This article describes how to create a new variable that identifies the cases in a data set that have duplicated values in one or more variables. This is useful when you want to identify cases to delete from the data set.
Requirements
- A data set loaded into a Q Project.
Method
- Select one or more variables in the Variables and Questions tab.
- Select Automate > Browse Online Library > Create New Variables > Duplicates.
To change the variables used to identify duplicates:
- Select the Duplicates variable in the Variables and Questions tab.
- Right-click and select Edit R Variable.
- Choose variables in the Variables box.
- Click Update R Variable.
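To make the idea of flagging duplicates concrete, the following is a minimal sketch in base R, run outside Q. It is not the code behind the Duplicates library item, and it may not flag cases in exactly the same way. The data frame `survey`, its variables `id` and `email`, and the names `key.vars`, `later.copy` and `any.copy` are all hypothetical and used only for illustration.

```r
# Hypothetical example data; the variable names are illustrative only.
survey <- data.frame(
  id    = c(101, 102, 103, 102, 104, 101),
  email = c("a@x.com", "b@x.com", "c@x.com", "b@x.com", "d@x.com", "e@x.com"),
  stringsAsFactors = FALSE
)

# Variables whose combined values define a duplicate (an assumed selection).
key.vars <- c("id", "email")
keys <- survey[, key.vars, drop = FALSE]

# duplicated() flags the second and later occurrences of each combination.
later.copy <- duplicated(keys)

# Adding a scan from the last row flags every case in a duplicated set,
# including the first occurrence.
any.copy <- later.copy | duplicated(keys, fromLast = TRUE)

print(data.frame(survey, later.copy, any.copy))
```

In this sketch only rows 2 and 4 share the same `id` and `email`, so they are the only cases flagged by `any.copy`. Rows 1 and 6 share an `id` but not an `email`, so they would be flagged only if `key.vars` were reduced to `"id"`, which mirrors the effect of choosing different variables in the Variables box.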
Next
How to De-duplicate Raw Data Using R