What options exist to handle duplicate records?

Sep 10, 2014
Gary Robinson agent wrote
For example, what if I have transactions by ID and want to sum them by ID, but also keep other information about each ID, such as gender and age?
1 Answer
Sep 10, 2014
Gary Robinson agent wrote
You can give each record in your file a unique ID by creating a new dataset view and adding a new field with the formula below.

UID = substring(MD5(RAND(748319383)),1,10)

Then create another dataset view from the previous one, aggregate the fields of interest by your non-unique ID, and also compute the minimum value of the new unique ID field. Call this Min UID.

Finally, join the aggregated view you just created to the previously created view on UID = Min UID. This keeps one record per original ID, carrying all of the information from whichever record happened to have the minimum UID, and appends the aggregated information to it.
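For anyone who wants to check the logic outside the tool, the three steps can be sketched in plain Python. This is only an illustration under stated assumptions: the sample rows, the field names (`amount`, `total_amount`, `min_uid`) and the helper `make_uid` are hypothetical, and the UID only loosely mirrors the formula above (Python's own random generator, then the first 10 hex characters of an MD5 digest), so the values will not match what the product generates.

```python
import hashlib
import random
from collections import defaultdict

# Hypothetical sample data: several transactions per non-unique ID.
transactions = [
    {"id": 1, "gender": "F", "age": 34, "amount": 10.0},
    {"id": 1, "gender": "F", "age": 34, "amount": 5.0},
    {"id": 2, "gender": "M", "age": 51, "amount": 7.5},
    {"id": 2, "gender": "M", "age": 51, "amount": 2.5},
    {"id": 2, "gender": "M", "age": 51, "amount": 1.0},
    {"id": 3, "gender": "F", "age": 27, "amount": 20.0},
]

# Seed reused from the formula above purely for reproducibility (assumption).
rng = random.Random(748319383)


def make_uid():
    """Hypothetical helper: first 10 hex chars of the MD5 of a random number."""
    return hashlib.md5(str(rng.random()).encode()).hexdigest()[:10]


# Step 1: add a unique ID to every record.
with_uid = [dict(row, uid=make_uid()) for row in transactions]

# Step 2: aggregate by the non-unique ID and track the minimum UID per ID.
totals = defaultdict(float)
min_uid = {}
for row in with_uid:
    totals[row["id"]] += row["amount"]
    if row["id"] not in min_uid or row["uid"] < min_uid[row["id"]]:
        min_uid[row["id"]] = row["uid"]
aggregated = [
    {"id": key, "total_amount": totals[key], "min_uid": min_uid[key]}
    for key in totals
]

# Step 3: join the aggregated rows back to the detail rows on uid == min_uid.
by_min_uid = {agg["min_uid"]: agg for agg in aggregated}
deduplicated = [
    dict(row, total_amount=by_min_uid[row["uid"]]["total_amount"])
    for row in with_uid
    if row["uid"] in by_min_uid
]

for row in deduplicated:
    print(row)
```

Each surviving row keeps its own gender, age and amount and gains the per-ID total. Which row survives for a given ID is arbitrary, since it depends only on which random UID happens to sort lowest. Truncating the hash to 10 characters leaves a small chance of two records sharing a UID, which would let an extra row through the join on large datasets.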