Remove duplicates based on group

Hi,

How can I remove duplicate rows from a file, grouped by another column? For example:

Test1|Test2|Test3|Test4|Test5
Test1|Test6|Test7|Test8|Test5
Test1|Test9|Test10|Test11|Test12
Test1|Test13|Test14|Test15|Test16
Test17|Test18|Test19|Test20|Test21
Test17|Test22|Test23|Test24|Test5

First look at column 1, then remove duplicate rows based on column 5 within each group. Column 1 has two groups, Test1 and Test17, so duplicates in column 5 must be found separately for each group. The expected output is:

Test1|Test2|Test3|Test4|Test5
Test1|Test9|Test10|Test11|Test12
Test1|Test13|Test14|Test15|Test16
Test17|Test18|Test19|Test20|Test21
Test17|Test22|Test23|Test24|Test5

This should work:

awk -F "|" ' ! s[$1,$5]++ ' input-file >output-file

The array s is keyed on columns 1 and 5 together. The expression s[$1,$5]++ is 0 (false) the first time a given pair is seen and non-zero afterwards, so the pattern ! s[$1,$5]++ is true only for the first row of each pair, and awk's default action prints that row.
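To illustrate, here is the one-liner run against the sample data from the thread (assuming pipe-delimited input with no header row):

```shell
# Feed the sample rows straight into awk instead of using a file.
# The second Test1 row is dropped because the (Test1, Test5) pair was
# already seen; the Test17 row ending in Test5 survives because its
# group key is (Test17, Test5), which is new.
printf '%s\n' \
  'Test1|Test2|Test3|Test4|Test5' \
  'Test1|Test6|Test7|Test8|Test5' \
  'Test1|Test9|Test10|Test11|Test12' \
  'Test1|Test13|Test14|Test15|Test16' \
  'Test17|Test18|Test19|Test20|Test21' \
  'Test17|Test22|Test23|Test24|Test5' |
awk -F "|" ' ! s[$1,$5]++ '
```

This prints the five expected rows, with only the second line of the input removed.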

Thanks.