Hello,
I am new to shell scripting. I have a huge file with multiple columns for example:
I have 5 columns below.
HWUSI-EAS000_29:1:105 + chr5 76654650 AATTGGAA HHHHG
HWUSI-EAS000_29:1:106 + chr5 76654650 AATTGGAA B@HYL
HWUSI-EAS000_29:1:108 + chr5 76654650 AATTGGAA C:)ADH
HWUSI-EAS000_29:1:110 - chr6 86754325 GATCGTAA YYCHY
I want to remove duplicates based on column 4 (7664650). In the above case it should list me only row1 and row 4
Any help on this is greatly appreciated.
Thanks,
Diya