My input is listed as:
giNumber RefAminoAcid VarAminoAcid
10190711 P P
10190711 D D
109255248 I A
110349771 A D
My desired output is:
giNumber RefAminoAcid VarAminoAcid
109255248 I A
110349771 A D
*Those with same amino acid, I want delete it and just remain those different amino acid one at the end.
What command line I should type?
Thanks you and appreciate your advise.
Actually that first field is not an amino acid. Field 2 and field 3 must be different from each other. If this understanding is right, then
$
$ cat file
giNumber RefAminoAcid VarAminoAcid
10190711 P P
10190711 D D
109255248 I A
110349771 A D
$
$
$ awk '$2 != $3' file
giNumber RefAminoAcid VarAminoAcid
109255248 I A
110349771 A D
$
$