I have this file:
@Muestra-1
agctgcgagctgcgacccgggttatataggaagagacacacacaccccc
+
!@$#%^&*()@^#&HH!&*(@&#*(FT^%$&*()*&^%@
@Muestra-2
agctgcgagctgcgacccgggttatataggaagagacacacacaccccc
+
!@$#%^&*()@^#&HH!&*(@&#*(FT^%$&*()*&^%@
@Muestra-3
agctgcgagctgcgacccgggttatataggaagagacacacacaccccc
+
!@$#%^&*()@^#&HH!&*(@&#*(FT^%$&*()*&^%@
@Muestra-4
agctgcgagctgcgacccgggttatataggaagagacacacacaccccc
+
!@$#%^&*()@^#&HH!&*(@&#*(FT^%$&*()*&^%@
@Muestra-5
agctgcgagctgcgacccgggttatataggaagagacacacacaccccc
+
!@$#%^&*()@^#&HH!&*(@&#*(FT^%$&*()*&^%@
And I would like to use this code to output the entries with identical nucleotide sequences (second line for every forth one). I thought did would do the job but obviously didnt:
awk 'BEGIN{ RS="^@"; FS="\n" } !x[$2]++ '
I am not quite sure what I am doing wrong here