HI
I have a file that looks like below
abc
{Seq('GATAGC', SingleLetterAlphabet()): 1, Seq('ATAGCG', SingleLetterAlphabet()): 1, Seq('TAGCGG', SingleLetterAlphabet()): 1}
BBC
{Seq('AGGATA', SingleLetterAlphabet()): 1, Seq('GGATAG', SingleLetterAlphabet()): 1, Seq('GATAGC', SingleLetterAlphabet()): 1, Seq('ATAGCG', SingleLetterAlphabet()): 1, Seq('TAGCGG', SingleLetterAlphabet()): 1}
CCB
{Seq('GACGGA', SingleLetterAlphabet()): 1, Seq('ACGGAT', SingleLetterAlphabet()): 1, Seq('CGGATA', SingleLetterAlphabet()): 1, Seq('GGATAG', SingleLetterAlphabet()): 1, Seq('GATAGC', SingleLetterAlphabet()): 1, Seq('ATAGCG', SingleLetterAlphabet()): 1, Seq('TAGCGG', SingleLetterAlphabet()): 1}
I wanted to get something like:
abc GATAGC
abc ATAGCG
abc TAGCGG
BBC AGGATA
---------------
I have been trying to get the solution in python but wth less success. Is there any easy way to do it in awk?