Unix Linux Community

Remove duplicates within row and separate column

Shell Programming and Scripting

manigrover August 10, 2012, 5:41am 1

Hi all

I have the following kind of input file:

ESR1 PA156 leflunomide PA450192 leflunomide
CHST3 PA26503  docetaxel  Pa4586; thalidomide Pa34958; decetaxel docetaxel docetaxel

I want to remove the duplicate entries within each row and split the row into columns: anything before or after a PAxxxx entry, or anything separated by a ; sign, should go into its own column. The output should look like this:

ESR1 PA156  leflunomide PA450192
CHST3 PA26503 docetaxel  Pa4586 thalidomide Pa34958

cabrao August 10, 2012, 11:42am 2

You can start from this thread on removing duplicate words in a line:

http://www.unix.com/shell-programming-scripting/104953-remove-duplicate-words-line.html
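
For this particular input, a minimal awk sketch might look like the following. It is only a starting point under a few assumptions: fields are separated by whitespace, a trailing ; is just a delimiter and can be dropped, and two words count as duplicates only when they match exactly (case-sensitive). The function name dedupe_rows is made up for illustration.

```shell
#!/bin/sh
# dedupe_rows (hypothetical name): for each input line, strip a trailing ';'
# from every field and print each distinct word once, in first-seen order.
dedupe_rows() {
    awk '{
        split("", seen)                  # portable way to empty the array for each line
        out = ""
        for (i = 1; i <= NF; i++) {
            word = $i
            sub(/;$/, "", word)          # assumption: a trailing ";" is only a separator
            if (word == "") continue     # skip a field that was just ";"
            if (!(word in seen)) {       # exact, case-sensitive comparison
                seen[word] = 1
                out = (out == "" ? word : out " " word)
            }
        }
        print out
    }' "$@"
}
```

Called as dedupe_rows inputfile (or with the data on standard input), the two sample rows come out as:

ESR1 PA156 leflunomide PA450192
CHST3 PA26503 docetaxel Pa4586 thalidomide Pa34958 decetaxel

Note that decetaxel stays in the second row, because it is a different string from docetaxel. Dropping misspelt variants as in the desired output would need an extra rule that the thread does not specify. The output here is separated by single spaces; change the " " in the concatenation to "\t" if tab-separated columns are wanted.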