Conditional removing of words from a line

Hi ,

I have a .csv file,from which I want to remove some data from each column as below.

Source Data

GT_12_AUDIT,SCHEDULED,NOZOMI2010/GT_12_AUDIT,CTSCAN/Zh_GT_6547887/GT_12_AUDIT,CTSCAN/Zh_GT_6547887
GT_12_13_AUDIT,SCHEDULED,NOZOMI2010/GT_12_13_AUDIT,XRAY/XT_TH_6987458/GT_12_13_AUDIT,XRAY/Zh_GT_6547887
GT_12_13_VISITED,SCHEDULED,NOZOMI2010/GT_12_13_VISITED,XRAY/XT_TH_6987458/GT_12_13_VISITED,XRAY/Zh_GT_6547887
Target Data

AUDIT,SCHEDULED,NOZOMI2010/AUDIT,CTSCAN/Zh_GT_6547887/AUDIT,CTSCAN/Zh_GT_6547887
AUDIT,SCHEDULED,NOZOMI2010/AUDIT,XRAY/XT_TH_6987458/AUDIT,XRAY/Zh_GT_6547887
VISITED,SCHEDULED,NOZOMI2010/VISITED,XRAY/XT_TH_6987458/VISITED,XRAY/Zh_GT_6547887

I really don't have any clue on how to do this, please give me some logic to do the same.

Regards,Deepti

Try if this sed fills your requirment :

sed 's/\(GT_[0-9][0-9]_\)\|\(GT_[0-9][0-9]_[0-9][0-9]_\)//g'

Hi Deepti,

u can try as below also...

sed 's/GT_12_13_//g; s/GT_12_//g' <file_name>

:slight_smile:

Alternate Sed

sed 's/GT_.._//g ; s/[0-9][0-9]_//g' inputfile > outfile