delete two patterns and remove one pattern

ppat7046 · April 7, 2009, 4:17pm

Friends,

I would like to do:
1) Delete line with START
2) Delete line with END
3) Remove ABC|
4) Delete duplicate records

The following command works fine and deletes the lines that begin with START and END:
sed -e '/^START/d' -e '/^END/d' Filename.txt

How do I incorporate task 3 and 4?
NOTE: The file will have more than 500,000 thousand rows.

Thanks in advance for any suggestions,
Prashant

pinnacle · April 7, 2009, 4:35pm

nawk -F'|'  'NF==3{print $2,$3}' patel

tostay2003 · April 7, 2009, 4:57pm

This code does not remove duplicates, and it also drops the pipe delimiter.

Use this:

grep "|" test | sed 's/^ABC|//g' | sort -u
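The same idea can also be built on the sed command from the original question. This is only a sketch: it assumes every data record starts with ABC|, and the sample rows are made up for illustration because the thread does not show the real file.

```shell
# Hypothetical sample input; the real file layout is not shown in the thread.
tmp=$(mktemp)
printf '%s\n' 'START|20090407' 'ABC|123|foo' 'ABC|456|bar' 'ABC|123|foo' 'END|3' > "$tmp"

# Tasks 1 and 2: drop the START and END lines.
# Task 3: strip a leading "ABC|" (in a basic regex, | is a literal character).
# Task 4: sort -u removes the duplicate records.
sed_result=$(sed -e '/^START/d' -e '/^END/d' -e 's/^ABC|//' "$tmp" | sort -u)

rm -f "$tmp"
printf '%s\n' "$sed_result"
```

Because the pattern is anchored with ^, it can match at most once per line, so the g flag is not needed.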

pinnacle · April 7, 2009, 5:02pm

Tostay2003:
What if the data doesn't start with ABC? Then your logic fails.

Use this:
nawk -F'|' 'NF==3{print $2,$3}' patel | sort -u

tostay2003 · April 7, 2009, 5:14pm

I assumed from the author's description that the first field is the same throughout the file.

Use this if you didn't mean that the first field would always be the same.

grep "|" test | cut -d'|' -f2,3 | sort -u

Or, with a slight amendment to the code written by zenith, i.e. by adding OFS.
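The amended command itself does not appear in the post. A minimal sketch of what adding OFS to the earlier awk one-liner might look like is given here, assuming plain awk is available in place of nawk and using made-up sample rows:

```shell
# Hypothetical sample input; the real file layout is not shown in the thread.
sample=$(mktemp)
printf '%s\n' 'START|20090407' 'ABC|123|foo' 'ABC|456|bar' 'ABC|123|foo' 'END|3' > "$sample"

# OFS='|' is applied before the file is read, so the comma in print
# emits "|" between the fields instead of the default space.
ofs_result=$(awk -F'|' 'NF==3 {print $2, $3}' OFS='|' "$sample" | sort -u)

rm -f "$sample"
printf '%s\n' "$ofs_result"
```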

vgersh99 · April 7, 2009, 5:16pm

nawk -F'|' 'NF==3 && !a[$2,$3]++ {print $2,$3}' patel

ppat7046 · April 8, 2009, 9:11am

Thank you all for your replies.

nawk -F'|' 'NF==3 && !a[$2,$3]++ {print $2,$3}' patel

I made a small change to the print statement because it was not printing the | symbol.

nawk -F'|' 'NF==3 && !a[$2,$3]++ {print $2,"|",$3}' patel

However, it prints a space before and after the | symbol.

Thanks,
Prashant
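The extra spaces come from the commas: each comma in a print statement is replaced by the output field separator OFS, which defaults to a single space. A small sketch of the difference, using one made-up record and plain awk in place of nawk (an assumption):

```shell
# Commas in print insert OFS (a space by default) between the arguments.
with_commas=$(printf 'ABC|123|foo\n' | awk -F'|' '{print $2, "|", $3}')

# Writing the values side by side concatenates them with nothing in between.
concatenated=$(printf 'ABC|123|foo\n' | awk -F'|' '{print $2 "|" $3}')

printf '%s\n%s\n' "$with_commas" "$concatenated"
```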

vgersh99 · April 8, 2009, 9:19am

nawk -F'|' 'NF==3 && !a[$2,$3]++ {print $2, $3}' OFS='|' patel
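To see how this one-liner behaves, here is a small sketch on made-up sample rows, with plain awk standing in for nawk (both are assumptions; the real file is not shown in the thread). It also compares the result with the sort -u approach from earlier in the thread.

```shell
# Hypothetical sample input, made up for illustration.
input=$(mktemp)
printf '%s\n' 'START|20090407' 'ABC|456|bar' 'ABC|123|foo' 'ABC|456|bar' 'END|3' > "$input"

# a[$2,$3]++ evaluates to 0 the first time a pair is seen, so !a[$2,$3]++
# is true only for first occurrences; later duplicates are skipped.
# NF==3 filters out the START and END lines, which have fewer fields here.
first_seen=$(awk -F'|' 'NF==3 && !a[$2,$3]++ {print $2, $3}' OFS='|' "$input")

# sort -u also removes duplicates, but it reorders the output.
sorted=$(awk -F'|' 'NF==3 {print $2, $3}' OFS='|' "$input" | sort -u)

rm -f "$input"
printf 'awk only:\n%s\nwith sort -u:\n%s\n' "$first_seen" "$sorted"
```

The awk-only version keeps records in their original order and needs no separate sort pass, at the cost of holding one array entry per distinct pair in memory.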