search multiple patterns

I have two lists in a file that look like this:
a b
b a
e f
c d
f e
d c

I would like the final list to be:
a b
c d
e f

I've tried multiple grep and awk commands but can't get it to work.

grep can't do it because it remembers nothing about previous lines. Even in awk you have to tell it to remember, which I do here with the associative array A.

$ echo "a b
b a
e f
c d
f e
d c
" | awk '{
        # get a b order even if we get b a
        if($1 > $2) { tmp=$1; $1=$2; $2=tmp; }
        if(A[$1] != $2)  { print ; A[$1]=$2; }
}' | sort # sort is necessary for your example data, otherwise we get a b \n e f \n ...
a b
c d
e f
$

Try this awk script which sorts each line and removes the dups...

awk '{
   n = split($0,a," ")
   # bubble-sort the fields on this line
   for (i=1; i<n; i++)
     for (j=1; j<=n-i; j++)
       if (a[j] > a[j+1]) {
          t=a[j]
          a[j]=a[j+1]
          a[j+1]=t
       }
   # build a key from the sorted fields
   for (i=1; i<=n; i++)
       v = v a[i]
   # print the line only if its key has not been seen yet
   if (!(v in x)) print $0
   x[v]=v; v=""
}' file
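
For reference, with the sample pairs saved in file, this prints the first occurrence of each pair in input order, so a trailing sort gives the requested list:

$ awk '{ ...script above... }' file | sort
a b
c d
e f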

or Corona's solution in slightly different flavor:

awk '{
  if(a[$1]==$2 || a[$2]==$1) next; 
  a[$1]=$2;  print
}' data

It comes out unsorted, though.
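
If the sorted order matters, piping the same command through sort gives the list from the first post:

$ awk '{
  if(a[$1]==$2 || a[$2]==$1) next
  a[$1]=$2; print
}' data | sort
a b
c d
e f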

I don't know what the input file can look like in general, but with the given example the following works:

fold -w 1 inputfile | sort | uniq | xargs -n2

Or, more simply:

fold -w 1 inputfile | sort -u | xargs -n2
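
In case it's not obvious how that works: fold -w 1 writes one character per line, sort -u then keeps a single copy of each letter (and of the lone space), and xargs -n2 regroups them two per line. A quick illustration on one input line (the second output line is the space character):

$ printf 'a b\n' | fold -w 1
a
 
b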

This would miss entries if you have multiple occurrences in one column:

$ cat data
a b
b a
e f
c d
f e
d c
a c
$ fold -w 1 data | sort | uniq | xargs -n2
a b
c d
e f
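
For comparison, the associative-array approach from the first reply does keep that extra pair (same data, output piped through sort):

$ awk '{
        if($1 > $2) { tmp=$1; $1=$2; $2=tmp; }
        if(A[$1] != $2)  { print ; A[$1]=$2; }
}' data | sort
a b
a c
c d
e f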

godzilla07 did not specify the criteria or the logic needed to get the output in his example, so I am aware that the code I provided may not fit all cases, but I posted it as-is since no further constraints have been specified so far.

I didn't mean to be critical, just pointing it out...

@mirni

No problem dude :smiley: