Column sum group by uniq records

Nayanajith · January 26, 2008, 9:44pm

Dear All,

I want to get help for below case.
I have a file like this.

saman 1
gihan 2
saman 4
ravi 1
ravi 2

so i want to get the result,

saman 5
gihan 2
ravi 3 like this.

Pls help me.

Thank you.

KevinADC · January 27, 2008, 12:45am

What have you tried so far?

jaduks · January 27, 2008, 6:22am

This can be done using associative array in awk.


$ cat nayan.out
saman 1
gihan 2
saman 4
ravi 1
ravi 2

$ awk '{arr[$1]+=$2} END {for (i in arr) {print i,arr}}' nayan.out > nayan.out.tmp

$ cat nayan.out.tmp
ravi 3
saman 5
gihan 2

//Jadu

Nayanajith · January 28, 2008, 8:45am

Dear Jadu,

Thank u ! it is working.

Thanks you again,

Nayanajith.

sandeep13 · February 16, 2009, 4:27am

Hi Jadu,

I am new to unix and i have a similar requirement given below:

Input file:

Test.txt
PORT; ID; TOTAL
port1;p1;100000
port2;p2;5000
port1;p1;500

Output file:
PORT; ID; TOTAL
port1;p1; 100500
port2;p2; 5000

How can achive this? Any help on this regard is higly appreciated.
Thanks.

Regards,
Sandeep

ranjithpr · February 16, 2009, 4:41am

Try below script ( Not tested)

awk -F ";" '{ arr[$1 ";" $2] += $3 } END {for (i in arr) {print i ";" arr } }' inputfile

sandeep13 · February 16, 2009, 4:54am

Hi Ranjith,
Thanks for the reply but this doesn't work....can we use like arr[$1 ";" $2]???

Regards,
Sandeep

ranjithpr · February 16, 2009, 5:05am

Try this

awk -F  ";"  '{ string=$1 ";" $2; arr[string] += $3 } END {for (i in arr) {print i ";" arr } }' inputfile

sandeep13 · February 16, 2009, 6:15am

This one also doesn't work....

Franklin52 · February 16, 2009, 6:31am

What doesn't work? Did you get errors, wrong output, no output? Try this:

awk 'BEGIN{FS=OFS=";"}
NR==1{print;next}
{a[$1";"$2]+=$3}
END{for(i in a)print i, a}' file

Use nawk or /usr/xpg4/bin/awk on Solaris.

Regards

sandeep13 · February 16, 2009, 6:46am

Hi Franklin,

I tried ur solution and getting below o/p

PORT;PID;TOTAL
port1;p1
port2;p2

I guess someting is missing, it is doing group by but missing the output values.
Thanks for the response.

Regards,
Sandeep

sandeep13 · February 16, 2009, 6:55am

getting below o/p..missed some values in last post

PORT;PID;TOTAL
port1;p1;500
port2;p2;0

Franklin52 · February 16, 2009, 7:17am

This is what I get:

$ cat file
PORT; ID; TOTAL
port1;p1;100000
port2;p2;5000
port1;p1;500
$
$
$ awk 'BEGIN{FS=OFS=";"}                              
NR==1{print;next}
{a[$1";"$2]+=$3}
END{for(i in a)print i, a}' file
PORT; ID; TOTAL
port2;p2;5000
port1;p1;100500
$
$

Regards

sandeep13 · February 16, 2009, 8:06am

HI Franklin,

Thanks a lot. It works....using nawk

/usr/bin/nawk 'BEGIN{FS=OFS=";"}NR==1{print;next}{a[$1";"$2]+=$3}END{for(i in a)print i, a[i]}' file

Much appreciated.

Cheers,
Sandeep

franklin52:

This is what I get:

$ cat file
PORT; ID; TOTAL
port1;p1;100000
port2;p2;5000
port1;p1;500
$
$
$ awk 'BEGIN{FS=OFS=";"}                              
NR==1{print;next}
{a[$1";"$2]+=$3}
END{for(i in a)print i, a}' file
PORT; ID; TOTAL
port2;p2;5000
port1;p1;100500
$
$

Regards

sandeep13 · May 20, 2009, 2:25am

Hi Guys,

Once again I have a query regarding grouping the columns, below is my requirement:

Input File:
COL1; COL2; AMT1;AMT2
PORT1;CURR1;100;50
PORT1;CURR1;200;100
PORT2;CURR2;300;150
PORT3;CURR3;400;200
PORT3;CURR3;500;250

Expected Output:
COL1; COL2; AMT1;AMT2
PORT1;CURR1;300;150
PORT2;CURR2;300;150
PORT3;CURR3;900;450

How can I pass to values in below command:
/usr/bin/nawk 'BEGIN{FS=OFS=";"}NR==1{print;next}{a[$1";"$2]+=$3}END{for(i in a)print i, a[i]}' <INPUT FILE>

Please suggest so that I can get both AMT1 and AMT2 gouped by COL1 and COL2.
Thanks in advance.

Regards,
Sandeep

aigles · May 20, 2009, 4:14am

nawk '
BEGIN { FS=OFS=";" }
NR==1 { print ; next }
{
   id = $1 ";" $2;
   amt1[id] += $3;
   amt2[id] += $4;
}
END {
   for (id in amt1)
      print id, amt1[id], amt2[id];
}
' inputfile

Jean-Pierre.

Franklin52 · May 20, 2009, 6:10am

Or with a little adjustment of my solution:

awk 'BEGIN{FS=OFS=";"}                              
NR==1{print;next}
{a[$1";"$2]+=$3; b[$1";"$2]+=$4}
END{for(i in a)print i, a, b}' file

sandeep13 · May 20, 2009, 9:57am

Thanks Aigles and Franklin.
I already got it work.

Regards,
Sandeep