Hello everyone,
I have to normalize this dataset (about 20,000 rows; a sample is shown here):
2,4,4,3,2,7,8,2,9,11,7,7,1,8,5,6
4,7,5,5,5,5,9,6,4,8,7,9,2,9,7,10
7,10,8,7,4,8,8,5,10,11,2,8,2,5,5,10
4,9,5,7,4,7,7,13,1,7,6,8,3,8,0,8,8
6,7,8,5,4,7,6,3,7,10,7,9,3,8,3,7,8
using this formula: value=($1*mean)/standard_deviation, but I can't figure out how to do the normalization.
I wrote this awk script to calculate the standard deviation and mean:
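BEGIN{
FS=","
}
{
# Keep a separate sum and sum of squares per column; i<=NF includes the last field
for(i=1;i<=NF;i++)
{
total[i]+=$i;
totalSquared[i]+=$i^2;
count[i]++;
}
# Rows may differ in length, so remember the widest one
if(NF>numberColumn) numberColumn=NF;
}
END{
# Print mean|standard deviation (population form) for each column
for (i=1;i<=numberColumn;i++)
{
media=total[i]/count[i];
printf("%.2f|%.2f\n",media,sqrt((totalSquared[i]/count[i])-media^2));
}
}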
Can anyone help me figure this out?
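One possible approach, as a rough sketch only: read the file twice in a single awk call, collecting per-column sums on the first pass and printing the normalized values on the second. This assumes the intended normalization is the usual z-score, (value - mean) / standard_deviation, computed per column with the population standard deviation. The function name `normalize`, the four-decimal output format, and the choice to print 0 for a column whose standard deviation is 0 are all assumptions made for illustration.

```shell
#!/bin/sh
# normalize FILE: print a z-scored copy of a comma-separated numeric file.
# Hypothetical helper; the file is passed to awk twice so it can be read in two passes.
normalize() {
    awk -F, '
    # Pass 1 (NR == FNR holds only while reading the first copy):
    # accumulate per-column sum, sum of squares and value count.
    NR == FNR {
        for (i = 1; i <= NF; i++) {
            sum[i]   += $i
            sumsq[i] += $i * $i
            n[i]++
        }
        next
    }
    # Pass 2: rewrite every field as (value - mean) / sd.
    {
        for (i = 1; i <= NF; i++) {
            mean = sum[i] / n[i]
            var  = sumsq[i] / n[i] - mean * mean
            sd   = (var > 0) ? sqrt(var) : 0
            # Assumption: a constant column (sd = 0) is printed as 0.
            z    = (sd > 0) ? ($i - mean) / sd : 0
            printf "%s%.4f", (i > 1 ? "," : ""), z
        }
        print ""
    }' "$1" "$1"
}
```

It could be called as `normalize data.csv > normalized.csv`, where both file names are placeholders. Counting values per column (`n[i]`) rather than dividing by the number of rows keeps the means correct even when some rows have more fields than others.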