Help with script

dsravan · November 20, 2009, 4:05pm

I have a comma-delimited file with around 35 fields in each row. The problem is that the 21st field contains data separated by ";". I need the ";" separators in the 21st field only to be replaced by "|". This is a very big file, so I need something in awk that is much faster than regular commands. Can anybody tell me exactly how to do this for just the 21st field?

cfajohnson · November 20, 2009, 4:09pm

awk -F, '{ sub(";","|",$21) } { print }' "$file"

dsravan · November 20, 2009, 4:29pm

This is not working: only the first ; is converted to |.

cfajohnson · November 20, 2009, 4:54pm

awk -F, '{ gsub(";","|",$21) } { print }' "$file"
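For anyone comparing the two commands: sub() replaces only the first match in its target, while gsub() replaces every match. A minimal sketch on a made-up string (the value "x;y;z" is only an illustration, not data from the original file):

```shell
# sub() stops after the first match in the target string.
first_only=$(awk 'BEGIN { s = "x;y;z"; sub(";", "|", s); print s }')

# gsub() replaces every match in the target string.
all_matches=$(awk 'BEGIN { s = "x;y;z"; gsub(";", "|", s); print s }')

printf '%s\n' "$first_only"    # x|y;z
printf '%s\n' "$all_matches"   # x|y|z
```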

momo.reina · November 20, 2009, 11:01pm

sed doesn't work for you?

sed 's/;/|/g' file

cfajohnson · November 21, 2009, 4:19am

That will change all semicolons, not just those in the 21st field.

momo.reina · November 21, 2009, 7:25am

tsk tsk i didn't read the problem :rolleyes:

Franklin52 · November 21, 2009, 7:47am

You forgot the OFS:

awk -F, '{ gsub(";","|",$21) } { print }' OFS="," "$file"
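Why OFS matters here: when a field is modified, awk rebuilds the whole record and joins the fields with OFS, which defaults to a single space. A small sketch, assuming a hypothetical 3-field line in which field 2 stands in for field 21:

```shell
# Hypothetical sample line; field 2 plays the role of field 21.
line='a,x;y;z,c'

# Without OFS: changing $2 rebuilds $0 with the default OFS (a space).
without_ofs=$(printf '%s\n' "$line" | awk -F, '{ gsub(";","|",$2) } { print }')

# With OFS set to a comma, the rebuilt record keeps its commas.
with_ofs=$(printf '%s\n' "$line" | awk -F, '{ gsub(";","|",$2) } { print }' OFS=",")

printf '%s\n' "$without_ofs"   # a x|y|z c
printf '%s\n' "$with_ofs"      # a,x|y|z,c
```

Note that a record whose 21st field contains no ";" is not rebuilt at all, so it keeps its commas either way.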

cfajohnson · November 21, 2009, 8:02am

True, but I would use:

awk -F, -v OFS=, '{ gsub(";","|",$21) } { print }' "$file"

Or:

awk -F, 'BEGIN { OFS = "," } { gsub(";","|",$21) } { print }' "$file"

Franklin52 · November 21, 2009, 8:27am

In that case this should be more comprehensible:

awk 'BEGIN { FS = OFS = "," } { gsub(";","|",$21) } { print }' "$file"

ghostdog74 · November 21, 2009, 8:53am

Putting the variable assignments after the program or in front with -v makes a difference in behavior and usage. It's documented here and here for anyone interested.
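One such difference, as a minimal sketch (the printed values "a" and "b" are only illustrative): an assignment made with -v takes effect before the BEGIN block runs, while an assignment given after the program is processed only when awk reaches it among the file arguments, which is after BEGIN.

```shell
# -v assigns before BEGIN, so BEGIN already sees OFS=","
with_v=$(awk -v OFS=, 'BEGIN { print "a", "b" }')

# An assignment after the program is not yet applied when BEGIN runs,
# so BEGIN still sees the default OFS (a single space).
with_operand=$(awk 'BEGIN { print "a", "b" }' OFS=, </dev/null)

printf '%s\n' "$with_v"         # a,b
printf '%s\n' "$with_operand"   # a b
```

For the commands in this thread it makes no difference, since OFS is only used once the first record is being processed.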