sed - print only matching regex

domi55 · May 11, 2009, 9:51am

Hi folks,

Lets say I have the following text file:

name, lastname, 1234, name.lastname@test.com
name1, lastname1, name2.lastname2@test.com, 2345
name, 3456, lastname, name3.lastname3@test.com
4567, name, lastname, name4.lastname4@test.com

I now need the following output:

Is 'sed' the right way? If yes: How can I print out just the matching regex?

sed -n '/[0-9]{4}/p'

jim_mcnamara · May 11, 2009, 10:16am

awk?

 awk -F', '   '{ for(i=1; i<=NF; i++) if($i ~/[0-9]{4}/) {print $i} }' filename

durden_tyler · May 11, 2009, 10:19am

Or maybe perl ?

perl -ne '{chomp; @x=split/, /; foreach $item (@x){$item =~ /\d{4}/ && print $item,"\n"}}' input.txt

tyler_durden

ghostdog74 · May 11, 2009, 10:26am

# perl -ne 'print  if s/.*(\d{4}).*/\1/' file
1234
2345
3456
4567

domi55 · May 11, 2009, 10:45am

OMG. Thanks so much guys.

But unfortunately I prefere awk

@ jim mcnamara:

How can I just print out four (4) characters.

jim_mcnamara · May 11, 2009, 10:51am

If the field is four numbers, and you have -F', ' (with a modern awk) leading and trailing delimiters will be removed. In this case I set delimiters as space and comma. You may need to add other whitepsace characters like tab to the -F option argument. I don't know what is in your file.

Try

awk -F', '   '{ for(i=1; i<=NF; i++) if($i ~/[0-9]{4}/) {print $i} }' filename | od -c | more

to see what kinds of extra characters you are getting. -F '[^0-9]' makes everything that is not a number a field delimiter, if you cannot tell what to exclude