shell script to parse html file

sais · August 26, 2009, 10:50am

hi all,

i have a html file something similar to this.

<tr class="evenrow">
<td class="data">added</td><td class="data">xyz@abc.com</td>
<td class="data">filename.sql</td><td class="modifications-data">08/25/2009 07:58:40</td><td class="data">Added TK prof script</td>
</tr>
<tr class="oddrow">
<td class="data">added</td><td class="data">xyz@abc.com</td><td class="modifications-data">filename2.sql</td><td class="modifications-data">08/25/2009 07:58:40</td><td class="data">Added TK prof script</td>
</tr>

and so till filname100.sql..
i need to fetch the "filename1-100.sql" ... the rest of the html can be ignored ... can anyone help me out with doing this.

vgersh99 · August 26, 2009, 11:10am

To keep the forums high quality for all users, please take the time to format your posts correctly.

First of all, use Code Tags when you post any code or data samples so others can easily read your code. You can easily do this by highlighting your code and then clicking on the # in the editing menu. (You can also type code tags

```text
 and 
```

by hand.)

Second, avoid adding color or different fonts and font size to your posts. Selective use of color to highlight a single word or phrase can be useful at times, but using color, in general, makes the forums harder to read, especially bright colors like red.

Third, be careful when you cut-and-paste, edit any odd characters and make sure all links are working property.

Thank You.

The UNIX and Linux Forums

---------- Post updated at 11:10 AM ---------- Previous update was at 11:06 AM ----------

assuming all the 'files' you're interested in are '.sql' files:

nawk -F'[<>]' '{for(i=1;i<=NF;i++) if($i ~ /.sql$/) print $i}' myHTMLfile.html