urgent<parsing data from a excel file>

Hi all,
I wud like to get ur assistance in retrieving lines containing l1.My excel dataset contains around 8000 lines.I converted it into a text tab delimiter file and got the lines containing l1,My output is a list of lines containing l1 saved in a outfile.Some of d lines from my outfile s shown below;

"NM_001354","chr10","-",5021965,5050207,"Y","LINE,LTR,","L1,MaLR,","+,-,","5039541,5050000,","5039749,5050345,","5'UTR,5'UTR,"
"NM_001494","chr10","-",5847192,5895379,"N","LINE,","L1,","+,","5850340,","5850974,","5'UTR,"
"NM_012311","chr10","-",7837373,7869950,"N","LINE,","L1,","+,","7837122,","7837639,","3'UTR,"
"NM_001033855","chr10","-",14988878,15036100,"N","SINE,LINE,SINE,SINE,LINE,","Alu,L1,Alu,Alu,L1,","-,-,-,-,-,","14988880,14989317,14989863,14990031,14990305,","14989154,14989863,14990030,14990305,14990395,","3'UTR,3'UTR,3'UTR,3'UTR,3'UTR,"
"NM_001033855","chr10","-",14988878,15036100,"N","SINE,LINE,SINE,SINE,LINE,","Alu,L1,Alu,Alu,L1,","-,-,-,-,-,","14988880,14989317,14989863,14990031,14990305,","14989154,14989863,14990030,14990305,14990395,","3'UTR,3'UTR,3'UTR,3'UTR,3'UTR,"
"NM_001033857","chr10","-",14988878,15036100,"N","SINE,LINE,SINE,SINE,LINE,SINE,","Alu,L1,Alu,Alu,L1,MIR,","-,-,-,-,-,+,","14988880,14989317,14989863,14990031,14990305,15029509,","14989154,14989863,14990030,14990305,14990395,15029704,","3'UTR,3'UTR,3'UTR,3'UTR,3'UTR,5'UTR,""NM_001354","chr10","-",5021965,5050207,"Y","LINE,LTR,","L1,MaLR,","+,-,","5039541,5050000,","5039749,5050345,","5'UTR,5'UTR,"
"NM_001494","chr10","-",5847192,5895379,"N","LINE,","L1,","+,","5850340,","5850974,","5'UTR,"
"NM_012311","chr10","-",7837373,7869950,"N","LINE,","L1,","+,","7837122,","7837639,","3'UTR,"
"NM_001033855","chr10","-",14988878,15036100,"N","SINE,LINE,SINE,SINE,LINE,","Alu,L1,Alu,Alu,L1,","-,-,-,-,-,","14988880,14989317,14989863,14990031,14990305,","14989154,14989863,14990030,14990305,14990395,","3'UTR,3'UTR,3'UTR,3'UTR,3'UTR,"
"NM_001033855","chr10","-",14988878,15036100,"N","SINE,LINE,SINE,SINE,LINE,","Alu,L1,Alu,Alu,L1,","-,-,-,-,-,","14988880,14989317,14989863,14990031,14990305,","14989154,14989863,14990030,14990305,14990395,","3'UTR,3'UTR,3'UTR,3'UTR,3'UTR,"
"NM_001033857","chr10","-",14988878,15036100,"N","SINE,LINE,SINE,SINE,LINE,SINE,","Alu,L1,Alu,Alu,L1,MIR,","-,-,-,-,-,+,","14988880,14989317,14989863,14990031,14990305,15029509,","14989154,14989863,14990030,14990305,14990395,15029704,","3'UTR,3'UTR,3'UTR,3'UTR,3'UTR,5'UTR,"

You may notice some lines containing only "L1," and some lines containing "L1,MaLR," and so on.I would like to retrieve the lines containing only "L1,"into a separate outfile and the other lines into another outfile.I need a regular expression for matching this.

Thank you,

Regards,
Sayee.

---------- Post updated at 10:13 PM ---------- Previous update was at 10:09 PM ----------

Sorry friends,
I need to parse the data using perl!!!

Negative lookahead assertion can help you.

$ echo 'L1' | perl -ne 'print if /L1(?!MaLR)/'
L1
$ echo 'L1,MaLR' | perl -ne 'print if /L1(?!,MaLR)/'

Or try awk anyway:

awk -F'","' '$5 == "L1,"' infile > outfile
$> cat outfile
"NM_001494","chr10","-",5847192,5895379,"N","LINE,","L1,","+,","5850340,","5850974,","5'UTR,"
"NM_012311","chr10","-",7837373,7869950,"N","LINE,","L1,","+,","7837122,","7837639,","3'UTR,"
"NM_001494","chr10","-",5847192,5895379,"N","LINE,","L1,","+,","5850340,","5850974,","5'UTR,"
"NM_012311","chr10","-",7837373,7869950,"N","LINE,","L1,","+,","7837122,","7837639,","3'UTR,"

@thegeek---->is it possible to use this coz am concentrating only on L1 and i hu gotta large dataset which contains a no of Mal1 lik elements.hope u get me!!!i need a pattern match which shud exactly match the lines with "L1," read in an outfile and d remaining lines read in another outfile.I will show u my script which check i used to get the output shown above;

#!/usr/bin/perl -w

use strict;
use warnings;

# declaration
my $file = "dataset1.txt";
my $pattern = "L1";

#opens the file using file handle
open(IN,$file) or die "Cannot open file \"$file\n";

print "sayee";
#reading the file into array
while(my @lines = <IN>)
{
#going through each line and checking for pattern
foreach my $lines(@lines)
   {
  while($lines =~/$pattern/g)
     {
 
     #printing to out file
     open OUT, ">>outsai.txt" or die ;
     print OUT $lines;
   
      }
   }
}
$ echo 'L1' | perl -ne 'print if /L1(?!MaLR)/'
L1
$ echo 'L1,MaLR' | perl -ne 'print if /L1(?!,MaLR)/' 

Sorry, am not able to get what you exactly mean. But the thing is clear that when you want only the data without L1,MaLR and only MaLR use the above regex ?!

What is the confusion ?

If you have doubt still, try with sample data and tell us what your problem.

Not clear what you want.
Always post the input and the output also.

 
sed -n '/"L1,"/p' myfile            # L1 only. Simple straight
sed -n '/"L1,[[:alpha:]]/p' myfile  # L1 with MaLR. Regexp
sed -n '/"L1,[[:alnum:]]/p' myfile  # L1 with MaLR99. Regexp.
sed -n '/"L1,[^"]/p' myfile           # L1 with any thing except ". Regexp.

If you want, you can directly write to two diff. files in one shot with sed.