Hi Masters,
I knew this isn't a new issue, but couldn't find any similar threads. So, I have to bother you. Here is my input file (genomic data). The file has many sessions, each session seperated by //. Within eash session there is only one ID and GN line.
ID 3HAO_HUMAN STANDARD; PRT; 286 AA.
AC P46952; Q8N6N9;
DT 01-NOV-1995 (Rel. 32, Created)
DT 01-NOV-1995 (Rel. 32, Last sequence update)
DT 10-MAY-2005 (Rel. 47, Last annotation update)
DE 3-hydroxyanthranilate 3,4-dioxygenase (EC 1.13.11.6) (3-HAO) (3-
DE hydroxyanthranilic acid dioxygenase) (3-hydroxyanthranilate
DE oxygenase).
GN Name=HAAO;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
//
ID A4GCT_HUMAN STANDARD; PRT; 340 AA.
AC Q9UNA3;
DT 28-FEB-2003 (Rel. 41, Created)
DT 28-FEB-2003 (Rel. 41, Last sequence update)
DT 13-SEP-2005 (Rel. 48, Last annotation update)
DE Alpha-1,4-N-acetylglucosaminyltransferase (EC 2.4.1.-) (Alpha4GnT).
GN Name=A4GNT;
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606;
//
................
What I need to do is to extract part of line GN, ID and put them into this format. Thanks in advance.
GN ID
HAAO 3HAO_HUMAN
A4GNT A4GCT_HUMAN
.... ....