Hi all,
I have 2 files in which I have to find the common entries in column 1, and if an entry is common, write the other data from both files next to it, as described below.
The first file has 2 columns, gene symbol and disease name:
column 1 column 2
ARFGEF2 CAD
DDEF2 CAD
PSCD3 CAD
PSCD4 CAD
CAMK1 CAD,HT,HT,HT,HT,HT,HT,HT,HT,HT,HT,HT,HT
HSP90AA1 CAD,CAD,CAD,T2D,T2D
KDR CAD,CD,CD
VEGF CAD,CAD,CAD,CAD,T2D,T2D,T2D
CTNNA3 CAD,HT,T2D
PTPRM CAD,T2D
RAC2 CAD,CAD,T1D,T1D
SMAD3 CAD,T2D,T2D,T2D,T2D,T2D,T2D,T2D
SORBS1 CAD,CAD,CAD
CD36 CAD
IRS1 CAD,CAD,CAD
IRS2 CAD,CAD,CAD,CAD
MTFMT CAD,CAD,CAD,T1D,T1D,T1D
SARS CAD
GNPDA2 CAD
NANS CAD
SRD5A1 CAD
The second file has 3 columns and looks like this:
Gene symbol drug drug
F2 Lepirudin Refludan
FCGR2A,FCGR2B,FCGR2C,EGFR,FCGR3B,C1R,C1QA,C1QB,C1QC,FCGR3A,C1S,FCGR1A Cetuximab Erbitux
Not Available Dornase Alfa Pulmozyme
IL2RA,IL2RB,IL2RG Denileukin diftitox Ontak
C1S,C1R,C1QA,C1QB,C1QC,TNF,TNFRSF1B,FCGR1A,FCGR3A,FCGR2A,FCGR2B,FCGR2C,LTA,FCGR3B Etanercept Enbrel
F2 Bivalirudin Angiomax
GNRHR Leuprolide Eligard
IFNAR2,IFNAR1 Peginterferon alfa-2a Pegasys
PLG,FGA,PLAUR,SERPINE1 Alteplase Activase (Genentech Inc
The expected output contains 4 columns:
common entry from the first column, disease from the first file, drug from the second file (column 2), drug from the second file (column 3)
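
One possible way to do this join, as a minimal sketch in Python. It rests on assumptions that are not stated above: both files are tab-separated (the drug names contain spaces, so splitting on whitespace would not work), and a gene counts as common when it appears anywhere in the comma-separated gene list of the second file. The function names and the sample rows with DrugA/BrandA are hypothetical placeholders, not real data.

```python
import csv
import io


def load_diseases(handle):
    """Map each gene symbol in file 1 to its disease string."""
    diseases = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) >= 2:
            # If a gene appears twice in file 1, the later row wins.
            diseases[row[0].strip()] = row[1].strip()
    return diseases


def join_on_gene(disease_handle, drug_handle):
    """Return (gene, disease, drug, drug) tuples for genes found in both files."""
    diseases = load_diseases(disease_handle)
    joined = []
    for row in csv.reader(drug_handle, delimiter="\t"):
        if len(row) < 3:
            continue  # skip blank or incomplete lines
        # Assumption: column 1 of file 2 may hold several
        # comma-separated gene symbols, so each one is checked.
        for gene in row[0].split(","):
            gene = gene.strip()
            if gene in diseases:
                joined.append((gene, diseases[gene], row[1].strip(), row[2].strip()))
    return joined


if __name__ == "__main__":
    # Hypothetical sample rows, only to show the shape of the output.
    file1 = io.StringIO("KDR\tCAD,CD,CD\nCD36\tCAD\n")
    file2 = io.StringIO("KDR,F2\tDrugA\tBrandA\nF2\tDrugB\tBrandB\n")
    for record in join_on_gene(file1, file2):
        print("\t".join(record))
```

For real files, the two `io.StringIO` objects could be replaced with `open(path, newline="")` handles. If the files are separated by something other than tabs, the `delimiter` argument would need to change accordingly.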