finding lines only between a certain string

aismann · October 19, 2008, 11:54pm

Dear experts,

I've been trying to figure this out for a while, but I can't. Please help.
I have a file with approximately 1 million lines. The records are separated by lines of dashes ("----------"). Please see the example below.

M-GET CONFIRMATION (
INVOKE IDENTIFIER 346,
LINKED IDENTIFIER 1,
MANAGED OBJECT CLASS alarmRecord,
MANAGED OBJECT INSTANCE {
logId = string : "AIALARM",
logRecordId = number : 341862
},
CURRENT TIME "20080626105012",
ATTRIBUTE LIST {
objectClass alarmRecord,
nameBinding logRecord-log,
managedObjectClass sypObjMAL,
managedObjectInstance { sypAlarmObjectId = "MAL" },
eventType communicationsAlarm,
eventTime "20080609032818",
logRecordId number : 341862,
loggingTime "20080609032819",
packages {
GAAGDA1C.correlatedNotificationsPackage,
GAAGDA1C.additionalInformationPackage, eventTimePackage
},
probableCause SAXBAA0C.lossOfSignal,
perceivedSeverity cleared,
correlatedNotifications { { correlatedNotifications { 23147 } } },
additionalInformation {
{
identifier alarmIdentification,
information AlarmIdentification : "CLASS=PCMMAL LTG=13-08 DIU=1"
}
}
}
)
--------------------------------------------------------------------------------------------------------------------------
M-GET CONFIRMATION (
INVOKE IDENTIFIER 347,
LINKED IDENTIFIER 1,
MANAGED OBJECT CLASS alarmRecord,
MANAGED OBJECT INSTANCE {
logId = string : "AIALARM",
logRecordId = number : 341863
},
CURRENT TIME "20080626105012",
ATTRIBUTE LIST {
objectClass alarmRecord,
nameBinding logRecord-log,
managedObjectClass chargingFailure,
managedObjectInstance { chargingFailureId = 1 },
eventType processingErrorAlarm,
eventTime "20080609173506",
logRecordId number : 341863,
loggingTime "20080609173507",
packages {
specificProblemsPackage, notificationIdentifierPackage,
proposedRepairActionsPackage, additionalTextPackage,
eventTimePackage
},
probableCause SAXBAA0C.storageCapacityProblem,
specificProblems { specProb-chargingFailure-sp5 },
perceivedSeverity critical,
notificationIdentifier 33589625,
proposedRepairActions { propRA-chargingFailure-alarm1 },
additionalText "SAMAR FULL : IA.ICITR"
}
)

So my problem is: I need to find all records that contain the keyword "GAA", and for each match I need to see everything between the ---------- separator lines. I desperately need a script to do this. Any help is appreciated. Thanks!

avis1981 · October 20, 2008, 12:05am

Can you provide a sample of the expected output?

aismann · October 20, 2008, 12:08am

The expected output is every record in the file that contains the search pattern "GAA", printed in full in the format below:

M-GET CONFIRMATION (
INVOKE IDENTIFIER 109,
LINKED IDENTIFIER 1,
MANAGED OBJECT CLASS stateChangeRecord,
MANAGED OBJECT INSTANCE {
logId = string : "AIALARM",
logRecordId = number : 341650
},
CURRENT TIME "20080626105008",
ATTRIBUTE LIST {
objectClass stateChangeRecord,
nameBinding logRecord-log,
managedObjectClass KMMQGC0C.lic,

eventType stateChange,
eventTime "20080608013857",
logRecordId number : 341650,
loggingTime "20080608013857",
packages { eventTimePackage },
stateChangeDefinition {
{
attributeID GAAGDA1C.operationalState,
oldAttributeValue GAAASA1C.OperationalState : disabled,
newAttributeValue GAAASA1C.OperationalState : enabled
}
}
}
)
--------------------------------------------------------------------------------------------------------------------------
M-GET CONFIRMATION (
INVOKE IDENTIFIER 110,
LINKED IDENTIFIER 1,
MANAGED OBJECT CLASS alarmRecord,
MANAGED OBJECT INSTANCE {
logId = string : "AIALARM",
logRecordId = number : 341651
},
CURRENT TIME "20080626105008",
ATTRIBUTE LIST {
objectClass alarmRecord,
nameBinding logRecord-log,
managedObjectClass sypObjMAL,
managedObjectInstance { sypAlarmObjectId = "MAL" },
eventType communicationsAlarm,
eventTime "20080608013858",
logRecordId number : 341651,
loggingTime "20080608013859",
packages {
GAAGDA1C.correlatedNotificationsPackage,
GAAGDA1C.additionalInformationPackage, eventTimePackage
},
probableCause SAXBAA0C.lossOfSignal,
perceivedSeverity cleared,
correlatedNotifications { { correlatedNotifications { 29463 } } },
additionalInformation {
{
identifier alarmIdentification,
information AlarmIdentification : "CLASS=PCMMAL LTG=13-05 DIU=1"
}
}
}
)

Annihilannic · October 20, 2008, 12:42am

Try this:

awk '
        # new record, reset array index
        /^----/ { i=0 }
        # accumulate record contents in array
        { a[++i]=$0 }
        # matching record
        /GAA/ {
                # dump array contents
                for (j=1; j<=i; j++)
                        print a[j]
                # get the rest of this record; stop at the terminator or at end of file
                while ((r = getline) > 0 && $0 !~ /^----/)
                        print
                # print the record terminator (only if one was read) and reset the array
                if (r > 0)
                        print
                i=0
        }
' inputfile > outputfile
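
An alternative sketch, not from the thread: buffer each record and decide at the separator line whether to print it, which avoids getline altogether. The function name print_matching_records and the variable pat are made up for illustration, and it assumes an awk that supports -v (on Solaris that may mean nawk or /usr/xpg4/bin/awk rather than /usr/bin/awk).

```shell
#!/bin/sh
# Hypothetical helper: print every record containing a pattern.
#   $1 = pattern (an awk regular expression), $2 = input file
print_matching_records() {
    awk -v pat="$1" '
        # separator line: emit the buffered record if it matched, then start over
        /^----/ {
            if (found) printf "%s%s\n", buf, $0
            buf = ""; found = 0
            next
        }
        # ordinary line: add it to the buffer and remember whether it matched
        {
            buf = buf $0 "\n"
            if ($0 ~ pat) found = 1
        }
        # the last record may have no trailing separator
        END { if (found) printf "%s", buf }
    ' "$2"
}
```

It would be called as print_matching_records GAA inputfile > outputfile. Because a record is printed only once its separator is reached, every matching record comes out followed by its own separator line.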

aismann · October 20, 2008, 1:16am

Annihilannic,

I tried the script. It took about 3 minutes to run but produced no output; outputfile was empty.

Annihilannic · October 20, 2008, 1:34am

Strange, works fine for me with the sample data you provided. What operating system are you using? Are there any spaces at the beginning of the lines?

aismann · October 20, 2008, 1:44am

The file starts like this

root@ckpgpay11core> more /tmp/AIALARM_MSNLA_20080626.txt
MSNLA/UCR40_23MSC_SSNC_EC1C00 6/26/2008 10:50:03 AM
8-15475 KPGCMCS01/AZAHXX#1

DISPEVREC:Event log=String : "AIALARM";
STARTED

Event log records

M-GET CONFIRMATION (
INVOKE IDENTIFIER 102,
LINKED IDENTIFIER 1,
MANAGED OBJECT CLASS alarmRecord,
MANAGED OBJECT INSTANCE {
logId = string : "AIALARM",
logRecordId = number : 341643
},
CURRENT TIME "20080626105008",
ATTRIBUTE LIST {
objectClass alarmRecord,
nameBinding logRecord-log,
managedObjectClass sypObjMAL,
managedObjectInstance { sypAlarmObjectId = "MAL" },
eventType communicationsAlarm,
eventTime "20080608013830",
logRecordId number : 341643,
loggingTime "20080608013832",
packages {
notificationIdentifierPackage,
GAAGDA1C.additionalInformationPackage, eventTimePackage
},
probableCause SAXBAA0C.lossOfSignal,
perceivedSeverity major,
notificationIdentifier 20195,
additionalInformation {

I'm using Solaris 9.

Annihilannic · October 20, 2008, 1:54am

What's the output of head -50 /tmp/AIALARM_MSNLA_20080626.txt | cat -vet? Can you post it between [ code ] tags instead of [ quote ] tags please?

aismann · October 20, 2008, 3:56am

Oh dear, I'm getting a weird result. The file is filled with ^M. I'm sorry, I'll FTP the file out again. Thanks.

root@ckpgpay11core> head -50 AIALARM_MSNLA_20080626.txt | cat -vet
Mroot@ckpgpay11core>

Annihilannic · October 20, 2008, 6:32pm

No big deal, it just means the file is in DOS/Windows format. You can use dos2unix to convert it (if available on your system), or tr -d '\r' < dosfile > unixfile.
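
A small sketch of checking for carriage returns before and after the conversion. has_cr and strip_cr are hypothetical helper names; the tr command is the one suggested above, and the check assumes an awk that understands \r in a regular expression.

```shell
#!/bin/sh
# Hypothetical helpers for spotting and removing carriage returns.

# succeed (exit status 0) if the file contains at least one carriage return
has_cr() {
    awk '/\r/ { found = 1; exit } END { exit !found }' "$1"
}

# write a copy of $1 to $2 with every carriage return removed
strip_cr() {
    tr -d '\r' < "$1" > "$2"
}
```

For example: if has_cr dosfile; then strip_cr dosfile unixfile; fi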

aismann · October 29, 2008, 10:24pm

Thanks guys

aismann · October 29, 2008, 10:52pm

Dear Annihilannic,

I got the script working. Sorry, but I have another question. I am making the script a little interactive, and have modified it to search for a different string every time I run it. I can't seem to replace the hard-coded "GAA" in the script with a variable. Any idea why? Below is the output of the script when run in debug mode:

root@ckpgpay11core> ./sarascript.sh
+ echo Enter input filename:
Enter input filename:
+ read inputfile
mstgb_aialarm_20081021.wri
+ echo Enter path of outputfile with name:
Enter path of outputfile with name:
+ read outputfile
/tmp/testing
+ echo Enter tag to look for:
Enter tag to look for:
+ read GAA
20081013003507
+ awk
        # new record, reset array index
        /^----/ { i=0 }
        # accumulate record contents in array
        { a[++i]=$0 }
        # matching record
        /\$GAA/ {
                # dump array contents
                for (j=1; j<=i; j++)
                        print a[j]
                # get the rest of this record
                while (getline && $0 !~ /^----/)
                        print
                # print the record terminator and reset the array
                print
                i=0
        }
 mstgb_aialarm_20081021.wri

root@ckpgpay11core> ls -la | grep testing
-rw-r--r--   1 root     other          0 Oct 30 10:47 testing

summer_cherry · October 29, 2008, 11:26pm

perl

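#!/usr/bin/perl
use strict;
use warnings;

# "file" and "pat" are placeholders for the input file and the search pattern
open my $fh, '<', 'file' or die "cannot open file: $!";
my $str = do { local $/; <$fh> };    # slurp the whole file
close $fh;

# split on whole separator lines only, so hyphens inside a record
# (e.g. logRecord-log) do not break it apart
for my $rec (split /^-{4,}[ \t\r]*\n/m, $str) {
    print $rec if $rec =~ /pat/;
}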

Annihilannic · October 30, 2008, 12:27am

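Because the awk script is between single quotes, the shell does not expand variables inside it, so awk receives the text \$GAA as part of the regular expression instead of the value you entered.

You can change it like this, which closes the single quotes around the variable so that the shell interpolates it: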

        /'"$GAA"'/ {

Or else pass it in as an awk variable:

awk -v SEARCHSTRING="$GAA" '
        ...
        $0 ~ SEARCHSTRING {
        ...
' inputfile > outputfile
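
Spelled out in full, the -v version might look like the sketch below. find_records is a hypothetical wrapper name; the awk body follows the script from earlier in the thread, with the return value of getline checked so the last line is not printed twice at end of file. Unlike the quote-splicing form, this also copes with a search string that contains a slash.

```shell
#!/bin/sh
# Hypothetical wrapper around the awk script from this thread.
#   $1 = search string (treated as an awk regular expression), $2 = input file
find_records() {
    awk -v SEARCHSTRING="$1" '
        # new record, reset array index
        /^----/ { i=0 }
        # accumulate record contents in array
        { a[++i]=$0 }
        # matching record; for a literal (non-regex) match,
        # index($0, SEARCHSTRING) could be used as the condition instead
        $0 ~ SEARCHSTRING {
            for (j=1; j<=i; j++)
                print a[j]
            # print the rest of the record, up to the terminator or end of file
            while ((r = getline) > 0 && $0 !~ /^----/)
                print
            # print the terminator only if one was actually read
            if (r > 0)
                print
            i=0
        }
    ' "$2"
}
```

In the interactive script this would be called as find_records "$GAA" "$inputfile" > "$outputfile".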

aismann · October 30, 2008, 5:38am

Thanks guys. Works perfectly.