data:
C812F5C9B 0818053014 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
C812F5C9B 0818054514 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
C812F5C9B 0818060014 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
C812F5C9B 0818061514 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
C812F5C9B 0818063014 P S hdisk45 SOFTWARE PROGRAM ABNORMALLY TERMINATED
i have a huge data that has an output similar to the above.
i'm using the following command to try to get rid of any duplicates:
sort -k 1,1 -k 2,2 -k3,3 -u
not enough dups are being eliminated. so i was wondering if you guys have a better approach to this.
i would like to sort and unique by the first 6 characters of the 2nd field.
the numbers in the second field mean:
let's use 0818063014 as an example:
08 = month
18 = day
06 = hour
30 = minute
12 = year
is what i'm trying to do possible?