I have a fixed width file with no separators , but the file has JUNK characters in it and i know how to remove it. but in few cases these junk characters having created a mess by splitting single row into 2 or 3 rows.
I need to put them back to a single line like rest of the rows. This fixed width file as rows which is of length 1-484.
The only way to identify that the row is spilt into 2 or 3 rows is by its length only and that too is not consistent.
Example :
Row expected :
0401000000030020170628000000710000366400201706280002750035*T000100N101892928550383900000009201 00000000000008402017062800000020 0000006435000000000000000840A1E098D09D9279BE4000561510A00003220000000000000000000000FF 05A17DD6318008000048000 83337602E0F8C80002420300 00000000000000 0000002278BF8AC19546B1EE0012 A0000000042203 A0000000042203 R 000000199551
Row with the issue : CASE1 :
0401000000030020170628000000710000365400201706280001041125
T000100N101909856755446700000018201 00000000000008402017062800000000 00000067540000000000000008402ED730917E9D1DC040000B0810A04003240000000000000000000000FF 083426C5739000000001000 83337602E0F8C800021E0300 00000000000000 00000022F85D9594E4FC84E2031A A0000000041010 A0000000041010 R 000000199442
Row with the issue : CASE2 :
0401000000480020170628000000830000656500201706280004755339
T000100N
0000034655000000000000000840A6258473ADBF88E24000740110A04001220000000000000000000000FF 06E09E43739000000008000 88131669E0F8C800021E0300 00000000000000 00000022B050CC3E5A75C2E90012 A0000000041010 A0000000041010 R 000002274242