Optimized Method

matrixmadhan · December 14, 2006, 8:23am

Hi All,

I have got two files.

File A with 50000 records and
File B with some 500 million records.

I need to extract the matching data (the records common to both files).

There must be many ways to do this, but the way I have is definitely not optimized and takes a long time to run.

What bothers me is the time it takes to run.
Is there an optimized way to do this?

Cheers!

sysgate · December 14, 2006, 9:00am

I would vote for Python as the best performer (based on my experience), but depending on what you need to extract, you may use awk or a simple grep.
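
For illustration, here is a minimal Python sketch of one common approach: load the smaller file (File A, about 50000 records) into a set, then stream the larger file (File B) once and keep only the lines found in that set. It assumes a record is a whole line and that "common" means an exact line match; the function name common_records and the file names in the usage note are hypothetical. If only one field is the key, the comparison would have to be adapted.

```python
import sys


def common_records(small_path, large_path, out=sys.stdout):
    """Write each line of large_path that also appears in small_path.

    Assumption: one record per line, matched by exact equality.
    Returns the number of matching lines written.
    """
    # Load the small file into a set so each lookup is O(1) on average.
    with open(small_path, encoding="utf-8") as small:
        wanted = {line.rstrip("\n") for line in small}

    matches = 0
    # Stream the large file line by line so it never has to fit in memory.
    with open(large_path, encoding="utf-8") as large:
        for line in large:
            record = line.rstrip("\n")
            if record in wanted:
                out.write(record + "\n")
                matches += 1
    return matches
```

Calling common_records("fileA.txt", "fileB.txt") would print the matching lines to standard output. Only the small file is held in memory and the large file is read exactly once, so the run time should be dominated by reading File B. For whole-line matches, grep -F -x -f fileA fileB is a comparable one-liner.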

Perderabo · December 14, 2006, 9:26am

See my suggestion in my last post in this thread (note that files A and B are reversed from what you have).