Optimized Method

Hi All,

I have two files:

File A with 50,000 records, and
File B with about 500 million records.

I need to extract the mapping data (the records common to both files).

There are surely many ways to do this :) but the one I have is definitely not optimized and takes a long time to run.

What bothers me is the time it takes to run.
Is there an optimized way to do this?

Cheers!

I would vote for Python as the best performer (based on my experience), but depending on what you need to extract, you could also use awk or a simple grep.
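
For what it's worth, here is a rough sketch of the usual Python approach. It makes some assumptions, because the thread does not describe the record layout: that a record is one line, and that the value to match on is the first whitespace-separated field. The function names and the `key_field` / `sep` parameters are made up for illustration. The idea is to load the keys of the small file (A) into a set, which fits in memory easily at 50,000 records, and then stream the big file (B) once, so B is never held in memory.

```python
def load_keys(small_path, key_field=0, sep=None):
    """Collect the key column of the small file into a set.

    Assumption: one record per line, key in column `key_field`
    (sep=None splits on any run of whitespace).
    """
    keys = set()
    with open(small_path, encoding="utf-8") as f:
        for line in f:
            parts = line.split(sep)
            if len(parts) > key_field:  # skip blank or short lines
                keys.add(parts[key_field])
    return keys


def common_records(small_path, big_path, key_field=0, sep=None):
    """Yield each line of the big file whose key also appears in the small file."""
    keys = load_keys(small_path, key_field, sep)
    with open(big_path, encoding="utf-8") as f:
        for line in f:  # streams the big file line by line
            parts = line.split(sep)
            if len(parts) > key_field and parts[key_field] in keys:
                yield line.rstrip("\n")


# Example use (the file names are placeholders):
# for record in common_records("fileA.txt", "fileB.txt"):
#     print(record)
```

Each lookup in the set takes roughly constant time, so the run time should be dominated by a single sequential read of file B rather than by repeated scans of either file.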

See my suggestion in my last post in this thread (note that files A and B are reversed from what you have).