Start process on X number of files and then wait for the next batch

Thanks for RudiC for his extraordinary help on organizing files in a batch of 10 using below code.

FL=($(ls)); 
for ((i=0;i<=${#FL[@]};i++)); do 
for j in ${FL[@]:$i:10};
do 
$batch  ${j}  ${j}.txt
done; 
echo "Pausing for next iteration";
echo "------------------------------------------------------------------------------------"; 
((i+=10)); 
done

Problem:

I want to run the batch on 10 files ( which takes a lot of time ) and then once the batch is finished , I want to execute the next iteration

if I do something like this,

for j in ${FL[@]:$i:10};
do 
$batch -in  ${j} -out  ${j}.txt &
done; 

the problem with this approach is that every batch process is sent to background and I can't find if there are any processes running in the background for the current session so that I can start the next session.

Alternatively, if I run like this, which is again a sequential approach and doesn't fulfill the requirement of parallel run.

$batch -in  ${j} -out  ${j}.txt && $batch -in  ${j} -out  ${j}.txt && $batch -in  ${j} -out  ${j}.txt && $batch -in  ${j} -out  ${j}.txt && $batch -in  ${j} -out  ${j}.txt && $batch -in  ${j} -out  ${j}.txt 

Regards,
NMR

Use the wait command to wait for all background processes to complete.

I suppose that there is a question about how do you want tit to run. Is it that you want them all in straight away but with no more that 10 running (presumably to manage the load/contention) or do you want to run in blocks of 10 where all 10 have to finish before you start the next block?

For instance if you have a job that just went to sleep for the number of seconds of it's job number, (1, 2, 3, 4, .... etc.) Would you want logic A or B?

Logic A

00:00:00 Start job 1
00:00:00 Start job 2
00:00:00 Start job 3
00:00:00 Start job 4
00:00:00 Start job 5
00:00:00 Start job 6
00:00:00 Start job 7
00:00:00 Start job 8
00:00:00 Start job 9
00:00:00 Start job 10
00:00:01 Ended job 1
00:00:02 Ended job 2
00:00:03 Ended job 3
00:00:04 Ended job 4
00:00:05 Ended job 5
00:00:06 Ended job 6
00:00:07 Ended job 7
00:00:08 Ended job 8
00:00:09 Ended job 9
00:00:10 Ended job 10
00:00:10 Started job 11
00:00:10 Started job 12
00:00:10 Started job 13
:
:
etc.

Logic B

00:00:00 Start job 1
00:00:00 Start job 2
00:00:00 Start job 3
00:00:00 Start job 4
00:00:00 Start job 5
00:00:00 Start job 6
00:00:00 Start job 7
00:00:00 Start job 8
00:00:00 Start job 9
00:00:00 Start job 10
00:00:01 Ended job 1
00:00:01 Started job 11
00:00:02 Ended job 2
00:00:02 Started job 12
00:00:03 Ended job 3
00:00:03 Started job 13
00:00:04 Ended job 4
00:00:04 Started job 14
:
:
etc.

Logic A will work if you are sure you have no other background processes and 'simply':-

for single_job in ${job_list}
do
   "${single_job}" &
   ((active_count=$active_count+1))
   if [ $active_count -ge 10 ]
   then
      wait                    # All background processes must end before we start the next block
      active_count=0  # Reset the counter
   fi
done

echo "Finished them in blocks or up to ten"

If you need logic B then you need to keep a track of them more carefully. You can use clever data holding processes to keep track of a specific process id, or if they will be unique enough, just count the running processes:-

for single_job in ${job_list}
do
   active_jobs=$(ps -f|grep -c slee[p])            #  This is the important bit!
   if [ $active_count -ge 10 ]
   then
      echo "All running at full capacity"
      sleep 1    # Pick a suitable check frequency
   else
      "${single_job}" &
   fi
done

echo "Finished them with up-to-ten running"

The important bit
You need to consider

  1. what you are searching for. Get it wrong and you might not submit anything or you might submit everything in one go.
  2. how to make sure you don't find the grep you are searching with as a process itself else you are using up one of your planned ten background jobs

For consideration a, my example I just use a sleep, so you can see it in what I've coded. You need to be sure that you find the right things and that your 'single jobs' don't actually spawn sub-processes of the same name else you will count them twice. You could look for sub-processes of your running script if that is important to you.
For consideration b, I have set the search with an expression by wrapping a character in [ & ] This means that the grep will expand this to any string matching the expression (for which there is only one option, of course) but crucially it will ignore itself, because the expression doesn't match the square bracket itself.

Do either of these approaches help?

Kind regards,
Robin

the exact narration is

execute 10 jobs in either background or foreground -- wait for all them to finish and then start the next 10 or whatever count is coming from the next batch of files.

I think I'll have to go with the Plan-B ( execute 10 jobs at a given time , and start a new if there are less than 10 jobs running ) --

but what if 2 or more background processes are finished before the sleep is timed out, You see your code is running only 1 job in the else case.

for single_job in ${job_list}
do
   active_jobs=$(ps -f|grep -c slee[p])            #  This is the important bit!
   if [ $active_count -ge 10 ]
   then
      echo "All running at full capacity"
      sleep 1    # Pick a suitable check frequency
   else
      "${single_job}" &                 <----- Should I do the calculation on this point as well , how many jobs are running and how many to start ?
   fi
done

echo "Finished them with up-to-ten running"

There is a command which does exactly what you want with one word. I suggested it before. To repeat:

Use the wait command to wait for all background processes to complete.

Well, that's is just totally confusing. How would you propose to start 10 in the foreground? There is a way to put a process into the background that opens a new terminal session and it then runs the command you want in the foreground there, but I doubt that is what is being requested. In any case, you have almost certainly chosen the opposite of the brief if you go for logic b.

You want logic a and the advice given by Corona688

Okay, perhaps I should have added wait before the echo "Finished them with up-to-ten running" to ensure they all finished. The loop for logic b will run and keep adding jobs up to the limit until there are none more left to run at which point the loop will end, but the jobs keep running to completion. There will always be a position where there are none left & fewer than 10 running.

Kind regards,
Robin

You could have GNU parallel deal with this for you, as long as it's available on you target system.

For example:

CMD=(J L M)
batch=echo
for j in ${CMD[@]}
do
   echo $batch -in  ${j} -out  ${j}.txt
done | parallel -j 10 --load 80% --noswap '{}'

The above will continue to start jobs (up to 10 at once) as long as total CPU load is below 80% and no swap in/out activity.

1 Like