Hi
I am using Cloud Foundry 231 and have created a BOSH release for a job that takes a long time (more than 5 minutes) to complete.
During bosh deploy, the job sometimes fails. According to the monit log, BOSH checks the pid every 30 seconds, reports an error, and retries after 10 seconds. After 6 or 7 retries, the job is marked as failed.
Shortly after the job is reported as failed, it actually completes successfully, and all of its processes are running on the BOSH VM.
So I increased the monit timeout for the process from the default 30 seconds to 90 seconds. BOSH now checks for the pid file every 90 seconds, but only three times (not 7 or 8 times as before), and then still reports failure.
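To make the change concrete, here is a minimal sketch of the kind of monit entry I mean, assuming the timeout is set on the start program line of the job's monit file. The job name myjob, the ctl script, and the paths are placeholders for illustration only, not the actual contents of my release.

```
# Hypothetical job name and paths, for illustration only.
check process myjob
  with pidfile /var/vcap/sys/run/myjob/myjob.pid
  # Raised from monit's default of 30 seconds to 90 seconds.
  start program "/var/vcap/jobs/myjob/bin/myjob_ctl start" with timeout 90 seconds
  stop program "/var/vcap/jobs/myjob/bin/myjob_ctl stop"
  group vcap
```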
Please suggest how I can configure monit on the BOSH VM in Cloud Foundry to wait until the job completes. Please note that this issue does not happen every time.
Regards
Nithiyasri