This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

OO Scheduled tasks stop running

Hi all,
I had a strange one today I've never seen before but am wondering if someone has.
So First to the symptoms
1) At 00:21 this morning all scheduled tasks just did not submit. In the OO Web UI you could see the time pass and the web UI would update the next run time correctly. Last run time remained the last time it actually ran.
2) A flow that was not scheduled BUT had parallel paths would run until the point of exiting the parallel path and then just stop. The flow would say running but it was going nowhere fast.
3) Manually running said flow in 2) via the WEB UI still hung at the same stage, you could try to cancel them but they ended up in pending cancel and the run time kept ticking up.
4) Other flows triggered manually via the UI or remotely kept running as per normal and ending in success.

Analysis :
I've been through every log I can find for OO and it's single RAS (which exists on the same VM as central itself).
There is nothing untoward at all.
The one reason I can pin it down to 00:21 for the time is there is one flow that causes a TLS error talking to an SMTP server. I have yet to track that one down but in this case it gave me a VERY good way to pin down exactly when it broke.
On a windows event log level I checked 1 hour either side of 00:21 for Application, Security, and System logs and could find nothing out of the ordinary.
On a VM level I checked vCenter 1 hour either side of 00:21 for events, tasks and just performance expecting to see a spike somewhere close. Nothing.
On an OO Database level checked the purge jobs to keep it in check were still running and pruning (which they were).
Checked disk space on OO server and the OO database server, nothing to give concern.
Once again there is nothing at all untoward in the OO or RAS logs to indicate ANY sort of issue.

Workaround:

For the cancel pending flows updated OO_EXECUTION_SUMMARY so the status was canceled, Result status type was NULL,Result status name was NULL and ROI was NULL
Restarted OO RAS and Central services on the OO server and all of a sudden it was fine with the world.
OO Central did not respond to the service restart very well and I had to kill the java process.

Any thoughts about this event would be appreciated.

Thanks,
Terry

  • Suggested Answer

    0  

    Customers reported flows not being triggered by scheduler once a while, but the root causes were different. If you encounter the issue again please open a ticket, support will work with you to figure out the root cause.