We've got a customer that is getting this error regularly in /var/log/messages and their Post Office crashes everytime. 2 weeks ago we migrated the Post Office from OES2sp3 / gw8.0.3 to oes11sp2 (64bit) / gw2012, everything ran smoothly for at least 3-4 days, then our current problems started. It is the error message that gets written to messages after the POA crashes. We have gw2012sp1 on oes11sp1 (64bit) (4GB RAM, 2 CPUs around 250 users) and plenty of space. Top shows that the system is not lacking in resources, etc... We have two Post Offices running on the server one is for users and the other is a library. The POA crashes with 50, 150 and 200 users connected. The users connect via C/S and are using gwclient8. After doing some tests it appears as if the POA crashes when a user tries to send an e-mail an all users.... The POA crashed yesterday and it appears as if the GWIA also "died" after the POA "died". When the POA "dies" (crashes), sometimes we can restart it with the rcgrpwise start po.dom command and sometimes that does not work. When it does not work we have to unload the running instance of gwpoa (the library post office, there are 2 instances of gwpoa running) then start the User PO and then restart the Lib-PO....then it is running again. They were several old mails in the wpcsout/problem directory which Ive "cleaned out", but the system has crashed since then.
Normally a Segmentation Fault means not enough resources are available for the POA (seems unlikely, maybe update to 8GB Ram. threads look ok.), or there is a bug, maybe a corrupt email or user database? But our gwchecks show nothing dramatic.
There have been Netwokr problems, a switch died last week but unfortunately I have no more info on this. We've added a secondary ip address to the server and have configured the poa to use it.
Well, any help would be greatly appreciated and thank you in advance for your assistance.