BES / Domino Shutting down

Hey Folks.

I have a BES with Lotus Domino on a Windows 2003 server. After a few days (or weeks), my BES (5.0.1.58) and my Domino instance (8.5.1) shut down for no apparent reason. The OS is still running.

I look at the logs and both shut down cleanly, not like a crash. I look in the Event Viewer for the time of the ‘crash’ and see:

Agent missed 2 heartbeats; restarting Domino

then

Shutting down BlackBerry Server

Any ideas?

Thanks

Justin

Subject: Resolved. I think…

After contacting BlackBerry support I think I have a resolution. Time will tell.

The BlackBerry Controller is designed to monitor the health of certain BlackBerry components. When a threshold is met, the BlackBerry Controller will request that a component restart. However, the key word here is restart. In your scenario, the component did not restart. This is a known issue and has been escalated internally (Issue Number: 442654). There are two workarounds in place to address this issue. Please see the following options.

  1. Start the Domino server as an application instead of a Windows service on the BlackBerry Enterprise Server. This is what I recommend because it still allows the BlackBerry Controller to function the way it is supposed to.

The below article from IBM describes how to configure Domino to be in application mode.

Link: www-01.ibm.com/support/docview.wss?rs=463&uid=swg21178948 (IBM - Change Domino to run as an Application after choosing to run as a Service)

  2. Prevent the BlackBerry Controller from restarting the BlackBerry Messaging Agent when a missing heartbeat occurs.

Please Note: Please back up your registry before making changes.

  1. On the computer that hosts the BlackBerry® Enterprise Server, open the Registry Editor.

  2. In the left pane, navigate to HKEY_LOCAL_MACHINE\Software\Research In Motion\BlackBerry Enterprise Server.

  3. Click Controller.

  4. Create a DWORD value that is named MissedHeartbeatThreshold.

  5. Double-click the new DWORD value.

  6. In the Value data field, type 0.

You can either increase the threshold value or set it to 0. Setting it to 0 means that the BlackBerry Controller will never restart the Domino server or Agent when heartbeats are missed. This can potentially be a problem in itself. Missing heartbeats can be caused by silent crashes, unresponsive threads, etc., which will affect message flow. This is why I recommend option 1.
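For anyone who would rather script the registry change in option 2 than click through the Registry Editor, here is a rough sketch that only generates the text of a .reg file for the value described above. The key path and value name are taken from the steps in the support reply; the helper name build_reg_file and its threshold parameter are my own and purely illustrative. It does not touch the registry itself, so the earlier advice still applies: back up the registry before importing anything.

```python
# Sketch only: builds the text of a .reg file for the MissedHeartbeatThreshold
# value described in the support reply. build_reg_file is a hypothetical helper
# name, not part of any BlackBerry or Microsoft tool.

# Key path from the steps above: the Controller key under the BES key.
CONTROLLER_KEY = (
    r"HKEY_LOCAL_MACHINE\Software\Research In Motion"
    r"\BlackBerry Enterprise Server\Controller"
)


def build_reg_file(threshold: int = 0) -> str:
    """Return .reg file text that sets MissedHeartbeatThreshold to `threshold`.

    Per the support reply, 0 means the Controller never restarts the agent
    on missed heartbeats; a larger value raises the threshold instead.
    """
    if not 0 <= threshold <= 0xFFFFFFFF:
        raise ValueError("a DWORD must fit in 32 unsigned bits")
    lines = [
        "Windows Registry Editor Version 5.00",
        "",
        f"[{CONTROLLER_KEY}]",
        # DWORD values in .reg files are written as 8 hex digits.
        f'"MissedHeartbeatThreshold"=dword:{threshold:08x}',
        "",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Print the file content; save it as e.g. heartbeat.reg and review it
    # before importing it on the BES machine.
    print(build_reg_file(0))
```

Passing a non-zero value, for example build_reg_file(10), produces the "increase the threshold" variant rather than disabling restarts entirely.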

Thank you again for contacting BlackBerry Customer Support.

Subject: BES/Domino shutting down

The error message does not appear to be Domino related. I searched the Lotus/IBM knowledge base and found no documents containing that error. I then searched Google and found several links to BlackBerry sites and forums. I would suggest speaking with RIM to see why this is happening.

Link with my search results:

http://www.google.com/search?q=agent+missed+two+heartbeats&ie=utf-8&oe=utf-8&aq=t&rls=org.mozilla:en-US:official&client=firefox-a

Hope that helps!