OS = NT
Server A = 6.5 (upgraded to 6.5.3 to see if it fixed anything)
Server B = 6.5.1
Server SMTP = 6.5.3 (outside the firewall)
We have entered the twilight zone… I would love some suggestions on what to look at.
We began having replication issues with remote sites: very slow, sporadic replication. It took a while to find the server, and then it would start replicating, freeze for 30 minutes or so, burst through some more data, then hang for a while longer. The database it hangs on is not always the same one, and the data in the database doesn’t seem to be the issue. We could run a replication immediately after a successful one and it would hang again, or sometimes it would go as normal. There is no consistent pattern. Replication within the office is not a problem.
Then we started having trouble delivering mail to one of our remote servers (Server B). We could receive mail from it but not deliver mail to it.
Then we could no longer open databases on the remote server. Notes finds the server and connects to it, but the database never opens. We can ping the servers all day long. (This was Server B; people on Server B could not open databases on Server A, but could open databases on Server SMTP.)
At one point, Server A stopped with a fault in nRouter. The offending thread appears to involve the mail.box on a different server outside the firewall (Server SMTP). We upgraded to 6.5.3 after that.
############################################################
FATAL THREAD 7/25 [ nRouter:0a50: 2180]
FP=0x0a62dd54, PC=0x60001263, SP=0x0a62dd4c, stksize=8
EAX=0x00000884, EBX=0x2b428944, ECX=0x00000000, EDX=0x00000000
ESI=0x000000e0, EDI=0x00000a50, CS=0x0000001b, SS=0x00000023
DS=0x00000023, ES=0x00000023, FS=0x00000038, GS=0x00000000 Flags=0x00010246
Exception code: c0000005 (ACCESS_VIOLATION)
############################################################
** VThread [ nRouter:0a50: 7]
.Mapped To: PThread [ nRouter:0a50: 2180]
… SOBJ: addr=0x0a736244, h=0xf0104029 t=c30a (BLK_LOOKUP_THREAD)
… SOBJ: addr=0x0a7b0904, h=0xf0104028 t=ca35 (BLK_TRACECONNECTION)
… SOBJ: addr=0x010ccfc0, h=0xf0104026 t=c130 (BLK_TLA)
… SOBJ: addr=0x07f21e6c, h=0xf0104027 t=c820 (BLK_CLIENT_OPENSESSION_TIME)
… Database: “Server SMTP”!!mail.box
… DBH: 701, By: “Server SMTP”
… Database: D:\Lotus\Domino\Data\mail.box
… DBH: 348, By: “Server A”
We see no unusual traffic on the network, and bandwidth appears fine. No changes were made to the Notes.ini file before the ‘collapse’, and we’ve checked the NICs. We’ve rebooted routers, put a VPN accelerator on the router, rebooted servers, and deleted mail.boxes. The only thing that has consistently worked is taking the server down, powering off the machine, and restarting.
Any ideas would be greatly appreciated.