We are getting ready to implement IBMs Content Collector (ICC) to archive e-mail that is over 1 yr old. This process places all e-mail into a searchable repository and deletes the e-mails from the user’s prod. mail file. Part of the archiving process requires that any e-mail restored by the end user gets deleted from the prod. mail file after 48 hours. Since the restored e-mail already exists in the repository we need to just delete it. The ICC process does not account for this requirement and IBMs solution was to provide us with an agent to add to and run in EVERY mail file on all of our servers. We have about 4500 mail files on 9 servers. Some of the mail files are very large in size (that will change once we begin the archive process). The agent uses db.Search to create a collection of restored e-mails and then uses ndc.RemoveAll to delete the collection of documents found by the search formula ( SELECTION_FORMULA$ = |(IBMAfuMessageState = “RSEARCH” & IBMAfuRestoreDate < @Adjust(@Now;0;0;0;-48;0;0;[InGMT] ))| ).
So my question is, if we add this agent to the mail template so that it runs in every mail file every night can you see any issues with server performance? Mail file count on each server ranges from 300 - 1000. How many mail files are on each server currently depends on the size of the mail file so we will likely more evenly distribute the mail files but will also reduce the number of servers.
I did run a test of the agent in 1 user’s mail file and it worked fine and ran rather fast ( ~10 seconds ) but the mail file was rather small and only had about 4000 docs in it. Even if the agent took only 10 seconds to run in each mail file it would still take about 3 hours to run the agent in all mail files on a server that has 1000 mail files and 1 agent manager running.
I have my doubts about an agent running in every mail file on every server every night but wanted to get some feedback from others. It also seems like a waste of time and resources because most users will not be restoring much e-mail so the agent search, in most cases, will return nothing and the agent will have run for nothing. I’m not looking for suggestions to change anything, just want some feedback on your thought regarding the agent and server performance. Thx!