This document explains some of the first and more common steps that can be taken to attempt to resolve a CA ARCserve RHA issue during troubleshooting.
So you have tried a couple of things (engine restarts, machine restarts, some log analysis, followed the cookie crumbs...) and the problem still exists and you are now having trouble with where to go next. Below are some initial options you have to attempt to quickly fix an issue (also known as shotgunning).
- Refresh the Engine and Control Service services
- By this you will remove any configuration sent to the engine host servers and get rid of anything that could possibly be hanging around causing the engine to not act as it should be.
- This process is the first line of defense in any RHA issue and situation where you don't know where to start.
- The engine refresh is outlined in the following KB article: First Step Troubleshooting for disconnections or hangs in synchronization
- Along with this procedure there are two other things that should be done along side of it to refresh the engine prior to restarting the engine service on the hosts.
- Within the scenario find the 'Spool' folder property for each host (Master, Replica, Replica-N...etc)
- Default location will be the [Engine Install Directory]\tmp which is C:\Program Files\CA\ARCserve RHA\engine\tmp
- With the Engine Service stopped, delete ALL contents of this directory
- Next rename the '[Engine Install Dir]\log' folder to refresh logging.
- Then start the engine service
- Note that a lot of times it is well worth performing all of these steps on all engine hosts prior to restarting any of the engine services
- On the Control Service server there are a couple of folders that can be changed to refresh both the GUI and the Control Service itself for troubleshooting
- The first is the 'ws_events' folder for the Control Service itself. This folder contains all event history up until the max size of file per scenario for easy repopulation of the events pane within the RHA Manager.
- Renaming the 'ws_events' folder while the Control Service is stopped will help to resolve some unexplained issues sometimes revolving around the GUI. You can find the procuedures in this KB article: Unable to view the Scenario in RHA Manager
- Another way to reset some of the GUI or Control Service issues is to reset the scenarios back to a default and then put them back in to be re-translated by the Control Service. This is done by manipulating the 'ws_scenarios' folder. This folder however is required to be in place so follow the below exactly for troubleshooting purposes:
- Close out of all RHA Managers (you may need to kill the 'ws_gui.exe' processes from task manager)
- Next rename the 'ws_scenarios' folder to 'ws_scenarios.old' within the Control Service installation directory. (Default location is C:\Program Files(x86)\CA\ARCserve RHA\Manager\ws_scenarios)
- Next create a new and empty 'ws_scenarios' folder within the Control Service installation directory.
- Start the CA ARCserve RHA Control Service service.
- Log into the overview page
- Open the RHA Manager
- (Don't be alarmed) You should not see any scenarios showing here now
- Verify functionality for your troubleshooting purposes
- Close the RHA manager
- Stop the Control Service
- Delete the 'ws_scenarios' folder that you previously created in step iii.
- Rename the 'ws_scenarios.old' folder back to 'ws_scenarios' folder
- Start the Control Service again and continue your testing
- Also the GUI sometimes needs a reset, this can especially happen during upgrades from older versions
- To refresh the GUI follow this KB article: The Arcserve RHA GUI is scrambled or collapsed and unreadable