Monday, June 16, 2014

EdgeSight Unrecoverable, fatal DB error

One error to watch out for on Citrix EdgeSight deployments is the one below.

Error:  An unrecoverable, fatal database error has occurred.  Shutting down the Citrix System Monitoring Agent. 

This error is generally a sign of local Firebird database corruption. This can happen for a variety of reasons: system power failure, hard drive corruption, or improper anti-virus exclusions, to name just a few.

If this error recurs frequently, take a look at the EdgeSight logs (SYS_EVENT_TXT) to see if a cause can be found.

Additionally, keep an eye on the local DB size. In my experience, corruption is much more likely when the DB regularly exceeds the size specified in the management infrastructure.
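
For anyone who wants to automate the size check, here is a minimal sketch in Python. The function name `db_exceeds_limit` and both parameters are my own illustration and not part of EdgeSight. You supply the path to the local database file and whatever size limit (in MB) is configured in your management infrastructure.

```python
import os


def db_exceeds_limit(db_path, limit_mb):
    """Return True if the file at db_path is larger than limit_mb megabytes."""
    size_mb = os.path.getsize(db_path) / (1024 * 1024)
    return size_mb > limit_mb
```

Run on a schedule, a check like this could flag agents whose local DB is regularly over the limit before corruption shows up.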

Finally, when corruption is the cause, the System Monitoring service will often fail to restart. To fix the error in that case, follow the steps below to clean the local database.

  1. Stop the Citrix System Monitoring service on the problem server.
  2. Rename the Firebird database file for EdgeSight (RSDATR.FDB).
  3. Restart the Citrix System Monitoring service.

The service should restart and recreate the FDB database. Yes, you will lose any data that was not previously uploaded to the primary infrastructure database.
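
The three steps above can be scripted. Below is a rough sketch in Python, not an official Citrix procedure. The function name `reset_local_database` is hypothetical, and the service name is a parameter because I am assuming you will pass whatever name the Services console shows on your server. It uses the standard Windows `net stop` and `net start` commands and must be run with administrative rights.

```python
import os
import subprocess
import time


def reset_local_database(service_name, db_path, run=subprocess.run):
    """Stop the service, rename the database file, start the service again.

    service_name and db_path are supplied by the caller (assumptions, not
    values taken from EdgeSight). Returns the path of the renamed file.
    """
    # Step 1: stop the service. No check here, because the service is
    # often already stopped when the database is corrupt.
    run(["net", "stop", service_name], check=False)

    # Step 2: rename rather than delete, so the old file can be inspected later.
    backup_path = f"{db_path}.{time.strftime('%Y%m%d-%H%M%S')}.bak"
    os.rename(db_path, backup_path)

    # Step 3: start the service; it should recreate the database file.
    run(["net", "start", service_name], check=True)
    return backup_path
```

The `run` parameter only exists so the function can be exercised without touching a real service. In normal use, leave it at its default.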

Not sure where the Firebird database is? (This is common on PVS-deployed devices with a redirected DB.) Check the DataPath value in this registry key:
HKEY_LOCAL_MACHINE\SOFTWARE\Wow6432Node\Citrix\System Monitoring\Agent\Core\4.00
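
If you would rather read the value from a script, here is a small sketch using Python's standard `winreg` module (Windows only). The function name `read_data_path` is my own. The key path and value name are the ones given above.

```python
KEY_PATH = r"SOFTWARE\Wow6432Node\Citrix\System Monitoring\Agent\Core\4.00"


def read_data_path(key_path=KEY_PATH, value_name="DataPath"):
    """Return the DataPath value from the EdgeSight agent registry key."""
    import winreg  # imported here because the module exists only on Windows

    with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, key_path) as key:
        value, _value_type = winreg.QueryValueEx(key, value_name)
    return value
```

Calling `read_data_path()` on the problem server returns whatever is stored in DataPath. It raises `FileNotFoundError` if the key or value is missing.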
