Hi All,
We run our C product on many Verari x86_64 Linux boxes running RHWS 4, but recently some of the boxes have hung or crashed and we had to restart them.
We looked for core files and checked the system logs, but could not find anything that explains the crashes.

Do you know of any method or tool that can monitor the system and alert us when something goes wrong, or that can audit the system so that we can find the cause from the audit files even after a restart?