health metrics


if manage large infrastructure >100 windows servers, kinds of metrics monitor ensure know of issues/potential issues , when arrive, before affect performance availability etc. know free disk space common one, assume there must many more.. monitor?

personally, monitor following aspects

  • cpu load
  • memory usage
  • network utilisation
  • service state ( critical services )
  • machine reset data ( unexpected reboots, manual reboots )
  • disk space utilisation
  • change in h/w configuration ( vms )
  • access ( make sure no unauthorised user allowed log on server )
  • patch updates ( make sure windows patches being updated regularly , on time )
  • av definitions ( make sure av definitions date )
  • pending reboots after patch installation 

regards, santosh

not represent organisation work for, opinions expressed here, own , posted is.




Windows Server  >  Windows Server General Forum



Comments

Popular posts from this blog

some help on Event 540

WMI Repository 4GB limit - Win 2003 Ent Question

Event ID 1302 (error 1307) DFS replication service encountered an error while writing to the debug log file