Unexplained Crashes- I've trried everything... Server 2012 RS Standard w/Essentials Role


hello everyone,

my name jessie , looking troubleshooting server. running microsoft server 2012 r2 essentials experience. on last year, have suffered many server failures , crashes. i’m writing details here in hopes more experience may direct me in right way can solve hideous problem having. so, here details….

server info:

  • dell poweredge r710 server (running latest firmware/bios) 6 drive bays (4 disk raid 10 server os, 2 disk raid 1 array client backups , internal company docs – drive bays occupied)
  • 2 intel xeon l5640 @ 2.27ghz cpu’s
  • 4 broadcom netextreme ii gige nic’s, “teamed” 1 interface
  • 12 – 8gb ram sticks installed (12 of 18 slots occupied) = 96gb total ram
  • perc h700 raid controller (embedded) running latest firmware)
  • perc h800 raid adapter controller (on pci slot 3 - riser) (running latest firmware)
  • md1200 drive enclosure attached server via perc h800 adapter 12 drive bays. 12 disks in raid 6 array total of ~29 tb.
  • 2 seagate 4 tb external drives offsite backups connected via usb 2.0
  • 2 seagate 8 tb external drives offsite backups connected via usb 2.0
  • cyberpower ups 1000w (sine wave)
  • 10 client computers managed through server essentials experience role

here’s story…

our server crashing without warning. shuts down if held power button, reboots errors , broken services on place! on occasion, have managed see pattern these shut downs, typically occur during file sync or backup task. days neither of these tasks cause problem, , on other days, suffer many 6 shut downs in day.

i have tried numerous troubleshooting tools, including dell’s tools, hd sentinel hdd testing, memory testing, graphics testing, load testing, , on. i’ve reviewed hours-worth of logs , cruised every forum can think of advice.

after exhaustive troubleshooting, decided replace few parts in server narrow down culprit. have received 2 error codes front led on server: error e1410 fatal err , e171f fatal err, indicate pci error , cpu error, throws these 2 errors after crash. if shut down, unplug , restart both errors go away , not reproduce until crash occurs. have rebuilt server ground up, including clean install of server 2012 r2. here list of hardware have replaced:

  • server motherboard
  • 2 cpu’s
  • new perc h700 battery (that stuck in “learning” cycle)
  • 2 new pci riser cards perc adapters
  • new perc h800 adapter card new perc battery
  • 4 new hdd in md1200 raid 6 array
  • new 1000w ups

i have logs , screen shots of server on hand can help! please if can, i’m @ complete loss try (other buying new server, not in budget). l

hi,

according description, understanding – there failure/crash problem on windows server 2012 r2 essentials experience role installed. 

check event viewer see if related event has been logged, helpful narrow down problem. 

make sure windows server has been patched windows update/hotfix. might helpful resolving known issues , improving performance.

crash related problem, dump file better choose further identify problem. below articles can considered reference.

steps catch simple “crash dump” of crashing process:
https://blogs.msdn.microsoft.com/chaun/2013/11/12/steps-to-catch-a-simple-crash-dump-of-a-crashing-process/

how read small memory dump file created windows if crash occurs:
https://support.microsoft.com/en-us/kb/315263

best regards,
eve wang

please remember mark replies answers if , unmark them if provide no help. if have feedback technet support, contact tnmff@microsoft.com.



Windows Server  >  Windows Server 2012 Essentials



Comments

Popular posts from this blog

some help on Event 540

WMI Repository 4GB limit - Win 2003 Ent Question

Event ID 1302 (error 1307) DFS replication service encountered an error while writing to the debug log file