I took on a customer that has been running into this error for about 4 months now. They have an SBS 2011 server and a mixed environment of Windows 7 and Windows XP PCs; however, all the users running Sage are on Windows 7.
So far their HP switch has been replaced, and power management settings have been checked on the server and a workstation.
Server is an HP Proliant ML350 G6
NIC HP NC326i PCIe Dual Port – driver 220.127.116.11 (10 Feb 2012)
Update NIC to 18.104.22.168 (18 Feb 2014)
Questions to Ask
- When did the issue start? Answers varied: March 13th, end of February, beginning of March (March 1)
- Who is affected, do they have logs of when it happens and where?
- Can they recreate the issue? No, but computers get really slow before it happens
- Are the latest patches installed for Sage?
- How old is their network switch, are they running gigabit?
- When does the issue happen most often?
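One way to answer the "who" and "when" questions above is to tally the users' incident reports. Below is a minimal sketch in Python, assuming the reports have been collected as (timestamp, workstation) pairs. The timestamp format, the workstation names, and the sample entries are placeholders for illustration, not data from this customer.

```python
from collections import Counter
from datetime import datetime

def tally_incidents(incidents):
    """Count incidents per hour of day and per workstation.

    incidents: iterable of (timestamp_string, workstation_name) pairs,
    with timestamps in "YYYY-MM-DD HH:MM" form (an assumed format).
    """
    by_hour = Counter()
    by_workstation = Counter()
    for stamp, workstation in incidents:
        when = datetime.strptime(stamp, "%Y-%m-%d %H:%M")
        by_hour[when.hour] += 1
        by_workstation[workstation] += 1
    return by_hour, by_workstation

# Placeholder reports for illustration only (hypothetical names and times).
sample = [
    ("2014-04-24 09:15", "PC-A"),
    ("2014-04-24 09:40", "PC-B"),
    ("2014-04-25 14:05", "PC-A"),
]
by_hour, by_workstation = tally_incidents(sample)
print("Busiest hour:", by_hour.most_common(1))
print("Most affected workstation:", by_workstation.most_common(1))
```

If most reports cluster around one hour or one workstation, that narrows down which of the checks below to try first.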
Things to check/try
- List of users having the issue, all of them?
- Go through event logs on server on 4/24/2014 to see what changed
- Scan server for viruses and malware
- Check firewall settings – firewall was on, disabled it
- Wireshark capture
- Update NIC drivers on server ML350 G6
- Disable HP power management on server, set maximum performance and static high performance, disable all Cstates
- Check for rogue hub or other network device
- Check speeds on NICs, auto negotiate?
- Test cables with Fluke for EMI
- Switch server to use its other NIC
- Disable interrupt moderation on NIC
- Backups only run at 6AM and 6PM, would not cause issues during day
- Disable shadow copies
- Run MalwareBytes to check for malware
- No Windows Updates were installed around the time issue started happening
- Update BIOS, from 5/5/2011 to latest version, 7/2/2013
- Balanced Power and Performance, Dynamic Power Savings was set in BIOS
- Changed above settings to maximum performance and disabled all Cstates
- Disabled Flow Control on NIC, was at TX and RX
- Priority and VLAN disabled, was enabled
- Speed and duplex set to 1000Mb Full duplex, was set on auto negotiate
- HP V1810 J9660A 48G switch was running firmware 1.15 which is known to cause switch lockups. Upgraded switch firmware to 1.18
- Run HP Service Pack for Proliant (SPP) on server to upgrade firmware and BIOS of server
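Since the workstations reportedly get slow before the error appears, a timestamped connectivity log run from an affected workstation may help line up slowdowns with server or switch events. Below is a rough sketch in Python, not something that was run at this site. It assumes the server accepts TCP connections on port 445 (SMB); the host name `sbs-server`, the port, and the probe interval are placeholders to adjust.

```python
import socket
import time
from datetime import datetime

def connect_time(host, port, timeout=2.0):
    """Return seconds taken to open a TCP connection, or None on failure."""
    start = time.perf_counter()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return time.perf_counter() - start
    except OSError:
        # Covers timeouts, refused connections, and name lookup failures.
        return None

def log_probe(host, port, count, interval=5.0):
    """Print one timestamped line per probe and return the list of results."""
    results = []
    for i in range(count):
        elapsed = connect_time(host, port)
        status = "FAILED" if elapsed is None else f"{elapsed * 1000:.1f} ms"
        print(f"{datetime.now():%Y-%m-%d %H:%M:%S} {host}:{port} {status}")
        results.append(elapsed)
        if i < count - 1:
            time.sleep(interval)
    return results

# Example (hypothetical host name): probe every 5 seconds for about an hour.
# log_probe("sbs-server", 445, count=720)
```

Connection times that climb, or FAILED lines, shortly before a user reports the error would point toward the network path or the server NIC rather than Sage itself.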