Networking

ESXi 5x 集群硬體故障場景

  • August 28, 2013

你好,工程師們。

我有一個帶有 3 個 ESXi 主機的 ESXi5.0 集群設置。現在我需要為網路硬體故障創建一個測試案例,並在數據中心進行測試。

My Setup:

   1) 3 DELL R820 Servers (all identical in the configuration and hardware)

   2) PHYSICAL: Pair of 1GB ports for vSphere Management Network (active/standby)
      VIRTUAL: 1 VMkernel Port vmk0 on standard vSwitch0

   3) PHYSICAL: Pair of 10GB ports for regular network communications between guests MESH(active/active using IP Hash load balancing connected to the redundant switches) 
      VIRTUAL: dvSwitch0 with exposed and needed VLANs.
   4) PHYSICAL: Pair of 10GB for storage NFS/VMDK (active/passive, Failover Only with "Link Status Only" network failure detection connected to different switches)
      VIRTUAL: 1 VMkernel port vmk1 connected to distibuted switch dvSwitch01
   5) PHYSICAL: Pair of 10GB for storage (guest initiated) (active/active, load balancing is based on Port ID with "Link Status Only" network failure detection connected to different switches)

HA and DRS enabled.

我打算只做定期拉線測試,但可能會遺漏一些因素。我將不勝感激執行此類測試的任何建議和/或最佳實踐。

- 關閉主機。- 測試高可用性和准入控制。

- 關閉開關。- 測試故障轉移連結。

- 獨立斷開數據和儲存網路電纜。- 測試彈性、負載平衡和數據儲存心跳/主機隔離狀態。還有儲存控制器故障轉移。

當我們測試故障場景時,我們首先移除單個電線/光纖,然後是整個 NIC/HBA,然後是伺服器,然後是交換機 - 即從小到大 - 僅僅是因為如果平台不能處理小問題,那麼對大數據進行測試將毫無意義.

也就是說,我看不出您的設置有任何問題,而不是您已經解釋過的問題。

引用自:https://serverfault.com/questions/534533