In this article, we invite you to see how we designed a high-availability electrical and Ethernet device for a customer in the field of cybersecurity.
We are going to detail each step from the design to its realization in a Rack Matrix model M1 1U rackmount case.
The entire software management part of failure detection and switching from one motherboard to another is not covered here, because it depends heavily on the software and operating systems implemented on the motherboards and how you want to handle it.
The goal is to ensure optimal service continuity of the device to the maximum of its capacities and for as long as possible. The duration of service continuity strongly depends on the source of the failure.
Thus,
Have the operation of the device ensured regardless of the following events:
Here is the final outline of the design with the above prerequisites
Here is the 1U router used as the basis for our design.
Here we present the stages of the design responding to each point mentioned in the prerequisites above.
To overcome the failure of a motherboard, the solution is to add a second. The 2 cards are connected to the same transformer. This involves doubling the power of the transformer to be able to power the 2 motherboards at the same time.
To overcome this failure, we add an Ethernet interconnection between the two motherboards of the previous solution #1.
The connection being located internally (inside the box) this has the advantage of not being able to be manipulated or removed by a third person (the customer or an employee) and at the same time there is no unnecessary cable on the front of the router.
to overcome this failure, we add a transformer to the previous solution #2 to power the second motherboard.
If a motherboard's power supply fails, the standby motherboard detects it because there is no more response to ping or any other mechanism from the more powered motherboard.
Ethernet interconnection between the two motherboards (white cable)
To overcome this unlucky scenario, we add a redundancy module between the two transformers and the two motherboards from the previous solution #3.
To compensate for the total power failure on the 2 mains sources (really bad luck), we are adding an internal battery for each motherboard to the previous solution. Thus depending on the age of the battery and the consumption of the motherboards with their options, the duration of the power supply can be from 20 minutes to more than 2 hours. This is more than enough for the cases of micro cuts (from a few seconds to 5 min) which are the most common, or even gives the motherboard equipped with a 4G modem the time to send the information of the total cut.
Here is the final result for the high-availability router from the company WaWuWe