What is MC-LAG?

June 12, 2023

Whether it’s a conventional data center or a modern cloud-based one, a shared necessity exists: high availability.

Concurrently, a mutual challenge persists: the potential for a single point of failure within the network. Physical networking aimed to establish highly reliable connections between devices by employing LAG (Link Aggregation) technology. This ensures high availability of links between servers and switches or between switches themselves. However, LAG technology is not without its flaws and cannot entirely eliminate equipment failure risks.

Consequently, there’s a need for technology that expands the original singular connections between devices to multiple connections, while still guaranteeing multi-device redundancy and multi-link redundancy. Moreover, this technology must ensure robust intercommunication between devices. Enter MC-LAG or M-LAG(Multi-Chassis Link Aggregation), the focal point of this article. As an extension of LAG, it is crucial to comprehend the workings of LAG prior to understanding MC-LAG’s principles.

What is LAG (Link Aggregation group)

Link Aggregation group, often referred to as Port Channel, is an innovative technology that combines multiple physical connections into a single logical port at the data link layer. This process effectively increases bandwidth and enhances link redundancy. When utilized for connecting servers and switches, the physical topology can be observed in the illustration below.

In order to facilitate dynamic aggregation of physical links, Link Aggregation requires the implementation of LACP (Link Aggregation Control Protocol) between two devices for seamless negotiation.

what is LACP (Link Aggregation Control Protocol)

LACP(Link Aggregation Control Protocol) is an integral part of the IEEE 802.3ad standard and functions as a method to manage combining several physical ports into one cohesive logical channel. This protocol enables a network device to automatically form bundles of links by sending LACP packets to its corresponding device.

LACP provides multi-link redundancy through multi-link load balancing executed by hashing information such as quintuples of packets. As a result, all link bandwidths can be efficiently utilized. Once the aggregated link is established, LACP diligently maintains the link state and auto-adjusts or dissolves the aggregated link should any changes in aggregation conditions occur.

LAG offers redundancy between one-to-one devices by allowing for an uninterrupted flow of data even if a physical link in the aggregated group disconnects. However, it doesn’t address data path interruptions caused by switch downtime. In response, MC-LAG was developed to solve this issue.

What is MC-LAG(mlag)?

MC-LAG extends the original link aggregation technology from one-to-one devices to one-to-many devices, as depicted in the illustration below.

With node redundancy in place, traffic is appropriately load-balanced between two switching devices using a hash algorithm. Additionally, MC-LAG has a built-in anti-loop mechanism, eliminating the need for complex STP protocols or Layer 3 routing and forwarding configurations. This simplifies network configuration complexities.

MC-LAG relies on the LACP protocol’s working mechanism to achieve cross-device link redundancy. When negotiating with two switches, it needs to present itself as a single device in the LAG scenario. To accomplish this illusion, the system IDs of LACP packets within the cross-device redundant links must match; that is, SwitchA’s system ID during negotiation with the server must be identical to SwitchB’s system ID. This concept forms the foundation for MC-LAG’s ability to implement cross-device link aggregation.

MC-LAG Failure Scenario&Traffic Forwarding Paths

As depicted in the diagram below, ServerA and ServerB are linked to the access switch via MC-LAG. In normal circumstances, traffic transmitted from ServerA to ServerB follows the forwarding path indicated by the green line, which defaults to Switch A.

If the connection between ServerB and SwitchA fails, either due to the physical port linking ServerB to SwitchA or the physical port linking SwitchA to ServerB, the forwarding path of the traffic from ServerA to ServerB changes, represented by the brown line.

If SwitchA encounters a device failure, SwitchB takes over the forwarding of traffic from ServerA to ServerB, following the route demonstrated by the red line.

If a PeerLink fault occurs, the traffic sent from ServerA to ServerB remains forwarded via the green path.

application scenarios of MC-LAG

Networking Solution 1: MC-LAG At the Aggregation Layer :Through the cross-device port virtualization technology (MC-LAG), the network logic between the aggregation layer and the access layer switch is realized without looping, replacing STP. Compared with the traditional STP breakpoint protection, this design has a clearer logical topology and more efficient link utilization. For MC-LAG paired devices, the control plane and management plane are independent, and only the protocol plane is coupled. In theory, the reliability is higher than that of stacking. It also provides the ability to upgrade devices independently, bringing convenience to maintenance.

Networking Solution 2: Establishing MC-LAG at the Access Layer:The same MC-LAG technology is applicable to the application scenario where dual network cards of the server require active-active access. The server is dual-active connected to the two NICs to share the MAC. Dual NICs implement a flow-based load sharing strategy. Therefore, configure the port connected to the server as a member port of MC-LAG through MC-LAG, and the MAC and ARP entries of the two ports will be synchronized in real time.

As a cross-device link aggregation technology, MLAG not only has the advantages of increasing bandwidth, improving link reliability, and load sharing, but also has the following advantages:

Enables simpler network design: With MLAG, multiple switches can be treated as a single logical device, simplifying the network design and reducing management complexity.
Provides faster failover and better network stability: MLAG provides rapid failover and increased network stability, reducing downtime and improving overall network performance.
Offers more flexible deployment options: MLAG can be deployed at different layers of the network stack and is compatible with different devices from different vendors, making it a more flexible option than some other link aggregation technologies.
Improves scalability: MLAG can improve the scalability of network links, allowing for the addition of more devices and increased bandwidth without sacrificing network performance.

Overall, MLAG is a powerful technology that offers a range of benefits for network optimization, including increased bandwidth, improved link reliability, load sharing, faster failover, and greater network stability. By providing a more flexible and scalable option for link aggregation, M-LAG can help organizations to build more efficient and reliable networks.

About MC-LAG Configuration on Asterfusion Enterprise SONiC Distribution Switch

MC-LAG Configuration on Asterfusion Enterprise SONiC Distribution Switch

Re-engineering Cloud Network with Disaggregated yet Turnkey Switching Solutions

Low Latency Data Center Switch

Campus Access and Aggregation

Enterprise Ready SONiC NOS

Optical Transceiver

P4-Programmerable Bare Metal Switch

ARM64 Open Network Appliance

Network Packet Broker

What is MC-LAG?

Table of Contents

What is LAG (Link Aggregation group)

what is LACP (Link Aggregation Control Protocol)

What is MC-LAG(mlag)?

MC-LAG Failure Scenario&Traffic Forwarding Paths

application scenarios of MC-LAG

About MC-LAG Configuration on Asterfusion Enterprise SONiC Distribution Switch

Latest Posts

Unlock the Power of Campus Network with Asterfusion SONiC Gigabit Switches

Asterfusion SONiC based 32 x 400G QSFP-DD Data Center Switch Overview

Asterfusion SONiC NOS based 48x1G RJ45 PoE+ layer 3 Enterprise Switch Overview

AsterNOS: Enterprise-ready SONiC NOS for Cloud, AI and Campus

Low Latency Data Center Switch

Campus Access and Aggregation

Enterprise Ready SONiC NOS

Optical Transceiver

P4-Programmerable Bare Metal Switch

ARM64 Open Network Appliance

Network Packet Broker

Data Center

Enterprise

Carrier Network

What is MC-LAG?

Table of Contents

What is LAG (Link Aggregation group)

what is LACP (Link Aggregation Control Protocol)

What is MC-LAG(mlag)?

MC-LAG Failure Scenario&Traffic Forwarding Paths

application scenarios of MC-LAG

About MC-LAG Configuration on Asterfusion Enterprise SONiC Distribution Switch

Latest Posts

Unlock the Power of Campus Network with Asterfusion SONiC Gigabit Switches

Asterfusion SONiC based 32 x 400G QSFP-DD Data Center Switch Overview

Asterfusion SONiC NOS based 48x1G RJ45 PoE+ layer 3 Enterprise Switch Overview

AsterNOS: Enterprise-ready SONiC NOS for Cloud, AI and Campus